Journal of Research in Pharmacy Practice

: 2020  |  Volume : 9  |  Issue : 1  |  Page : 24--29

Adaptation and validation of the screening tool of older people's prescriptions instrument for the Indonesian population

Siti Fauziyah1, Retnosari Andrajati2, Ratu Ayu Dewi Sartika3, Maksum Radji2,  
1 Department of Pharmacy, University of Indonesia, Depok; Department of Pharmacy, Dr. Mintohardjo Navy Hospital, Jakarta, Indonesia
2 Department of Pharmacy, University of Indonesia, Depok, Indonesia
3 Department of Nutrition, University of Indonesia, Depok, Indonesia

Correspondence Address:
Prof. Dr. Maksum Radji
Department of Pharmacy, University of Indonesia, Depok
Mrs. Siti Fauziyah
Department of Pharmacy, University of Indonesia, Depok; Department of Pharmacy, Dr. Mintohardjo Navy Hospital, Jakarta


Objective: In this study, we aimed to prepare and validate an Indonesian version for the Screening Tool of Older People's Prescriptions (STOPP), which is an instrument to identify inappropriate medications for elderly patients. Methods: The Indonesian version of STOPP (STOPP_INA) was developed using modified transcultural adaptation guidelines from the American Academy of Orthopedic Surgeons. Our method consisted of translating original STOPP into Indonesian (forwardly translation), synthesis of forward translation, translation into English and synthesis of back translation, a review by the copyright holder of STOPP, a review by the expert team, pretest, revision of STOPP_INA, field test, and psychometric analysis of the final version of the questionnaire. The study design for this part was quasi-experimental with purposive sampling for members of the translator's team, expert's team, and respondents in the pretest, but they were different from field testing that used purposive and postsurvey sampling for respondents. Content validity and face validity were used to construct the validity of STOPP_INA by assessing item-level content validity and correlation between items and total values. Internal consistency was measured with Cronbach's alpha coefficient. Findings: The expert panel agreed on a list of 81 criteria. Five (62.50%) of expert team members agreed and could be continued to the field test without revision of STOPP_INA and 3 (37.50%) agreed with a revision. The research subjects in the psychometric test had 230 respondents, 5 (2.17%) resigned, with an average of item-level content validity index of 0.99. The construct validity analysis showed that 5-item criteria are “not valid,” namely in A1, A3, B7, B10, and C3. Reliability analysis showed the Cronbach's Alpha and Cronbach's Alpha Based on Standardized Items were 0.978 and 0.979. Conclusion: The expert team was be agreed on 81 criteria (100%) of adaptation of STOPP version 2 criteria. There were 5 criteria that not valid statistically, they could not be removed from the instrument because they can influence content and construct of the instrument. The STOPP_INA has been developed for the Indonesian population, currently being tested in clinical practice against elderly patients undergoing hospitalization.

How to cite this article:
Fauziyah S, Andrajati R, Sartika RA, Radji M. Adaptation and validation of the screening tool of older people's prescriptions instrument for the Indonesian population.J Res Pharm Pract 2020;9:24-29

How to cite this URL:
Fauziyah S, Andrajati R, Sartika RA, Radji M. Adaptation and validation of the screening tool of older people's prescriptions instrument for the Indonesian population. J Res Pharm Pract [serial online] 2020 [cited 2020 Jun 4 ];9:24-29
Available from:

Full Text


In 2014, the prevalence of morbidity for the elderly in Indonesia reached 25.05%, and 66.01% of them consumed medicines.[1] A change and decrease in various physiological, hormonal, and organ functions could result with increasing age. This caused susceptibility the body to disease. The ageing process in the elderly also resulted in changes in body composition, pharmacokinetic and pharmacodynamics that could increase sensitivity to certain drugs.[2],[3]

Multimorbidity and the use of large amounts of medicines caused potentially inappropriate medications (PIM), polypharmacy,[4] hospitalization, adverse drug reactions,[5],[6] and fall in elderly patients.[7],[8] This resulted in an increase of treatment costs.[9] An effort to reduce the use of inappropriate drugs was by providing clinical guidance through the development of explicit treatment criteria because it will benefit practitioners in providing the best care for patients according to the latest evidence and giving an assurance of health for patients.[10],[11] Some instruments have been developed in various countries, one of which was the Screening Tool of Older Persons' Prescriptions (STOPP) and the Screening Tool to Alert Doctors to Right Treatment (START) criteria. The STOPP/START criteria were developed in Ireland and the United Kingdom in 2008 and revised in 2014.[12],[13] The STOPP/START instrument was validated using the Delphi method by 20 expert team members.[13]

The identification of appropriate medications in elderly patients was critically important because they were susceptible to diseases.[1] STOPP had been used in several studies in Indonesia[6],[14] but was never adapted to Indonesian. At present, Indonesia does not have instruments yet that function the same as STOPP. Therefore, it was necessary to develop an instrument for identifying inappropriate medications to the Indonesian version (STOPP_INA). This study aimed to adapt the English version of STOPP to Indonesian culture and to measure the validity and reliability of the instrument. The development of the instrument was carried out through the adaptation process of the STOPP version 2 criteria. The questions in this study are: was STOPP version 2 adaptation in Bahasa Indonesia acceptable? Was the STOPP_INA valid and reliable as an instrument of the identifier inappropriate medications in elderly patients?


The adaptation of the STOPP version 2 criteria got permission from Denis O'Mahony as the copyright holder and ethical clearance from the Ethics Committee of the Faculty of Medicine, The University of Indonesia (No. KET-850/UN2.F1/ETIK/PPM.00.02/2019). Informed consent was given to subjects before participating. The research was conducted on a multicenter of Indonesian hospitals.

The STOPP consisted of 81 criteria which were grouped into 13 sections such as A: Indications of the drug (A1–3); B: Cardiovascular system (B1–13); C: Coagulation System (C1–11); D: Central nervous system (D1–14); E: Renal system (E1–6); F: Gastrointestinal system (F1–4); G: Respiratory system (G1–5); H: Musculoskeletal system (H1–9); I: Urogenital system (I1–2); J: Endocrine system (J1–6); K: Medications that are predicted to increase the risk of falls (K1–4); L: Analgesic medicine (L1–3); and M: Antimuscarinic/anticholinergic drugs (M1).[13]

The adaptation of the STOPP was developed with the cross-cultural adaptation guidelines from the American Association of Bone Surgeons Committee.[15] The prose stage consisted of translation of the original STOPP into Indonesian language (forward translated), synthesis of forward translation, back translation into English, synthesis of back translation, a review from the copyright holder of STOPP, a review by the expert team, pretest, revision of STOPP_INA, field test, and psychometry of the final version of the questionnaire.[16]

Translation of STOPP (forward and back translation) involved 5 translators. They were independent translators, didn't know each other, were fluent in Indonesian and English and had different scientific backgrounds. The synthesis of the forward translation and back translation was undertaken by the researcher and translators through confirmation and discussion for the differences of meanings. Each correction of translation was recorded as data in the translation process. The stage of adaptation of instrument is presented as follows: [Figure 1].{Figure 1}

The questionnaire was presented in a paper format with two choices, namely: “agree” if the statement was relevant and “disagree” if it was not relevant. The papers were sent to members of the expert team for initial reviewing, followed by an expert panel, and were sent back to review more. The composition of this expert team consisted of two geriatricians, one pharmacologist, one endocrinologist, one cardiologist, one neurologist, one clinical pharmacist, and one linguist who was also a translator member. They were assessed using three feasibility options, namely: 1 = “the instrument could be used to testing without revision,” 2 = “could be used to testing with revision,” and 3 = could not be used to testing. The study design in the pretest was a quasi-experimental study with the test–retest method. Respondents were pharmacists who met the inclusion and exclusion criteria and were chosen using a purposive sampling technique in March 2019. The minimum number of the needed subjects was 30 respondents.[15] Eligible participants of the study were hospital pharmacists, who served in pharmaceutical care for >1-year, from secondary or tertiary hospitals in Indonesia, and were willing to be respondents in this study. Hospital pharmacists who served in managerial pharmacy or served outside the hospital pharmacy installation were excluded. Respondents completed the paper of our self-administered questionnaire, which consisted of sheets of informed consent, demographic characteristics, STOPP_INA paper, and an opinion form. The STOPP_INA was presented in five Likert scales: 1 = “strongly disagree,” 3 = “don't know,” and 5 = “strongly agree.”

The design of this part of our study was quasi-experimental with the one-shot method. The data were taken using a purposive sampling technique through survey post in July–October 2019. Respondents were a pharmacist who required of the inclusion and exclusion criteria, such as pretest respondent qualification. The minimum needed subjects were 220 respondents at a significance level of 95% (d = 0.05) and proportion (P) 80%.[17] Respondents completed a questionnaire paper that consisted of informed consent, demographic characteristics, and a final STOPP_INA which were obtained in four Likert scales, 1 = “strongly disagree” and 4 = “strongly agree.”

We used the IBM® SPSS® Statistics, International Business Machiner Corp. version 22.0 for data analysis, and a P < 0.05 was considered statistically significant. The data analysis was presented qualitatively for the modified criteria. Descriptive analysis was presented as a percentage (%). The demographic characteristics of respondents with mean ± standard deviation, the content validity and face validity with an average of item-level content validity index (I-CVI/ave), and internal consistency form pretest data. The construct validity and reliability were tested and reported with a Pearson correlation and Cronbach's alpha coefficient.[18],[19]


There were 30 (37.04%) of 81 criteria that gave different meanings in the translation process. An overview of the needed modifications of items of the STOPP questionnaire is presented in [Table 1]. The instrument feasibility assessment showed that 5 (62.50%) expert team members agreed to be continued the field test without revision and 3 (37.50%) expert team members agreed to be continued the field test with the revision.{Table 1}

I-CVI values were >0.7 for each item and an I-CVI/ave value was 0.99.[20]

The total number of subjects at the pretest stage was 34 respondents. The basic characteristics of respondents are presented in [Table 2]. The internal consistency showed that 14-item criteria were not relevant. Therefore, a retest was carried out on these items. The internal consistency of the first test and retest is presented in [Table 3].{Table 2}{Table 3}

In the field test stage, respondents were pharmacists from 320 hospitals in Indonesia and obtained 230 (71.88%) respondents. Five respondents did not complete the questionnaire, and the characteristics of respondents are presented in [Table 2]. A mean score of each item criterion was more than 3 points, except for Item_1 (2.86 ± 0.95), Item_2 (2.86 ± 0.89), and Item_10 (2.96 ± 0.73). The construct validity was 5-item criteria that were “not valid,” namely in Item_1/A1 (r = 0.262; P = 0,000), Item_3/A3 (r = 0.423; P = 0,000), Item_10/B7 (r = 0.401; P = 0,000), Item_13/B20 (r = 0.373; P = 0,000), and Item_19/C3 (r = 0.442; P = 0,000). The Cronbach's alpha was 0.978 and Cronbach's alpha based on standardized items was 0.979.


This study used a different validation method from the study of Luz et al. and Samaranayake et al. They used the Delphi two-round method.[21],[22] The forward translation process involved three translators, one translator was an educator who understood pharmacy and clinical pharmacy well, and the other two were educators who were experts in the languages and cultures of both countries (English and Indonesian). It aimed to get the right word selection and reduce the ambiguous meanings, so produced a better instrument equivalence.[15] Our study also conducted a review of the results of the back translation obtained from the authorities, which provides corrections to 5 criteria related to the replacement of terms, an affirmation of statements in sentences, replacement of words, improvement of wording, and an affirmation of subgroups of drugs. This process aimed to reduce errors in translation results, correct sentences to be easily understood, and assess the quality of translations with the original version.[23]

The expert team review stage begun with the submission of the manuscript, to be reviewed every for all item in STOPP_INA and responded in writing; forming panels, to share their opinions and opinions with one another; and sending the text of the reconciliation results from any difference of opinion in the panel, to be reviewed and responded to in writing. This process was carried out to obtain semantic equality, idiomatic equality, experiential equality, and conceptual equality between STOPP_INA instruments and the original version.[23] The description of the field test respondents showed that the data collection was quite good.

Data were obtained from regional 1 to regional 5, which means that it could represent the entire territory of Indonesia. Respondents' assessment of STOPP_INA used four Likert scales. This aimed to eliminate the answer to the middle value, which is “don't know.”

The measurement of content validity in this study was obtained from qualitative and quantitative measurements. Qualitative measurements resulted from the consideration of the expert team (validity by assumption),[24] which resulted in a modification in the STOPP criteria as in [Table 1]. Quantitative measurements were obtained from two subjects, namely from the expert team and respondents in the pretest stage. The content validity of the expert team review was measured with the I-CVI and the I-CVI/ave value, which means that there was a match between each measurement item with the contents of the measured variable.[20],[24]

The content validity of the pretest respondents was measured using correlation between test factors[24] that 14 items had a low conformity with a correlation value <0.45,[25] which means that they had a low alignment and consistency of items to the instrument.[26] Therefore, these items were retested at the same respondent to improve internal consistency. The face validity qualitatively showed that a correlation was obtained from the reviews and opinions of the expert team, related to the consistency of the style and format of the writing, while from respondents in the pretest stage, related to the readability and clarity of the language, not confusing, unambiguous, a sentence was not too long or too short.[24],[27] Quantitative measurements were obtained from descriptive eligibility both from the expert team and from pretest respondents, which showed that the instrument could be accepted.

The construct validity qualitatively (“validity by assumption”) was obtained from content validity and face validity, which results in a modification of STOPP_INA before field testing.[24] The quantitative, carried out empirically using field test data, through measurement of internal consistency (Cronbach's alpha), Item to Total items correlations, Inter-Item Correlation, Cronbach's Alpha if Item Deleted.[26] The reliability test analysis showed a high value of internal consistency degree for each item and all items in the instrument, which means that the STOPP_NA instrument was reliable for repeated measurements. Based on the item's correlation value to the total Item it shows 5 Item “not valid” criteria because it gave a correlation value <0.45.[25] Their item showed a low correlation value to other items, as in Item_1 (A1) with each item in the instrument, except for Item_2 (A2), while in Item_3 (A3), Item_10 (B7), Item_13 (B13), and Item_19 (C3), each has a low correlation with each other item. Therefore, they could be considered to be removed from the STOPP_INA instrument. Criteria of “not valid” did not remove from the instrument because they have related to other criteria, even though removing the criteria could increase the Cronbach's alpha significantly. This was being caused by some matter, among others: in criterion A1, had an incomplete sentence. The correction was an inserting the word “and” in-between words “indications based”; in criterion A3, had no relevance between the sentence of a statement and an explanation. The correction was a changing word “a new drug” became “other drugs of the same class/group”; in criterion B7, had been influenced ability and experience of respondents in clinical practice of geriatric care. In old age, oedema can occur due to poor circulation (sitting too often), so causing a buildup of fluid in the lower body, especially at the ankles and feet. The correction was a using of criterion that had been agreed by the expert team; in criterion B10, had an imperfection of sentence order. The correction was an inserting of explanation sentence before the word “except”; in criterion C3: had not given the name of medicines. The correction of C3 was an adding of the name of medicines. The Adaptation and validation of STOPP version 2 for Indonesian population could be accepted 81 criteria (100.00%), was different from the STOPP-START adaptation study for the Sri Lanka population that had been rejected 8% item of original instruments.[22]

The Indonesian version of STOPP criteria has been developed. We hope the instrument can be used in clinical practice and research on medication among the elderly. Currently, the STOPP_INA are being tested in clinical practice against elderly patients undergoing hospitalization for ensuring the capability of the instrument as a tool of identification PIM. The final adapted and validated version of the questionnaire is available online in the journal's website as a [Supplement Table 1].[INLINE:1]

 Authors' Contribution

All authors contributed to the design, the questionnaire developing, data collection, and analysis. All authors participated in the editing, reviewing, and approval of the final version of the manuscript.


We express our gratitude and a special appreciation Prof. Dr Denis O´Mahony who has given permission for adapting the STOPP criteria version 2 for Indonesia and Prof. Dr. dr. Siti Setiati, SpPD-K. Ger, M. Epid, FINASIM who has given correction and suggestion for this research. We would like to thank the team of experts namely Prof. Dr. dr. Armen Muchtar, DAF., DCP., SP.FK (K)., Prof. Dr. dr. Pradana Soewondo, SpPD-KEMD., Dr. dr. Czeresna Heriawan Soejono, SpPD-K. Ger, M. Epid., Susi Sunarya, PhD., dr. Budi Wahyono, SpS., dr. Inez Ariadne Siregar, SpJP., and dra. Yulia Trisna, M. Pharm., Apt., and so thanks to all pharmacists who have participated in this study.

Financial support and sponsorship

We would like to express our gratitude to the Directorate of Research and Community Services for the Research Grant 2019 of Doctoral Program of Indonesia University.

Conflicts of interest

There are no conflicts of interest.


1Susilo D, Chamami A, Nugroho SW. Elderly Population Statistics 2014; National Socioeconomic Survey Results. Central Bureau of Statistics. 12th ed. Jakarta, Indonesia: 2015.
2Niccoli T, Partridge L. Ageing as a risk factor for disease. Curr Biol 2012;22:R741-52.
3Di Giorgio C, Provenzani A, Polidori P. Potentially inappropriate drug prescribing in elderly hospitalized patients: An analysis and comparison of explicit criteria. Int J Clin Pharm 2016;38:462-8.
4Andrew MK, Purcell CA, Marshall EG, Varatharasan N, Clarke B, Bowles SK. Polypharmacy and use of potentially inappropriate medications in long-term care facilities: Does coordinated primary care make a difference? Int J Pharm Pract 2018;26:318-24.
5Rosted E, Schultz M, Sanders S. Frailty and polypharmacy in elderly patients are associated with a high readmission risk. Dan Med J 2016;63. pii: A5274.
6Radiyanti, Rahmawati F, Probosuseno. Peresepan obat tdak tepat dan adverse drug events pada pasien geriatri rawat inap di rumah sakit umum. J Manaj dan Pelayanan Farm. 2016;6:47-54.
7Masumoto S, Sato M, Maeno T, Ichinohe Y, Maeno T. Potentially inappropriate medications with polypharmacy increase the risk of falls in older Japanese patients: 1-year prospective cohort study. Geriatr Gerontol Int 2018;18:1064-70.
8Fialová D, Laffon B, Marinković V, Tasić L, Doro P, Sόos G, et al. Medication use in older patients and age-blind approach: Narrative literature review (insufficient evidence on the efficacy and safety of drugs in older age, frequent use of PIMs and polypharmacy, and underuse of highly beneficial nonpharmacological strategies). Eur J Clin Pharmacol 2019;75:451-66.
9Hill-Taylor B, Walsh KA, Stewart S, Hayden J, Byrne S, Sketris IS. Effectiveness of the STOPP/START (Screening Tool of Older Persons' potentially inappropriate Prescriptions/Screening Tool to Alert doctors to the Right Treatment) criteria: Systematic review and meta-analysis of randomized controlled studies. J Clin Pharm Ther 2016;41:158-69.
10National Health and Medical Research Council. Guide to the Devlopment, Evalution and Implemetation of Clinical Practice Guidelines. Canberra: National Health and Medical Research Council; 1999. Available from: [Last accessed on 2020 Feb 05].
11Curtin D, Gallagher PF, O'Mahony D. Explicit criteria as clinical tools to minimize inappropriate medication use and its consequences. Ther Adv Drug Saf 2019;10:1-10.
12Gallagher P, O'Mahony D. STOPP (Screening Tool of Older Persons' potentially inappropriate Prescriptions): Application to acutely ill elderly patients and comparison with Beers' criteria. Age Ageing 2008;37:673-9.
13O'Mahony D, O'Sullivan D, Byrne S, O'Connor MN, Ryan C, Gallagher P. STOPP/START criteria for potentially inappropriate prescribing in older people: Version 2. Age Ageing 2015;44:213-8.
14Dyah AP, Nurul K. Pharmacist intervention can reduce the potential use of inappropriate drugs medications in Indonesian geriatric patients. J Appl Pharm Sci 2020;10:88-95.
15Beaton DE, Bombardier C, Guillemin F, Ferraz MB. Guidelines for the process of cross-cultural adaptation of self-report measures. Spine (Phila Pa 1976) 2000;25:3186-91.
16Hambleton RK, Patsula L. Increasing the validity of adapted tests: Myths to be avoided and guidelines for improving test adaptation practices. J Appl Test Technol 1999;1:1-13.
17Lemeshow S, Hosmer DW, Klar J, Lwanga SK. Adequacy of Sample Size in Health Studies. World Health Organization. England: John Wiley & Sons Ltd.; 1991. p. 95.
18Norozi E, Miri MR, Soltani R, Eslami AA, Harivandi AR, Dastjerdi R. Cultural adaptation and psychometric properties of the persian version of the circumstances, Motivation, and readiness scale. Int J High Risk Behav Addict 2016;5:e23242.
19Polit DF, Beck CT. The content validity index: Are you sure you know what's being reported? Critique and recommendations. Res Nurs Health 2006;29:489-97.
20Polit DF, Beck CT, Owen SV. Is the CVI an acceptable indicator of content validity? Appraisal and recommendations. Res Nurs Health 2007;30:459-67.
21Luz AC, Oliveira MG, Noblat L. Cross-cultural adaptation and content validation of START. Sao Paulo Med J 2016;134:20-7.
22Samaranayake NR, Balasuriya A, Fernando GH, Samaraweera D, Shanika LG, Wanigasuriya JK, et al. 'Modified STOPP-START criteria for Sri Lanka'; translating to a resource limited healthcare setting by Delphi consensus. BMC Geriatr 2019;19:282.
23Geisinger KF. Cross-cultural normative assessment: Translation and adaptation issues influencing the normative interpretation of assessment instruments. Psychol Assess 1994;6:304-12.
24Murti B. Validitas Dan Reliabilitas Pengukuran. Matrikulasi Progr Stud Doktoral, Fak Kedokteran, UNS; 2011. p. 1-19.
25DeVon HA, Block ME, Moyle-Wright P, Ernst DM, Hayden SJ, Lazzara DJ, et al. A psychometric toolbox for testing validity and reliability. J Nurs Scholarsh 2007;39:155-64.
26Hajjar ST. Statistical analysis: Internal-consistency reliability and construct validity. Int J Quant Qual Res Methods 2018;6:27-38.
27Arafat S, Chowdhury H, Qusar M, Hafez M. Cross cultural adaptation and psychometric validation of research instruments: A methodological review. J Behav Heal 2016;5:129.