Screening for frailty in primary care: a systematic review of the psychometric properties of the frailty index in community-dwelling older people
BMC Geriatrics volume 14, Article number: 27 (2014)
To better accommodate the complex care needs of frail, older people, general practitioners must be capable of easily identifying frailty in daily clinical practice, for example, by using the frailty index (FI). To explore whether the FI is a valid and adequate screening instrument for primary care, we conducted a systematic review of its psychometric properties.
We searched the Cochrane, PubMed and Embase databases and included original studies focusing on the criterion validity, construct validity and responsiveness of the FI when applied in community-dwelling older people. We evaluated the quality of the studies included using the Quality in Prognosis Studies (QUIPS) tool. This systematic review was conducted based on the PRISMA statement.
Of the twenty studies identified, eighteen reported on FIs derived from research data, one reported upon an FI derived from an administrative database of home-care clients, and one reported upon an FI derived from routine primary care data. In general, the FI showed good criterion and construct validity but lacked studies on responsiveness. When compared with studies that used data gathered for research purposes, there are indications that the FI mean score and range might be different in datasets using routine primary care data; however, this finding needs further investigation.
Our results suggest that the FI is a valid frailty screening instrument. However, further research using routine Electronic Medical Record data is necessary to investigate whether the psychometric properties of the FI are generalizable to a primary care setting and to facilitate its interpretation and implementation in daily clinical practice.
PROSPERO systematic review register number: CRD42013003737.
Among other issues, population ageing poses a major burden on healthcare due to the increasing prevalence of frailty among older people. Frailty is defined as a state of increased vulnerability due to somatic, environmental or psychosocial factors. To better accommodate the complex care needs of frail, older people, a transition towards proactive, population-based care is required, which will improve clinical outcomes and cost-effectiveness [3, 4]. To facilitate this care transition, general practitioners (GPs) must be capable of identifying frail older patients within their daily clinical practice.
The Frailty Index (FI) is one of the screening tools for frailty. An FI comprises a list of health deficits (e.g. symptoms, signs, impairments, and diseases) that are indicative of frailty. The proportion of deficits present forms the patient’s FI score, which can range from zero to one. When an FI consists of at least 30 deficits, different numbers and types of deficits may be used without major influence on the properties of the FI, which enables application in and comparison between different datasets.
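The score calculation described above can be sketched in a few lines of Python. This is a minimal illustration and not the implementation used in any of the reviewed studies; the function name `frailty_index` and the example patient are hypothetical, and deficits are assumed to be scored dichotomously (1 = present, 0 = absent).

```python
from typing import Sequence


def frailty_index(deficits: Sequence[int]) -> float:
    """Return the proportion of deficits present (a value between 0 and 1).

    Each entry is 1 if the deficit is present and 0 if it is absent
    (dichotomous scoring is an assumption of this sketch).
    """
    if not deficits:
        raise ValueError("the deficit list must not be empty")
    if any(d not in (0, 1) for d in deficits):
        raise ValueError("each deficit must be scored 0 or 1")
    return sum(deficits) / len(deficits)


# Hypothetical patient: 9 of 36 listed deficits are present.
example_deficits = [1] * 9 + [0] * 27
example_fi = frailty_index(example_deficits)  # 9 / 36 = 0.25
```

Because the score is a proportion, FIs built from deficit lists of different lengths remain on the same zero-to-one scale, which is what allows comparison between datasets.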
There is considerable debate over whether the FI can be used for frailty screening in daily primary care. Some authors have stated that the FI has not been validated in this setting, that the instrument is of limited value due to its perceived complexity, that the FI has only moderate discriminative ability, and that other frailty instruments, such as the Tilburg Frailty Indicator, are more promising [8–11]. Others have argued that the FI is a significant predictor of adverse health outcomes, that it covers all important frailty factors, that it can be easily derived from routine administrative healthcare data, and they have called for further exploration of the FI’s merits in primary care [12–14].
To further assess the potential of the FI as a screening and monitoring instrument for frailty in primary care, knowledge of its characteristics is essential. Therefore, we performed a systematic review of the literature and assessed the psychometric properties of the FI in identifying frailty among community-dwelling older people.
Search strategy, selection criteria and data extraction
We searched the Cochrane, PubMed, and Embase databases using the terms ‘frailty AND (index OR deficit OR deficits OR cumulative OR accumulation)’. We searched for studies published from August 8th, 2001 onwards, which is the publication date of the landmark study presenting the FI concept . The search was limited to studies in English, and databases were searched until October 30th, 2012. The first and third author (ID and GK) screened titles and abstracts independently and selected studies for full-text assessment. These full-text studies were assessed by the first author for inclusion, and in cases where doubt existed, an independent assessment by the last author (MS) followed. Citations from the included articles were also searched for additional relevant publications by the first author. Eligibility disagreements were resolved by consensus.
Studies were included that met the following criteria: first, the studies focused on an FI. The FI was defined as a list of health deficits for which patients were screened and that provided an FI score that reflected the proportion of deficits present on the predefined list; second, only original research was included that assessed one of the following psychometric properties of the FI: criterion validity, construct validity or responsiveness; third, the studies focused primarily on community-dwelling older people. Community-dwelling older people were defined as older people who lived independently at home; older people who lived at home while receiving home care; and older people living in assisted living facilities. In the Netherlands, GPs provide care to older people in all these different living situations, and virtually all older people in these living situations are registered with a general practice. Studies were excluded when the FI was based on a comprehensive geriatric assessment (CGA), because it is not feasible to perform a CGA for all older patients in general practice. Also, studies were excluded when the entire study population was living in a nursing home, was hospitalized or was selected because of one specific disease in common. Secondary reports of FI datasets that did not report additional psychometric properties were excluded (see Additional file 1 for full details of inclusion and exclusion criteria). Based on these predefined criteria, the first author extracted data on general study characteristics, frailty index characteristics and assessed psychometric properties.
Psychometric properties – definitions
Currently, there is no consensus about a frailty reference standard against which the criterion validity of the FI could be assessed. However, since there is general agreement that the concept of frailty reflects a state of increased vulnerability to adverse health outcomes, criterion validity is defined as the ability of an FI to predict adverse health outcomes . An Area Under the Curve (AUC) of < 0.70 was considered poor; an AUC of 0.70-0.89 was considered adequate; and an AUC of ≥ 0.90 was considered excellent . Construct validity refers to the coherence of the FI with other frailty measures or related conditions and constructs, including comorbidity, disability, self-rated health, age, and gender . Responsiveness reflects the ability of the FI to detect clinically important changes over time in the frailty construct (see Additional file 1 for a detailed description of the various psychometric properties) .
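The AUC thresholds defined above translate directly into a small classification helper. The sketch below is illustrative only; the function name `classify_auc` is hypothetical, and values falling between 0.89 and 0.90 are assumed here to count as adequate, since the stated bands do not cover that interval explicitly.

```python
def classify_auc(auc: float) -> str:
    """Label an AUC using the bands applied in this review."""
    if not 0.0 <= auc <= 1.0:
        raise ValueError("an AUC must lie between 0 and 1")
    if auc < 0.70:
        return "poor"
    if auc < 0.90:  # assumption: values between 0.89 and 0.90 count as adequate
        return "adequate"
    return "excellent"


# Illustrative inputs only.
labels = [classify_auc(a) for a in (0.65, 0.75, 0.92)]
```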
Study quality was evaluated using the Quality in Prognosis Studies (QUIPS) tool, which considers six potential domains of bias: inclusion, attrition, prognostic factor measurement, confounders, outcome measurement, and analysis and reporting . Each domain comprises a number of prompting items, which enable assessment of the domain as having a high, moderate or low risk of bias.
The QUIPS tool was considered the most appropriate quality appraisal tool because, conceptually, the frailty index is a prognostic instrument. We modified three domains of the QUIPS tool. First, in our review, we were interested only in the descriptive, rather than explanatory, relationships of the FI to adverse health outcomes and other measures; thus, we considered the domain ‘confounders’ irrelevant. Second, the domain ‘outcome measurement’ only accommodated studies in which the FI correlated with adverse outcomes, i.e., criterion validity studies. We modified this domain such that the QUIPS tool also applied to studies in which the FI was correlated cross-sectionally or longitudinally with other frailty measures or related constructs, i.e., construct validity or responsiveness studies. Third, in the domain ‘prognostic factor measurement’, we redefined the prompting item ‘Valid and Reliable Measurement of Prognostic Factor’ as ‘Valid and Reliable Construction of Prognostic Factor’ because the FI deficit list must be constructed based on specific criteria [2, 19]: first, deficits should be acquired and related to health status; thus, ‘blue eyes’ is not an appropriate deficit whereas ‘heart failure’ is appropriate; second, deficit prevalence should increase with age; third, deficits should not ‘saturate’ too early, for example, presbyopia is present in almost all older people, thus, it is not appropriate as a deficit; fourth, the combination of deficits in an FI should cover a range of systems; fifth, the same FI should be used in follow-up measures; and finally, the FI should comprise at least 30 deficits and deficit prevalence should be at least 1%  (see Additional file 2 for the modified QUIPS form that was used for the quality appraisal of the studies included).
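The numeric construction criteria listed above (at least 30 deficits, deficit prevalence of at least 1%, no early saturation) lend themselves to an automated check. The following is a rough sketch under stated assumptions: the function name and the example data are hypothetical, and the saturation cut-off of 80% is an assumption of this sketch because the text gives no numeric value. The remaining criteria (relation to health status, increase with age, coverage of a range of systems) require clinical judgement and are not checked.

```python
from typing import Dict, List

MIN_DEFICITS = 30             # stated in the criteria above
MIN_PREVALENCE = 0.01         # stated in the criteria above (at least 1%)
SATURATION_PREVALENCE = 0.80  # ASSUMPTION: the text gives no numeric cut-off


def check_deficit_list(prevalence_by_deficit: Dict[str, float]) -> List[str]:
    """Return a list of flagged problems; an empty list means nothing was flagged."""
    problems: List[str] = []
    if len(prevalence_by_deficit) < MIN_DEFICITS:
        problems.append(
            f"only {len(prevalence_by_deficit)} deficits; at least {MIN_DEFICITS} expected"
        )
    for name, prevalence in prevalence_by_deficit.items():
        if prevalence < MIN_PREVALENCE:
            problems.append(f"{name}: prevalence below {MIN_PREVALENCE:.0%}")
        elif prevalence > SATURATION_PREVALENCE:
            problems.append(f"{name}: may saturate too early")
    return problems


# Hypothetical prevalences, for illustration only.
candidate = {f"deficit_{i}": 0.10 for i in range(29)}
candidate["presbyopia"] = 0.95
flags = check_deficit_list(candidate)  # 30 deficits; presbyopia is flagged
```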
This systematic review was registered prospectively in the PROSPERO international prospective register of systematic reviews (CRD42013003737).
This research was supported by a grant from ZON-MW, The Netherlands Organization for Health Research and Development (reference 311040201). The sponsor had no influence on the research design, data collection, data interpretation, the writing of this report or the decision to publish.
After removing duplicates, our search resulted in 867 studies (Figure 1). We excluded 809 studies after screening the titles/abstracts and 38 studies after full-text assessment. We have listed the full bibliographic details and the reason for exclusion of each of these studies (available upon request). No additional studies were found in manual reference searching; thus, we used twenty studies for our final review.
Description of study characteristics
One study was a cross-sectional study , and nineteen studies were cohort studies with a follow-up ranging from one to twelve years (Table 1). Eighteen studies used survey datasets; one study used an administrative dataset of home-care clients , and one study was based on the analysis of routine administrative primary care data .
In ten studies, the study population was population-based, consisting of a representative mixture of independently living and institutionalized older people, with the majority of people living independently [6, 23–31]. Eight studies included only independently living older people [19, 20, 22, 32–36]; and two studies focused specifically on older people receiving home care or older people in assisted living facilities [21, 37]. The number of participants ranged from 754 to 36,424 older people with a mean age varying from 70.1 to 84.9 years, and the percentage of women varied from 50.0 to 76.7%.
The FIs used in the studies were based on 13 to 92 health deficits. Ten studies scored deficits dichotomously [6, 21–26, 29–31]. Eight studies applied multilevel scoring [19, 28, 32–37] and used, for example, a Likert scale. Two studies did not report how the deficits were scored [20, 27]. Two studies assigned extra weight to predefined deficits [23, 31], for example, to ‘polypharmacy’. The mean FI scores varied from 0.13 to 0.26, and except for two studies that reported a lower maximum FI score [22, 31], the maximum reported FI score varied from 0.60 to 0.70.
Four studies showed a low risk of bias for each of the five domains of the QUIPS tool considered, namely inclusion, attrition, prognostic factor measurement, outcome measurement, and analysis and reporting. Fourteen studies showed a moderate-to-high risk of bias in one or two domains; and two studies showed a moderate-to-high risk of bias in three or four domains (Table 2). Risks of bias were highest in the domain of study attrition, which was due to very low response rates  or an unclear response rate [19, 25, 34]. In one cohort study, attrition was not assessed because only the cross-sectional study component was considered . For the remaining fourteen cohort studies, losses to follow-up were < 16%.
In the domain of prognostic factor measurement, eleven studies were judged as having a moderate risk of bias [19, 20, 22, 24, 26, 27, 30–32, 34, 36]. Of these eleven studies, four studies did not report their entire FI deficit list [20, 26, 27, 32], three used data-driven cut-off points for the FI [24, 26, 30], and nine did not report the percentage of missing FI data or how missing FI data were managed [19, 20, 22, 24, 30–32, 34, 36]. In the remaining nine studies showing a low risk of bias in the prognostic factor measurement, eight reported a percentage of missing data of <5% [21, 23, 25, 28, 29, 33, 35, 37], and one study did not report the percentage of missing data. Six studies managed missing data by excluding the missing deficits from the denominator when calculating the FI [6, 25, 28, 32, 35, 37]. Two studies imputed the missing FI data [23, 29]. All twenty studies complied with the criteria for adequate FI construction as described in the ‘Methods’ section.
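The approach of excluding missing deficits from the denominator can be illustrated as follows. This is a minimal sketch, not code from any of the included studies; the function name is hypothetical, and a missing deficit is represented by `None` as an assumption of this example.

```python
from typing import Optional, Sequence


def frailty_index_excluding_missing(deficits: Sequence[Optional[int]]) -> float:
    """Compute the FI over observed deficits only.

    Missing deficits (None) are dropped from both the numerator and the
    denominator; observed deficits are scored 1 (present) or 0 (absent).
    """
    observed = [d for d in deficits if d is not None]
    if not observed:
        raise ValueError("no observed deficits; the FI cannot be calculated")
    return sum(observed) / len(observed)


# Hypothetical patient: 40 listed deficits, 4 missing, 9 present.
record = [1] * 9 + [0] * 27 + [None] * 4
fi = frailty_index_excluding_missing(record)  # 9 / 36 = 0.25, not 9 / 40
```

Compared with treating missing deficits as absent, this choice avoids systematically lowering the score of patients with incomplete records, at the cost of a denominator that varies between patients.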
In total, across the five QUIPS domains considered in the 20 included studies (inclusion, attrition, prognostic factor measurement, outcome measurement, and analysis and reporting), 5.1% of domain assessments showed a high risk of bias, 25.5% showed a moderate risk of bias, and 69.4% showed a low risk of bias (full QUIPS appraisal forms for each study are available upon request).
Psychometric properties of the FI
Fifteen studies assessed the criterion validity of the FI by evaluating the predictive ability of the FI for mortality, institutionalization, hospitalization, number of days in hospital, morbidity, Emergency Department (ED) visits, out-of-hours GP consultations, falls, fractures, change in ADL score, and change in mental score (Table 3). In each study, the FI was incorporated into a multivariable regression model that was corrected for age, gender and a variety of other co-variables. In each model, the FI was a significant predictor of the assessed outcome.
Twelve studies focused on the prediction of mortality, for which hazard ratios of 1.01 (SE ± 0.003; per deficit increase in the frailty index) to 6.45 (95% CI 4.10-10.14; most-frail group (FI score 0.35-0.65) versus the least-frail group (FI score < 0.07)) were reported [34, 33]. A multivariable model with age, gender, co-morbidity and an FI resulted in an Area Under the Curve (AUC) of 0.691 (95% CI 0.648-0.733) for one-year mortality. Used as a single independent variable, the FI predicted two-year mortality with an AUC of 0.780 (± 0.020 SE) and ten-year mortality with an AUC of 0.720 (± 0.020 SE).
For other outcome measures, comparable AUCs were as follows: 0.610 (95% CI 0.576-0.644) for one-year hospitalization risk and 0.667 (95% CI 0.625-0.707) for a one-year risk of moving to long-term care . For the prediction of time to the combined outcome of ED/out-of-hours GP surgery visits, nursing home admission and mortality, the c-statistic of the FI used as a single independent variable was 0.686 (95% CI 0.664-0.708). When the FI was combined in a model with age, gender, and consultation gap, the c-statistic improved to 0.702 (95% CI 0.680-0.724) .
One study tested the added value of the FI in a multivariable model for predicting adverse health outcomes. For mortality and transition to long-term care, the AUCs of the models including an FI were significantly higher than the AUCs of a model comprising only age, gender and co-morbidity (p < 0.03). For hospitalization, the AUC of the full model with age, gender, co-morbidity and an FI was significantly higher than the AUC of a model comprising only age and gender (p < 0.001) .
Eleven studies evaluated the construct validity of the FI [6, 20, 21, 24–28, 34, 36, 37]. The FI showed a strong positive correlation with the Functional Reach test (r = 0.73), Conselice Study of Brain Ageing (CSBA) score (r = 0.72), Frailty Phenotype (r = 0.65), and Edmonton Frail Scale (EFS; r = 0.61), a strong negative correlation with the Mini Mental State Examination score (r = −0.58), and a moderate correlation with the Changes in Health, End-Stage Disease and Signs and Symptoms (CHESS) Scale (r = 0.35). When the dichotomized FI was compared with the Frailty Phenotype where the latter was used as a reference standard, the FI showed a sensitivity of 45.9 to 60.7% and a specificity of 83.5 to 90.0% [20, 24]. When compared with the Functional Domains model, the sensitivity of the FI was 38%, and its specificity was 91.5%. When using a three-level risk categorization, the weighted kappa of the FI compared with the Frailty Phenotype was 0.17 (95% CI 0.13-0.20), and the weighted kappa of the FI compared with the CHESS scale was 0.36 (95% CI 0.31-0.40).
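For readers less familiar with how sensitivity and specificity follow from comparing a dichotomized FI with a reference standard, the calculation can be sketched as below. The function name and the counts are hypothetical and chosen for illustration only; they are not data from any of the included studies.

```python
from typing import Tuple


def sensitivity_specificity(tp: int, fn: int, tn: int, fp: int) -> Tuple[float, float]:
    """Return (sensitivity, specificity) from the cells of a 2x2 table.

    tp/fn: reference-positive people classified frail / not frail by the FI.
    tn/fp: reference-negative people classified not frail / frail by the FI.
    """
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one reference-positive and one reference-negative case")
    return tp / (tp + fn), tn / (tn + fp)


# Hypothetical counts: 100 people frail and 200 not frail by the reference standard.
sens, spec = sensitivity_specificity(tp=46, fn=54, tn=180, fp=20)
# sens = 46 / 100 = 0.46, spec = 180 / 200 = 0.90
```

A pattern of moderate sensitivity with high specificity, as in this invented example, means that the dichotomized FI misses a sizeable share of people the reference standard labels frail but rarely labels a non-frail person as frail.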
The FI displayed moderate correlation with the concept of self-rated health (r = 0.49), which was expressed as an index of self-rated health deficits . When the crude correlation of the FI was assessed with age, a weak to moderate correlation of 0.193, 0.241 and 0.320, respectively, was reported [6, 25, 26]. One study compared the age trajectories of the FI score within community-dwelling and institutional/clinical cohorts , with higher levels of comorbidity and disability in the latter. The FI score increased gradually with age in community-dwelling cohorts, whereas the FI score was high at all ages in the institutional/clinical cohorts.
One study specifically examined an FI with only symptoms and signs as deficits and demonstrated that older people with higher FI scores showed more functional impairments in (I)ADL and more co-morbidity than patients with lower FI scores.
Without formally assessing correlations within a construct validity context, eighteen studies reported that older people and women show higher FI scores [6, 19, 20, 22, 23, 25–37], and only one study reported a lower percentage of women in the most-frail group.
No studies reported on the responsiveness of the FI in daily clinical practice.
In this systematic review, we demonstrate that the FI adequately predicts a wide range of adverse health outcomes and that its discriminative capability is poor to adequate. The FI correlates strongly with other frailty measures, except for the CHESS scale. However, this scale is not a frailty measure per se but was designed to measure ‘health instability’ and to specifically predict mortality in institutionalized older people . The FI score increases steadily with age, and the maximum FI score reported was 0.70, indicating that no ceiling effect exists.
Our review has a number of strengths. First, we used a broad, sensitive search strategy with a low risk of missing relevant studies. Thus, we identified a large number of studies with consistent results across a variety of FIs in different populations. Second, we only considered relevant psychometric properties. We omitted reliability because the FI is an automated screening procedure and therefore not susceptible to intra- or interrater variability. Internal consistency was not examined because the FI is a formative model, i.e., the items form the construct together and therefore do not need to be correlated . Third, the definitions used were tailored specifically to those aspects considered essential for frailty measures and based on a standardized taxonomy [15, 17]. Fourth, we tailored our detailed inclusion and exclusion criteria to support our aim, which was to select those FI studies relevant for primary care. For example, we excluded studies with an FI based on a comprehensive geriatric assessment because it is not feasible to perform such an assessment for each older patient in primary care. Fifth, we appraised included studies critically using the QUIPS tool, which provided comprehensive quality assessment that demonstrated overall good quality of the methodology used in the included studies. The majority of studies reported sufficient details on their study sample, used appropriate criteria for FI construction, and reported few missing data. Moreover, the reported loss to follow-up was typically well below 20%; thus, biased results were unlikely .
Our review also has several limitations. First, there is a risk of publication bias because studies with negative results are less likely to be published . Because no register exists for validation studies, publication bias could not be formally assessed. Second, due to the withdrawal of one of the authors (GK), the first author (ID) performed the full-text assessment and quality appraisal partially alone, which may have caused potential selection bias. However, strict predefined selection and quality appraisal criteria were applied (see Additional files 1 and 2), and in cases where doubt existed, full-texts were assessed independently by the last author (MS). Third, most of the included studies on construct validity lacked prespecified hypotheses, which increases the risk of bias because, retrospectively, alternative explanations for low correlations may be sought . Because the majority of correlations were robust, this risk appears limited. Finally, an individual patient data meta-analysis would have been preferable when summarizing research on the criterion validity of the FI. However, because the nature and number of deficits differed between the studies, it was not feasible to merge these data. Moreover, due to study heterogeneity, a meta-analysis on the outcome measures was not possible .
Apart from the FI, another frailty screening instrument that has shown good criterion and construct validity is the Frailty Phenotype. One may question whether this performance-based measure would be preferable to implement in general practice, since it also has good face validity, consisting of five easily interpretable parameters (unintentional weight loss, self-reported exhaustion, weakness, slow walking speed, and low physical activity). However, compared to the FI, the Frailty Phenotype would require extra time and resources to enable execution in daily clinical care, and in direct comparison, the FI has been shown to better predict mortality risk among older people.
Our results are consistent with previous FI reviews that also reported on criterion validity and construct validity of the FI [7, 13, 43]. Our review updates these findings, and whereas these previous reviews were narrative in nature, our review is the first to systematically review the FI’s psychometric properties that are relevant to primary care.
In the majority of the included studies on the FI’s criterion validity, its predictive ability for mortality is examined. This does not mean that the FI is meant to be a ‘mortality prediction’ instrument; rather, by including the FI in a multivariable model including age, the FI score aims to explain the variable vulnerability to adverse health outcomes in people of the same age. This heterogeneity in frailty levels is also reflected by the relatively low correlation coefficients that we found between FI and age; whereas, in general, the correlation coefficient for the mean FI scores versus age was high (e.g. r = 0.985), the correlation coefficient for the individual FI scores versus age was at most 0.320.
To assess the construct validity of the FI, we focused on its correlation with other frailty measures, age, gender, disability, comorbidity, and self-rated health. However, the concordance of the FI with a broad array of other measures has also been investigated, and a high FI score has been demonstrated to correlate with both high and low BMI, smoking [45, 46], impaired psychological well-being, psychiatric illness, impaired mobility, impaired cognition and Alzheimer’s disease [50, 51], pain, high levels of gonadotropins, neighborhood deprivation and low individual socio-economic status, rural residence [55, 56], and low education and little social support or participation. The FI may also serve as a basis to calculate ‘biological age’. Individuals with an FI score that is relatively high for their age and gender show a biological age that is higher than their chronological age, and this biological age is also a significant predictor of mortality.
There is no evidence supporting responsiveness or utility. However, some studies reflected upon the potential utility of the FI and noted two major advantages: first, the FI can be constructed from available data, whether from routine administrative primary care data, specific measurements, such as the interRAI-AL instrument, or comprehensive geriatric assessment data [26, 29]. Second, the FI score can be calculated using software, thereby facilitating its clinical application [24, 37]. However, in only one study was the FI actually studied in routine clinical data, so these potential advantages need to be further explored.
One may argue that studies relating FI score change to baseline factors, such as mobility and baseline frailty state, and studies modeling FI score change [49, 59] do describe responsiveness. These studies demonstrate that FI score development over time can be adequately described using a time-dependent Poisson distribution, and that the probability of improvement, stability and worsening of the FI score is directly related to the baseline number of deficits, age, and mobility status. However, we did not consider these studies as responsiveness studies, since they did not study pre-specified hypotheses regarding the expected correlations between changes in the score on the FI instrument and changes in other variables, such as scores on other instruments, or demographic or clinical variables.
An important finding of our systematic review is that eighteen out of twenty studies explored the FI’s psychometric properties in datasets gathered specifically for research purposes. These studies consistently showed a higher maximum and mean FI score compared with the study that investigated the FI using routine primary care data. However, because only one study with an FI using routine primary care data was included, there is not enough evidence to support conclusions about any structural differences in mathematical properties of the FI. More FIs applied in routine primary care data sets should be studied to further explore these potentially different mathematical properties. The narrower FI score range in the study using routine primary care data reflects unexpectedly low deficit prevalences, which may have several causes: first, patients may experience symptoms or problems with which they do not present themselves to the GP; second, there may be suboptimal data registration in the EMR [60, 61]; and third, the FI may need to include more items on level of functioning, mobility or health attitude instead of merely relying on morbidity deficits.
Also, except for the polypharmacy deficit, this FI was based on one single data source out of the Electronic Medical Records (EMRs), namely symptoms and diagnoses encoded according to the International Classification of Primary Care (ICPC, ). Care should be taken to construct an FI that captures all information available in the EMR by using, for example, not only ICPC-encoded data but also diagnostic measurement data, such as body mass index or laboratory tests, and elaborate medication data, encoded according to the Anatomic Therapeutic Chemical (ATC) .
In this systematic review, the FI demonstrates good criterion and construct validity, but its discriminatory ability is poor to adequate. In general, the FI appears to be an easily interpretable instrument that is practical to manage; however, studies that focus on its responsiveness, interpretability or utility are lacking. These results support the potential of the FI as a screening instrument for frailty in primary care and also demonstrate that further research into its psychometric properties is required. FIs based on research data showed higher FI scores than the one FI based on routine primary care data, although this difference rests on a single study. With a view to its implementation in clinical practice, future validation studies of the FI should focus primarily on its application in routine primary care data.
De Lepeleire J, Degryse J, Illiffe S, Mann E, Buntinx F: Family physicians need easy instruments for frailty. Age Ageing. 2008, 37: 484-485. 10.1093/ageing/afn115.
Moorhouse P, Rockwood K: Frailty and its quantitative clinical evaluation. J R Coll Physicians Edinb. 2012, 42: 333-340.
Lacas A, Rockwood K: Frailty in primary care: a review of its conceptualization and implications for practice. BMC Med. 2012, 10: 4. 10.1186/1741-7015-10-4.
Loo TS, Davis RB, Lipsitz LA, Irish J, Bates CK, Agarwal K, Markson L, Hamel MB: Electronic medical record reminders and panel management to improve primary care of elderly patients. Arch Intern Med. 2011, 171: 1552-1558. 10.1001/archinternmed.2011.394.
Heuberger RA: The frailty syndrome: a comprehensive review. J Nutr Gerontol Geriatr. 2011, 30: 315-368. 10.1080/21551197.2011.623931.
Mitnitski AB, Mogilner AJ, Rockwood K: Accumulation of deficits as a proxy measure of aging. Scientific World J. 2001, 1: 323-336.
Rockwood K, Mitnitski A: Frailty in relation to the accumulation of deficits. J Gerontol A Biol Sci Med Sci. 2007, 62: 722-727. 10.1093/gerona/62.7.722.
Hoogendijk EO, van der Horst HE, Deeg DJ, Frijters DH, Prins BA, Jansen AP, Nijpels G, van Hout HP: The identification of frail older adults in primary care: Comparing the accuracy of five simple instruments. Age Ageing. 2013, 42: 262-265. 10.1093/ageing/afs163.
Pialoux T, Goyard J, Lesourd B: Screening tools for frailty in primary health care: A systematic review. Geriatr Gerontol Int. 2012, 12: 189-197. 10.1111/j.1447-0594.2011.00797.x.
Pijpers E, Ferreira I, Stehouwer CD, Nieuwenhuijzen Kruseman AC: The frailty dilemma. Review of the predictive accuracy of major frailty scores. Eur J Intern Med. 2012, 23: 118-123. 10.1016/j.ejim.2011.09.003.
Sternberg SA, Wershof Schwartz A, Karunananthan S, Bergman H, Mark Clarfield A: The identification of frailty: a systematic literature review. J Am Geriatr Soc. 2011, 59: 2129-2138. 10.1111/j.1532-5415.2011.03597.x.
Clegg A, Young J, Iliffe S, Rikkert MO, Rockwood K: Frailty in elderly people. Lancet. 2013, 381: 752-762. 10.1016/S0140-6736(12)62167-9.
Rockwood K, Mitnitski A: Frailty defined by deficit accumulation and geriatric medicine defined by frailty. Clin Geriatr Med. 2011, 27: 17-26. 10.1016/j.cger.2010.08.008.
De Vries NM, Staal JB, van Ravensberg CD, Hobbelen JS, Olde Rikkert MG, Nijhuis-van der Sanden MW: Outcome instruments to measure frailty: a systematic review. Ageing Res Rev. 2011, 10: 104-114. 10.1016/j.arr.2010.09.001.
Rockwood K: What would make a definition of frailty successful?. Age Ageing. 2005, 34: 432-434. 10.1093/ageing/afi146.
McDowell I, Newell C: Measuring health: A guide to rating scales and questionnaires. 1996, New York: Oxford University Press, 2nd edition.
Terwee CB, Bot SD, de Boer MR, van der Windt DA, Knol DL, Dekker J, Bouter LM, de Vet HC: Quality criteria were proposed for measurement properties of health status questionnaires. J Clin Epidemiol. 2007, 60: 34-42. 10.1016/j.jclinepi.2006.03.012.
Hayden JA, van der Windt DA, Cartwright JL, Cote P, Bombardier C: Assessing bias in studies of prognostic factors. Ann Intern Med. 2013, 158: 280-286. 10.7326/0003-4819-158-4-201302190-00009.
Searle SD, Mitnitski A, Gahbauer EA, Gill TM, Rockwood K: A standard procedure for creating a frailty index. BMC Geriatr. 2008, 8: 24. 10.1186/1471-2318-8-24.
Cigolle CT, Ofstedal MB, Tian Z, Blaum CS: Comparing models of frailty: The health and retirement study. J Am Geriatr Soc. 2009, 57: 830-839. 10.1111/j.1532-5415.2009.02225.x.
Armstrong JJ, Stolee P, Hirdes JP, Poss JW: Examining three frailty conceptualizations in their ability to predict negative outcomes for home-care clients. Age Ageing. 2010, 39: 755-758. 10.1093/ageing/afq121.
Drubbel I, de Wit NJ, Bleijenberg N, Eijkemans RJ, Schuurmans MJ, Numans ME: Prediction of adverse health outcomes in older people using a frailty index based on routine primary care data. J Gerontol A Biol Sci Med Sci. 2013, 68: 301-308. 10.1093/gerona/gls161.
Gu D, Dupre ME, Sautter J, Zhu H, Liu Y, Yi Z: Frailty and mortality among Chinese at advanced ages. J Gerontol B Psychol Sci Soc Sci. 2009, 64: 279-289.
Kulminski AM, Ukraintseva SV, Kulminskaya IV, Arbeev KG, Land K, Yashin AI: Cumulative deficits better characterize susceptibility to death in elderly people than phenotypic frailty: lessons from the cardiovascular health study. J Am Geriatr Soc. 2008, 56: 898-903. 10.1111/j.1532-5415.2008.01656.x.
Kulminski A, Yashin A, Arbeev K, Akushevich I, Ukraintseva S, Land K, Manton K: Cumulative index of health disorders as an indicator of aging-associated processes in the elderly: results from analyses of the national long term care survey. Mech Ageing Dev. 2007, 128: 250-258. 10.1016/j.mad.2006.12.004.
Lucicesare A, Hubbard RE, Fallah N, Forti P, Searle SD, Mitnitski A, Ravaglia G, Rockwood K: Comparison of two frailty measures in the Conselice study of brain ageing. J Nutr Health Aging. 2010, 14: 278-281. 10.1007/s12603-010-0061-6.
Lucicesare A, Hubbard RE, Searle SD, Rockwood K: An index of self-rated health deficits in relation to frailty and adverse outcomes in older adults. Aging Clin Exp Res. 2010, 22: 255-260. 10.1007/BF03324805.
Rockwood K, Andrew M, Mitnitski A: A comparison of two approaches to measuring frailty in elderly people. J Gerontol A Biol Sci Med Sci. 2007, 62: 738-743. 10.1093/gerona/62.7.738.
Song X, Mitnitski A, Rockwood K: Prevalence and 10-year outcomes of frailty in older adults in relation to deficit accumulation. J Am Geriatr Soc. 2010, 58: 681-687. 10.1111/j.1532-5415.2010.02764.x.
Woo J, Leung J, Morley JE: Comparison of frailty indicators based on clinical phenotype and the multiple deficit approach in predicting mortality and physical limitation. J Am Geriatr Soc. 2012, 60: 1478-1486. 10.1111/j.1532-5415.2012.04074.x.
Woo J, Goggins W, Sham A, Ho SC: Public health significance of the frailty index. Disabil Rehabil. 2006, 28: 515-521. 10.1080/09638280500215867.
Fang X, Shi J, Song X, Mitnitski A, Tang Z, Wang C, Yu P, Rockwood K: Frailty in relation to the risk of falls, fractures, and mortality in older Chinese adults: results from the Beijing longitudinal study of aging. J Nutr Health Aging. 2012, 33: 1-5.
Garcia-Gonzalez JJ, Garcia-Pena C, Franco-Marina F, Gutierrez-Robledo LM: A frailty index to predict the mortality risk in a population of senior Mexican adults. BMC Geriatr. 2009, 9: 47. 10.1186/1471-2318-9-47.
Mitnitski A, Song X, Skoog I, Broe GA, Cox JL, Grunfeld E, Rockwood K: Relative fitness and frailty of elderly men and women in developed countries and their relationship with mortality. J Am Geriatr Soc. 2005, 53: 2184-2189. 10.1111/j.1532-5415.2005.00506.x.
Shi J, Song X, Yu P, Tang Z, Mitnitski A, Fang X, Rockwood K: Analysis of frailty and survival from late middle age in the Beijing longitudinal study of aging. BMC Geriatr. 2011, 11: 17. 10.1186/1471-2318-11-17.
Theou O, Rockwood MR, Mitnitski A, Rockwood K: Disability and co-morbidity in relation to frailty: How much do they overlap?. Arch Gerontol Geriatr. 2012, 55: e1-e8. 10.1016/j.archger.2012.03.001.
Hogan DB, Freiheit EA, Strain LA, Patten SB, Schmaltz HN, Rolfson D, Maxwell CJ: Comparing frailty measures in their ability to predict adverse outcome among older residents of assisted living. BMC Geriatr. 2012, 12: 56. 10.1186/1471-2318-12-56.
Hirdes JP, Frijters DH, Teare GF: The MDS-CHESS scale: A new measure to predict mortality in institutionalized older people. J Am Geriatr Soc. 2003, 51: 96-100. 10.1034/j.1601-5215.2002.51017.x.
Mokkink LB, Terwee CB, Knol DL, Stratford PW, Alonso J, Patrick DL, Bouter LM, de Vet HC: The COSMIN checklist for evaluating the methodological quality of studies on measurement properties: a clarification of its content. BMC Med Res Methodol. 2010, 10: 22. 10.1186/1471-2288-10-22.
Laupacis A, Wells G, Richardson WS, Tugwell P: Users’ guides to the medical literature. V. How to use an article about prognosis. Evidence-Based Medicine Working Group. JAMA. 1994, 272: 234-237. 10.1001/jama.1994.03520030076032.
Egger M, Schneider M, Davey Smith G: Spurious precision? Meta-analysis of observational studies. BMJ. 1998, 316: 140-144. 10.1136/bmj.316.7125.140.
Fried LP, Tangen CM, Walston J, Newman AB, Hirsch C, Gottdiener J, Seeman T, Tracy R, Kop WJ, Burke G, McBurnie MA, Cardiovascular Health Study Collaborative Research Group: Frailty in older adults: evidence for a phenotype. J Gerontol A Biol Sci Med Sci. 2001, 56: M146-M156. 10.1093/gerona/56.3.M146.
Rockwood K, Mitnitski A: Frailty, fitness, and the mathematics of deficit accumulation. Rev Clin Gerontol. 2007, 17: 1-12.
Hubbard RE, Lang IA, Llewellyn DJ, Rockwood K: Frailty, body mass index, and abdominal obesity in older people. J Gerontol A Biol Sci Med Sci. 2010, 65: 377-381.
Hubbard RE, Searle SD, Mitnitski A, Rockwood K: Effect of smoking on the accumulation of deficits, frailty and survival in older adults: a secondary analysis from the Canadian study of health and aging. J Nutr Health Aging. 2009, 13: 468-472. 10.1007/s12603-009-0085-y.
Wang C, Song X, Mitnitski A, Yu P, Fang X, Tang Z, Shi J, Rockwood K: Gender differences in the relationship between smoking and frailty: Results from the Beijing longitudinal study of aging. J Gerontol A Biol Sci Med Sci. 2013, 68: 338-346. 10.1093/gerona/gls166.
Andrew MK, Fisk JD, Rockwood K: Psychological well-being in relation to frailty: A frailty identity crisis?. Int Psychogeriatr. 2012, 24: 1347-1353. 10.1017/S1041610212000269.
Andrew MK, Rockwood K: Psychiatric illness in relation to frailty in community-dwelling elderly people without dementia: a report from the Canadian study of health and aging. Can J Aging. 2007, 26: 33-38. 10.3138/8774-758W-702Q-2531.
Fallah N, Mitnitski A, Searle SD, Gahbauer EA, Gill TM, Rockwood K: Transitions in frailty status in older adults in relation to mobility: a multistate modeling approach employing a deficit count. J Am Geriatr Soc. 2011, 59: 524-529. 10.1111/j.1532-5415.2011.03300.x.
Mitnitski AB, Fallah N, Rockwood K: Changes in cognition in relation to frailty in older Canadians. Alzheimers Dement. 2009, 5: 391-
Song X, Mitnitski A, Rockwood K: Frailty and Alzheimer’s disease in older men and women: results from a population-based study. Alzheimers Dement. 2009, 5: 129-
Shega JW, Dale W, Andrew M, Paice J, Rockwood K, Weiner DK: Persistent pain and frailty: a case for homeostenosis. J Am Geriatr Soc. 2012, 60: 113-117. 10.1111/j.1532-5415.2011.03769.x.
Tajar A, O’Connell MD, Mitnitski AB, O’Neill TW, Searle SD, Huhtaniemi IT, Finn JD, Bartfai G, Boonen S, Casanueva FF, Forti G, Giwercman A, Han TS, Kula K, Labrie F, Lean ME, Pendleton N, Punab M, Silman AJ, Vanderschueren D, Rockwood K, Wu FC, European Male Aging Study Group: Frailty in relation to variations in hormone levels of the hypothalamic-pituitary-testicular axis in older men: results from the European male aging study. J Am Geriatr Soc. 2011, 59: 814-821. 10.1111/j.1532-5415.2011.03398.x.
Lang IA, Hubbard RE, Andrew MK, Llewellyn DJ, Melzer D, Rockwood K: Neighborhood deprivation, individual socioeconomic status, and frailty in older adults. J Am Geriatr Soc. 2009, 57: 1776-1780. 10.1111/j.1532-5415.2009.02480.x.
Song X, MacKnight C, Latta R, Mitnitski AB, Rockwood K: Frailty and survival of rural and urban seniors: results from the Canadian study of health and aging. Aging Clin Exp Res. 2007, 19: 145-153. 10.1007/BF03324681.
Yu P, Song X, Shi J, Mitnitski A, Tang Z, Fang X, Rockwood K: Frailty and survival of older Chinese adults in urban and rural areas: results from the Beijing longitudinal study of aging. Arch Gerontol Geriatr. 2012, 54: 3-8. 10.1016/j.archger.2011.04.020.
Woo J, Goggins W, Sham A, Ho SC: Social determinants of frailty. Gerontology. 2005, 51: 402-408. 10.1159/000088705.
Goggins WB, Woo J, Sham A, Ho SC: Frailty index as a measure of biological age in a Chinese population. J Gerontol A Biol Sci Med Sci. 2005, 60: 1046-1051. 10.1093/gerona/60.8.1046.
Mitnitski A, Bao L, Rockwood K: Going from bad to worse: a stochastic model of transitions in deficit accumulation, in relation to mortality. Mech Ageing Dev. 2006, 127: 490-493. 10.1016/j.mad.2006.01.007.
Kahn NA, Visscher S, Verheij RA: The quality of the electronic medical records of general practitioners. (report in Dutch - de kwaliteit van het elektronisch patiëntendossier van huisartsen gemeten). 2011, NIVEL: Utrecht
Soler JK, Pringle M: Research using electronic patient records in general practice. The EGPRN meeting in Bertinoro, Italy, May 2009. Eur J Gen Pract. 2010, 16: 186-189. 10.3109/13814788.2010.501374.
Lamberts HWM: ICPC international classification of primary care. 1987, Oxford: Oxford University Press
WHO Collaborating Centre for Drug Statistics Methodology: Guidelines for ATC classification and DDD assignment 2011. 2010, Oslo
The pre-publication history for this paper can be accessed here: http://www.biomedcentral.com/1471-2318/14/27/prepub
We thank Dr. Hans Reitsma for his advice on the possibilities of performing a meta-analysis, and we thank Dr. Saskia Weldam for her suggestions on the eligibility criteria and quality assessment.
The authors declare that they have no competing interests.
ID, MJS, MEN and NJW contributed to the study concept and design. ID drafted the manuscript. ID and GK selected relevant studies independently, and ID extracted data and assessed the quality of the included studies. ID, NJW, NB, MEN and MJS provided a critical review of this manuscript. All authors read and approved the final version of this manuscript.
Cite this article
Drubbel, I., Numans, M.E., Kranenburg, G. et al. Screening for frailty in primary care: a systematic review of the psychometric properties of the frailty index in community-dwelling older people. BMC Geriatr 14, 27 (2014). https://doi.org/10.1186/1471-2318-14-27