- Research article
- Open Access
- Open Peer Review
Accessing the discriminatory performance of FRAIL-NH in two-class and three-class frailty and examining its agreement with the frailty index among nursing home residents in mainland China
BMC Geriatrics volume 19, Article number: 296 (2019)
FRAIL-NH has been commonly used to assess frailty in nursing home residents and validated in many ethnic populations; however, it has not been validated in mainland China, where such an assessment tool is lacking. This study aimed to (1) assess the discriminatory performance of FRAIL-NH in two-class frailty (non-frail+ pre-frail vs. frail) and three-class frailty (non-frail vs. pre-frail vs. frail), based on the Frailty Index (FI), (2) determine the appropriate cutoff points for FRAIL-NH that distinguish two-class and three-class frailty, and (3) examine the agreement in classification between FRAIL-NH and FI.
A cross-sectional study of 302 residents aged 60 years or older from six nursing homes in Changsha was conducted. The FRAIL-NH scale and 34-item FI were used to measure frailty. Two-way and three-way receiver operating characteristic (ROC) curves were used to estimate the performance of FRAIL-NH. Cohen’s Kappa statistics were used to examine the agreement between these two measures.
The agreement between FRAIL-NH and FI ranged from 0.33 to 0.55. Regardless of what FI cutoff points were based on, the volume under the ROC surface (VUS) for FRAIL-NH from the three-way ROC were higher than the VUS of a useless test (1/6), and the area under the ROC curve (AUC) for FRAIL-NH from the two-way ROC were higher than the clinically meaningless value (0.5). When using FI cutoff points of 0.20 for pre-frail and 0.45 for frail, FRAIL-NH cutoff points of 1 and 9 in classifying three-class frailty had the highest VUS and the largest correct classification rates. Whichever FI was chosen, the performance of FRAIL-NH in distinguishing between pre-frailty and frailty, and between non-frailty and pre-frailty was equivalent. According to FRAIL-NH, the proportion of individuals with frailty misclassified as pre-frailty was higher than that of individuals with non-frailty misclassified as pre-frailty.
FRAIL-NH can be used as a preliminary frailty screening tool in nursing homes in mainland China. FI should be further used especially for those classified as pre-frailty by FRAIL-NH. It is not advisable to simply combine adjacent two classes of FRAIL-NH to create a new frailty variable in research settings.
The management of frailty in nursing home residents is a significant problem in China. First, the prevalence of frailty measured by the Tilburg Frailty Indicator (TFI) and the Physical Frailty Phenotype (PPF) from cross-sectional studies is high in nursing home residents in China (55.1–55.7%), and it is expected to increase continuously in the next decades [1, 2]. The increasing prevalence of frailty in China can make frailty management more difficult in older adults. Second, many nursing home residents in China live a sedentary lifestyle, which may contribute to further declines in physical function and an acceleration of the frailty process . Third, the lack of frailty measures specific for nursing home residents in China has exacerbated this problem because a substantial proportion of pre-frail or frail older adults has not been identified and thus could not receive timely and appropriate intervention. Adopting an effective frailty assessment tool specific to nursing home residents is a crucial first step for managing frailty in China.
Frailty Index (FI) and PPF are well-recognized and commonly used frailty assessment tools in nursing home settings ; however, they are not always feasible in routine practice in Chinese nursing homes because they are complex and time-consuming to use and are difficult to operate [5,6,7,8]. Other frailty assessment tools (e.g., TFI , Clinical Frailty Scale ) were mainly developed for community-dwellers and inpatients and were inappropriate for nursing home residents. Thus, the true prevalence of frailty in Chinese nursing homes could not be accurately assessed by these tools. Kaehr et al. developed the FRAIL-NH scale specific to nursing home residents after combining the core characteristics of FI and PFF [11, 12]. FRAIL-NH is a short, easy-to-administer and valid frailty assessment tool with high predictive ability of adverse outcomes [6, 11, 13]. It has been validated in many countries around the world, including the U.S. , France , Spain , Australia  and Hong Kong in China . However, to the best of our knowledge, it has not been validated in mainland China.
The clinical and research utility of FRAIL-NH in assessing frailty needs to be evaluated adequately before its ultimate use in mainland China as a screening tool. First, it is essential to determine the discriminative ability of FRAIL-NH in classifying different levels of frailty and the corresponding cutoff points. ROC curve analysis is a valid method to estimate the discriminatory ability of screening tools and to determine their best cutoff points that can directly aid diagnostic decisions . Different FRAIL-NH cutoff points have been established in countries such as Australia and Spain [7, 8, 11, 13]; however, these cutoff points might be inappropriate for nursing home older adults in mainland China because the cutoff points vary in different cultures and ethnicities . Determining the appropriate FRAIL-NH cutoff points offers a simple and accurate way to identify older people who are at risk of frailty and provide timely intervention. Besides, the performance of FRAIL-NH in discriminating non-frailty, pre-frailty and frailty has not been extensively investigated.
Second, as another useful metrics for clinical utility, the agreement of FRAIL-NH measured by kappa statistics needs to be examined with an alternative measure, usually an established gold standard. FI, which captures the multidimensionality of frailty with stronger predictive power for adverse outcomes [16, 17], has often been considered the gold standard of frailty diagnosis in clinical and research practice [18,19,20]. Many studies have compared the agreement between PPF and FI [21,22,23,24], but the agreement between the FRAIL-NH and FI has not been examined.
Third, the research utility of FRAIL-NH also needs to be evaluated. It is a common practice that researchers may create a two-class frailty variable (non-frail+ pre-frail vs. frail) by simply combining the adjacent categories of three-class frailty (non-frail, pre-frail vs. frail) due to statistical consideration such as a small sample size in a particular category. This raises methodological concerns as cutoff points for two-class and three-class frailty might be different.
Taken all together, this study sought to (1) assess the discriminatory performance of FRAIL-NH in two-class frailty (non-frail+ pre-frail vs. frail) and three-class frailty (non-frail vs. pre-frail vs. frail), based on FI; (2) determine the appropriate cutoff points for FRAIL-NH that distinguishes two-class and three-class frailty; and (3) examine the agreement in classification between FRAIL-NH and FI.
Study design, setting, and sample
A cross-sectional study was conducted in six nursing homes in Changsha, a provincial city in China between July and August 2018. A total of 302 nursing home residents were included in this study based on the inclusion and exclusion criteria, and the completeness of data. Details about the study design and process were described elsewhere . The study was approved by the Nursing and Behavioral Medicine Research Ethics Committee of Central South University Xiangya Nursing School (IRB approval number: 2018012).
The FRAIL-NH includes seven items: fatigue, resistance, ambulation, incontinence, weight loss, nutritional approach, and help with dressing [11, 12]. Possible total scores ranged from 0 (the best state) to 14 (the worst state). Detailed information about the FRAIL-NH is presented in Additional file 1: Table S1.
The FI is a count of impairments and illnesses, collectively known as deficits . Each deficit of FI was coded 0 or 1 representing absence or presence, respectively. At least 30 age-related health deficits should be included to calculate a FI . We developed a 34-item FI based on previous studies [5, 13, 26, 27]. The FI score was defined as the ratio between existing deficits and the number of evaluated deficits. Thus, the FI ranged from 0 to 1 (no deficit present, to all deficits present). The detailed information of the FI items and coding are shown in Additional file 1: Table S2.
To detect the severity of frailty and the agreement in classification between both measures, FI, which is a continuous score, was categorized based on cutoff points proposed in studies by Song et al. , Hoover et al. [5, 27], and Saum et al. , because they have widely been applied.
Sociodemographic data including age, gender, educational level, marital status, weight, and height, were collected. Participants’ education levels were categorized into three groups: uneducated/primary, secondary, and university. Marital status was categorized into two groups: never married/widowed or divorced, and married. Body mass index (BMI) was calculated as weight divided by height squared (kg/m2).
Both two-class and three-class frailty was used when examining the agreement and ROC curves. Based on Song et al. , we transformed a FI score into a three-class variable: FI ≤ 0.08 is non-frail; 0.08 < FI < 0.25 is pre-frail; FI ≥ 0.25 is frail. Based on Saum et al. , we also transformed a FI score into another three-class variable: FI ≤ 0.20 (non-frail), 0.20 < FI <0.45 (pre-frail), and FI ≥ 0.45 (frail). Additionally, the FI score was originally a four-class variable (non-frail, pre-frail, frail, most frail) in Hoover’s study [5, 27]. We used this variable to construct three-class frailty by collapsing the frail and most frail groups according to previous studies . Therefore, a score of FI ≤ 0.10 was considered “non-frail,” a score of 0.10 < FI ≤ 0.21 was considered “pre-frail,” and a score of 0.21 < FI ≤ 1 was considered “frail.” Based on these cutoff points, the non-frail group and pre-frail group were combined again, and a FI cutoff points of 0.25, 0.21, and 0.45 were used to create two-class frailty according to Song et al. , Hoover et al. [5, 27], and Saum et al. , respectively.
Two-way ROC analyses were used to evaluate the performance of FRAIL-NH for two-class frailty and to determine the best cutoff points of FRAIL-NH when distinguishing two-class frailty. The two-dimensional ROC curve was constructed with sensitivity and specificity. The best cutoff points of FRAIL-NH can be obtained by examining the FRAIL-NH score that maximized sensitivity and specificity in differentiating two-class frailty based on the FI. The areas under the ROC curves (AUC) were computed to estimate the relative classification ability of FRAIL-NH for two-class frailty. An AUC of greater than 0.9, 0.7–0.9, or 0.5–0.7 represented high, moderate, and low diagnostic accuracy for frailty, respectively .
Three-way ROC analysis is a valid method to assess the performance of the test simultaneously in all classes [14, 31]. In this study, three-way ROC analyses were applied to determine the corresponding aspects of FRAIL-NH for three-class frailty, such as the discriminatory performance and the best cutoff points. Through the three-way ROC analysis, a three-dimensional ROC surface—where its coordinates represent three correct classification rates (CCR) obtained for each class (X = CCR1, Y = CCR2, Z = CCR3)—was described. The best pair of cutoff points on the ROC surface can be achieved by the pair that corresponds to the coordinate on the ROC surface with minimized squared distance to the perfect classification coordinates (1, 1, 1), where the CCRs of the three classes were 100% and the discriminative ability of the FRAIL-NH was the largest . Of this pair of cutoff points, one was employed to distinguish between “non-frail” and “pre-frail” and the other for “pre-frail” and “frail.” The volume under the ROC surface (VUS), an extension of the AUC, is commonly used as an overall performance index of the discriminative accuracy [14, 31]. The VUS varies from 1/6 (useless classification/classifying by chance alone) to 1 (perfect classification), respectively . Therefore, only when the VUS was higher than 1/6 = 0.17 was the diagnostic test considered “good” . The parametric method was used to calculate VUS, its standard error (SE) and confidence interval (CI) when each class follows a normal distribution. Otherwise, the nonparametric method was applied. Besides, to provide further insight into the differences in the three classes, pair-wise comparisons via two-way ROC curves were also performed in a post hoc test to examine the ability of the FRAIL-NH to discriminate between each pair of frailty classes. The area under the three pairwise ROC curves was calculated for the curve corresponding to the comparison between non-frailty vs. pre-frailty, between non-frailty vs. frailty, and between pre-frailty vs. frailty.
The MATLAB 2016a program (MathWorks Natick, MA) and R 3.5.3 software were used for the three-way ROC analysis. The package DiagTest3grp developed by Luo and Xiong  in the R program was used to calculate VUS, SE, CIs, the Youden Index and to determine the best cutoff points. The code developed in MATLAB was used to draw the ROC surface and to perform the two-way ROC analyses as a post hoc test.
Of the total 302 participants, 3.3% (n = 10) had missed data on the Mini-Mental State Examination (MMSE); 3.3% (n = 10) had missed data on the Mini Nutritional Assessment Short Form (MNA-SF). Missing data were mainly because residents with cognitive impairment were not able to respond to specific items and residents with hand problems could not write to complete some MMSE items. No data were missing for the FRAIL-NH. The mean substitution was used to handle missing data. The mean score for the remaining items was imputed for the missing items. IBM SPSS Statistics version 18.0 (IBM Corp., Armonk, NY) was used for descriptive statistics and Cohen’s kappa statistics. Descriptive statistics were reported as means ± standard deviation (SD) for continuous variables or percentages for categorical variables. Cohen’s kappa statistics were used to measure the agreement in classification between FRAIL-NH and FI. Agreement was identified as poor for a Kappa coefficient of ≤0.20, fair for 0.21–0.40, moderate for 0.41–0.60, good for 0.61–0.80, and excellent for 0.81–1.00 .
Table 1 shows the sociodemographic characteristics of 302 nursing home older adults. They were aged between 60 and 100 years (mean age 82.71 ± 8.49); 71.2% of them were female. The majority were never married/divorced/widowed (77.5%), uneducated or with primary education (48.0%), and within normal BMI (61.6%). Mean FRAIL-NH and FI scores were 4.11 ± 3.65, and 0.27 ± 0.11, respectively.
Table 2 shows the cutoff points of FRAIL-NH to classify two-class frailty based on FI. The two-way ROC curve analysis showed that the AUC for the FRAIL-NH was 0.87 (95% CI: 0.84–0.91), 0.86 (95% CI: 0.82–0.90), and 0.93 (95% CI: 0.89–0.97) when using a FI cutoff point of 0.25, 0.21, and 0.45, respectively. The optimal cutoff points of FRAIL-NH in classifying two-way frailty were 4 (66.1% sensitivity, 90.2% specificity), 2 (87.6% sensitivity, 66.3% specificity), 8 (94.1% sensitivity, 82.8% specificity) when using a FI cutoff point of 0.25, 0.21, and 0.45, respectively, according to the maximum principle of Youden’s Index.
Table 3 presents the cutoff points of FRAIL-NH to classify three-class frailty based on FI. The three-way ROC curve analysis showed that the VUS for the FRAIL-NH was 0.68 (95% CI: 0.57–0.76), 0.62 (95% CI: 0.55–0.72), and 0.76 (95% CI: 0.70–0.81) according to the cutoff points of the FI developed by Song et al. , Hoover et al. [5, 27] and Saum et al. , respectively. The optimal pair of FRAIL-NH cutoff points for three-class frailty were 0 and 4 (FRAIL-NH = 0 for non-frail, 0 < FRAIL-NH ≤ 4 for pre-frail, FRAIL-NH > 4 for frail) based on the cutoff points of the FI suggested by Song et al.  or Hoover et al. [5, 27], and 1 and 9 (0 ≤ FRAIL-NH ≤ 1 for non-frail, 1 < FRAIL-NH ≤ 9 for pre-frail, FRAIL-NH > 9 for frail) based on the cutoff points of the FI suggested by Saum et al. . Accordingly, the CCR corresponding to three classes was as follows: CCR1 = 80.0%, CCR2 = 66.1%, CCR3 = 66.1% based on the cutoff points of the FI suggested by Song et al. ; CCR1 = 65.0%, CCR2 = 67.9%, CCR3 = 60.7% according to the cutoff points of the FI developed by Hoover et al. [5, 27]; and CCR1 = 70.9%, CCR2 = 64.3%, CCR3 = 88.2% according to the cutoff points of the FI developed by Saum et al. .
To further analyze the discriminative ability of FRAIL-NH between each pair of frailty classes in three-class frailty, a pairwise post-hoc test was conducted. The ROC curves of each pair were shown in Fig. 1a-c. When the cutoff points of the FI developed by Song et al.  were considered, the areas under the three pairwise ROC curves (Fig. 1a) were 0.819 for the curve corresponding to the comparison between subjects with non-frailty versus pre-frailty, 0.963 for the comparison between non-frail individuals and frail individuals, and 0.805 for the comparison between pre-frail individuals versus subjects with frailty.
When the cutoff points of the FI constructed by Hoover et al. [5, 27] were considered, the areas under the three pairwise ROC curves (Fig. 1b) were 0.764 for the curve corresponding to the comparison between subjects with non-frailty versus pre-frailty, 0.910 for the comparison between non-frail individuals and frail individuals, and 0.781 for the comparison between pre-frail individuals versus subjects with frailty. When the cutoff points of the FI created by Saum et al.  were considered, the areas under the three pairwise ROC curves (Fig. 1c) were 0.820 for the curve corresponding to the comparison between subjects with non-frailty versus pre-frailty, 0.976 for the comparison between non-frail individuals and frail individuals, and 0.817 for the comparison between pre-frail individuals versus subjects with frailty.
Table 4 shows the agreement in classification for two-class frailty between FRAIL-NH and the FI. The Kappa agreement of two categories of these two frailty measures was 0.529 (P < 0.001), 0.551 (P < 0.001) and 0.330 (P < 0.001) when using a FI cutoff point of 0.25, 0.21 and 0.45, respectively. It was observed that when the FI cutoff points of 0.21 and the FRAIL-NH cutoff points of 2 were used for frailty, the agreement value of two classes of these measures was at a maximum, 80.5% ([67 + 176]/302). Based on these cutoff points, the prevalence of frailty was 66.6% according to the FI and 69.5% according to the FRAIL-NH.
Table 5 presents the agreement in classification for three-class frailty between FRAIL-NH and the FI. The Kappa agreement of three classes of both frailty measures was 0.404 (P < 0.001), 0.350 (P < 0.001) and 0.445 (P < 0.001) when the cutoff points of the FI suggested by Song et al. , Hoover et al. [5, 27], and Saum et al.  were used, respectively. When the FI cutoff points of 0.20 and 0.45, and the FRAIL-NH cutoff points of 0 and four were used, the agreement value of three classes of both frailty measures was at a maximum, with 70.2% ([61 + 139 + 12]/302). A total of 180 (59.6%), 201 (66.6%) and 17 (5.6%) individuals were classified as frail by the FI suggested by Song et al. , Hoover et al. [5, 27], and Saum et al. , respectively. However, 41.1, 44.8 and 29.4% of those were misclassified as pre-frailty by FRAIL-NH, respectively. Based on the largest agreement, the prevalence of pre-frailty and frailty measured by FI were 65.9 and 5.6%, respectively; those measured by FRAIL-NH were 56 and 13.6%, respectively.
To the best of our knowledge, this is the first study to assess the discriminatory performance of FRAIL-NH and to examine the agreement between FRAIL-NH and FI among nursing homes in mainland China, considering two-class and three-class frailty at the same time. Our study found that frailty could be identified by the 9 < FRAIL-NH ≤ 14, or 0.45 ≤ FI ≤ 1; pre-frailty could be identified by 1 < FRAIL-NH ≤ 9, or 0.20 < FI < 0.45. FRAIL-NH is a useful tool for frailty screening in Chinese nursing homes; however, it can be used only for preliminary screening because of moderate discrimination between adjacent two frailty classes. The level of agreement between FRAIL-NH and FI ranged from fair to moderate due to their large heterogeneity of captured domains and assessment items. Besides, it is notable that the cutoff points of FRAIL-NH for frailty and the prevalence of frailty could vary depending on the number of classifications.
The results from the three-way ROC analysis showed that whatever FI cutoff points were based on, the VUS values for FRAIL-NH—even the lower limit of the 95% CI of each VUS—were higher than the VUS of a useless test (1/6). This indicates that FRAIL-NH is a useful tool for frailty screening in Chinese nursing homes. This indication can also be drawn from the AUC values for FRAIL-NH that were obtained in two-way ROC analysis because these values were higher than the clinically meaningless value (0.5) and represented moderate to high discriminative accuracy for FRAIL-NH. Relatively, when the cutoff points of the FI developed by Saum et al.  were used, the FRAIL-NH cutoff points of 1 and 9 in classifying three-class frailty had the highest VUS and the largest CCRs. Therefore, the FRAIL-NH cutoff points of 1 and 9 may be the most appropriate to use for categorizing non-frailty, pre-frailty and frailty in Chinese nursing homes. Similarly, the FRAIL-NH cutoff point of 8 in classifying two-class frailty had the highest sensitivity and relatively high specificity. Therefore, it may be the most appropriate to use in Chinese nursing homes when a two-class situation is taken into account.
It should be noted that these FRAIL-NH cutoff points determined in this study need to be further tested in cohort studies. Besides, according to the cutoff points of the FI developed by Song et al.  and Hoover et al. [5, 27], the pair of FRAIL-NH cutoff points established in this study were 0 and four, which were consistent with those determined in 2380 nursing home older adults in Hong Kong, based on the suggested cutoff points of activities of daily living (ADL) . Although the reference standard in this study was different from that in our study, the cutoff points of FRAIL-NH determined in those two studies were still the same. This may be because there were some overlaps in measurement items between FI and ADL. Besides, the cultural and ethnic similarity between mainland China and Hong Kong may be another reason because frailty thresholds vary in different cultures and ethnicities . The mean FRAIL-NH and FI scores obtained in our study were lower than those in a Australian study (4.7 ± 4.1 and 0.35 ± 0.13, respectively) . In that study, aged care facility residents aged 65 and older (mean age 87.5) were investigated, which is likely to contribute to the higher frail level. A few studies have proved that the level of frailty increases with age [4, 5, 22]. We should be cautious, however, to compare this result because of the slight differences between the contents of FRAIL-NH and FI in our study and those in previous studies [7, 8, 11, 13].
The three-way ROC analysis gave a single measure for the overall performance of FRAIL-NH and a set of cutoff points for the whole classes . It cannot assess the discriminative performance of each pair of frailty classes. However, sometimes, the analyses between different sub-groups are necessary because they can provide useful information when classifying individuals. Through post hoc pairwise comparison, we found that the performance of FRAIL-NH in distinguishing between pre-frailty and frailty is the same as when distinguishing between non-frailty and pre-frailty, although its discriminatory ability was moderate to high in distinguishing between each pair of frailty classes. As expected, FRAIL-NH had a higher performance when distinguishing between non-frailty and frailty. That is to say, FRAIL-NH was moderately discriminating between two adjacent frailty classes. Therefore, FRAIL-NH can be used to preliminarily identify people with possible non-frailty and frailty as the first step in a two-stage assessment in nursing home settings or as a quick screening tool integrated into routine nursing home practice because of its brevity and low-assessment burden feature. It is worth noting that more work needs to be done to verify such findings since fewer numbers of people in the group with non-frailty in this study may impose additional restrictions on sub-group analyses. A larger sample size is required in sub-groups to conduct similar analyses.
In this study, the agreement between FRAIL-NH and FI ranged from fair to moderate, regardless of what cutoff points were used. This is likely because there was large heterogeneity in captured domains (see Additional file 1: Tables S1 and S2) and assessment items (7 vs. 34) between these two measures. Co-morbidities, cognition, mood, nutrition, physical function, continence, and polypharmacy are the broad domains covered by the FI. However, the FRAIL-NH does not include cognition. Additionally, the FRAIL-NH is more aligned with the phenotypic variables . Therefore, the agreement of FRAIL-NH with FI was not that good. In clinical practice, individuals who are frail but misclassified as pre-frail should receive more attention as frail older adults need timely intervention . In this study, according to FRAIL-NH, the proportion of frail individuals who were misclassified as pre-frail was higher than that of non-frail individuals misclassified as pre-frail, whichever FI comparators were chosen. Therefore, to avoid frail individuals being neglected, it would be necessary to further assess frailty status by the FI, especially for those classified as pre-frailty by FRAIL-NH.
Surprisingly, our study showed that when the same FI was used as a reference, the FRAIL-NH cutoff points for two-class frailty were different from those for three-class frailty.
For example, based on the FI cutoff points developed by Saum et al. , the FRAIL-NH cutoff point for frailty was 8 when two-class frailty was considered, while the corresponding cutoff point was 9 in the three-class situation. Additionally, the corresponding prevalence of frailty also differed (21.5% VS 13.6%), even when using the same measures in the same population. These findings suggest that the cutoff points of FRAIL-NH for frailty as well as the prevalence of frailty could vary depending on the number of classifications. Therefore, in clinical and research practice, it is not advisable to arbitrarily combine two adjacent classes to form one class, because it may lead to the loss of some information.
Because different frailty measures and cutoff points were used in this study, the prevalence of pre-frailty and frailty varied (26.8 to 65.9%; 5.6 to 69.5%, respectively). This is similar to what was reported in a recent systematic review (pre-frailty: 28.9–52.1%; frailty: 37.9–66.5%) . Additionally, by comparing the prevalence rates measured by FRAIL-NH and FI, we found that FI tended to classify individuals as frail, whereas FRAIL-NH tended to classify individuals as pre-frail. This could be due to the fact that, compared to FRAIL-NH, FI included multidimensional health deficits, such as psychology and cognition. Therefore, FI could categorize people who were at risk of frail while FRAIL-NH categorizes them as pre-frail.
There are several limitations that should be noted. Due to limited research conditions, we only investigated residents living in large-scale nursing homes in Changsha, a typical second-tier city in China’s central region with a specific regional representation [34, 35]. Therefore, FRAIL-NH cutoff points determined in the current study should be cautiously applied. The study’s generalizability may be limited if these cutoff points were applied to small/medium-scale nursing homes or nursing homes in cities with populations with different socioeconomic status. In addition, this study used convenience sampling, which may limit the representativeness of the research samples. Moreover, the sample size should be expanded to further verify the current findings in future studies. Finally, the fact that no health outcomes were evaluated to guide the selection of cutoff points is also a limitation of this study.
FRAIL-NH can be used as a preliminary frailty screening tool in nursing homes in mainland China. FI should be further used to determine frailty status if individualized interventions need to be designed and implemented in nursing home older adults, especially among those classified as pre-frailty by FRAIL-NH. It is not advisable to simply combine two adjacent classes of FRAIL-NH to form a new frailty variable in research settings. The FRAIL-NH cutoff points determined in this study need to be further tested in future research, and findings in this study also need to be verified in larger studies.
Availability of data and materials
The datasets used for the current study are available from the corresponding author upon reasonable request.
the area under the ROC curve
body mass index
correct classification rates
Physical Frailty Phenotype
receiver operating characteristic curves
Tilburg Frailty Indicator
volume under the ROC surface
Hou XL, J G, Wu CX, Bai DX, Xiang YP, Li X, et al. Prevalence and factors associated with frailty among institutional older adults. Chin J Nurs. 2018;53:88–93.
Si X, Jin R, Qiao X-X, Tian X, Liu XY, Dong L, et al. The reliability and validity test of the Chinese version of Tilburg frailty scale for the institutional older adults. Chin J Gerontol. 2018;38:4046–9.
Liu X-X, Hu J. The effects of an intervention on physical activity among nursing home residents in Wuhan. China J Gerontol Nurs. 2015;41:30–40.
Kojima G. Prevalence of frailty in nursing homes : a systematic review and meta-analysis. J Am Med Dir Assoc. 2018;16:940–5.
Blodgett J, Theou O, Kirkland S, Andreou P, Rockwood K. Frailty in NHANES: comparing the frailty index and phenotype. Arch Gerontol Geriatr. 2015;60:464–70.
De Silva TR, Theou O, Vellas B, Cesari M, Visvanathan R. Frailty screening ( FRAIL-NH ) and mortality in French nursing homes : results from the incidence of pneumonia and related consequences in nursing home residents study. J Am Med Dir Assoc. 2018;19:411–4.
Theou O, Tan ECK, Bell JS, Emery T, Robson L, Morley JE, et al. Frailty levels in residential aged care facilities measured using the frailty index and FRAIL-NH scale. J Am Geriatr Soc. 2016;64:e207–12.
Martínez-Velilla N, Herce PA, Herrero ÁC, Gutiérrez-Valencia M, De Asteasu MS, Mateos AS, et al. Heterogeneity of different tools for detecting the prevalence of frailty in nursing homes: feasibility and meaning of different approaches. J Am Med Dir Assoc. 2017;18:898.e1–8.
Gobbens Robbert JJ, Assen Marcel ALMV, Luijkx Katrien G, Maria Th WS, Schols Jos MGA. The Tilburg frailty Indicator: psychometric properties. J Am Med Dir Assoc. 2010;11:344–55.
Rockwood K, Song X, Macknight C, Bergman H, Hogan DB, Mcdowell I, et al. A global clinical measure of fitness and frailty in elderly people. C Can Med Assoc J. 2005;173:489–95.
Kaehr EW, Pape LC, Malmstrom TK, Morley JE. FRAIL-NH predicts outcomes in long term care. J Nutr Health Aging. 2016;20:192–8.
Kaehr E, Visvanathan R, Malmstrom TK, Mb JEM. Frailty in nursing homes: the FRAIL-NH scale. J Am Med Dir Assoc. 2015;16:87–9.
Luo H, Lum TYS, Wong GHY, Kwan JSK, Tang JYM, Chi I. Predicting adverse health outcomes in nursing homes : a 9-year longitudinal study and development of the FRAIL-minimum data set (MDS) quick screening tool. J Am Med Dir Assoc. 2018;16:1042–7.
Luo J, Xiong C. DiagTest3Grp: an R package for analyzing diagnostic tests with three ordinal groups. J Stat Softw. 2012;51:1–24.
Ma L, Tang Z, Zhang L, Sun F, Li Y. Prevalence of frailty and associated factors in the community- dwelling population of China. J Am Geriatr Soc. 2018;66:559–64.
Dent E, Kowal P, Hoogendijk EO. Frailty measurement in research and clinical practice: a review. Eur J Intern Med. 2016;31:3–10.
Theou O, Brothers TD, Mitnitski A, Rockwood K. Operationalization of frailty using eight commonly used scales and comparison of their ability to predict all-cause mortality. J Am Geriatr Soc. 2013;61:1537–51.
Chong E, Ho E, Baldevarona-llego J, Mb MC, Wu L, Tay L. Frailty in hospitalized older adults : comparing different frailty measures in predicting short- and long-term patient outcomes. J Am Med Dir Assoc. 2019;19:450–457.e3.
Drost DP, Van Munster BC. Discriminative value of frailty screening instruments in end-stage renal disease. Clin Kidney J. 2016;9:606–10.
Toosizadeh N, Wendel C, Hsu CH, Zamrini E, Mohler J. Frailty assessment in older adults using upper-extremity function: index development. BMC Geriatr. 2017;17:1–7.
Aguayo GA, Donneau AF, Vaillant MT, Schritz A, Franco OH, Stranges S, et al. Agreement between 35 published frailty scores in the general population. Am J Epidemiol. 2017;186:1–15.
Thompson MQ, Theou O, Yu S, Adams RJ, Tucker GR, Visvanathan R. Frailty prevalence and factors associated with the frailty phenotype and frailty index: findings from the north West Adelaide health study. Australas J Ageing. 2017;37:120–6.
Schoufour JD, Echteld MA, Evenhuis HM. Comparing two frailty concepts among older people with intellectual disabilities. Eur J Ageing. 2017;14:63–79.
Zhu Y, Liu Z, Wang Y, Wang Z, Shi J, Xie X, et al. Agreement between the frailty index and phenotype and their associations with falls and overnight hospitalizations. Arch Gerontol Geriatr. 2016;66:161–5.
Ge F, Liu M, Tang S, Lu Y, Szanton SL. Assessing frailty in Chinese nursing home older adults: a comparison between the FRAIL-NH scale and frailty index. J Nutr Health Aging. 2019;23:291–8.
Searle SD, Mitnitski A, Gahbauer EA, Gill TM, Rockwood K. A standard procedure for creating a frailty index. BMC Geriatr. 2008;24:1–10.
Hoover M, Rotermann M, Sanmartin C, Bernier J. Validation of an index to estimate the prevalence of frailty among community-dwelling seniors. Heal Reports. 2013;24:10–7.
Song X, Mitnitski A, Rockwood K. Prevalence and 10-year outcomes of frailty in older adults in relation to deficit accumulation. J Am Geriatr Soc. 2010;58:681–7.
Saum KU, Dieffenbach AK, Müller H, Holleczek B, Hauer K, Brenner H. Frailty prevalence and 10-year survival in community-dwelling older adults: results from the ESTHER cohort study. Eur J Epidemiol. 2014;29:171–9.
Linden A. Measuring diagnostic and predictive accuracy in disease management: an introduction to receiver operating characteristic (ROC) analysis. J Eval Clin Pract. 2010;12:132–9.
Nakas CT, Yiannoutsos CT. Ordered multiple-class ROC analysis with continuous measurements. Stat Med. 2004;23:3437–49.
Karakaya J, Akin S, Karagaoglu E, Gurlek A. The performance of hemoglobin A1c against fasting plasma glucose and oral glucose tolerance test in detecting prediabetes and diabetes. J Res Med Sci. 2014;19:1051–7.
Cohen J. A coefficient of agreement for nominal scales. Educ Psychol Meas. 1960;20:37–46.
Huang Zh-P. Mental health literacy of residents in Changsha, Xi’an, and Wuxi. Central South University; 2011.
Wang P. The influence of the social participation of the urban elderly on their spiritual life satisfaction: taking Changsha city as an example. Central South University; 2012.
The authors thank the student nurses, nursing home administrators, and residents for their valuable support and participation. We also thank Hui Mao, Yexiong Zeng, and Zun Wang for providing help for statistical analyses.
YL (co-author) is funded by Xiangya Overseas Alumni Association Yale-China Association Summer Health Research Fellowship (Grant 2018-XYOAA-1). The opinions, results, and conclusions reported in this article are those of the authors and are independent of the funding sources. The funding bodies had no role in the design of the study, the collection, analysis, interpretation of data, or in writing the manuscript. The chief investigator will have the final decision on these matters.
Ethics approval and consent to participate
Ethics approval was obtained from the Nursing and Behavioral Medicine Research Ethics Committee of Central South University of Xiangya Nursing School (2018012). All participants (or their legal representative, for those who lacked decision-making capabilities) were provided with a detailed explanation of study procedures and signed the informed consent form.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Ge, F., Liu, W., Liu, M. et al. Accessing the discriminatory performance of FRAIL-NH in two-class and three-class frailty and examining its agreement with the frailty index among nursing home residents in mainland China. BMC Geriatr 19, 296 (2019) doi:10.1186/s12877-019-1314-9
- Discriminatory performance
- Frailty index
- Nursing homes