Skip to main content

Influencing factors of Barthel index scores among the community-dwelling elderly in Hong Kong: a random intercept model



Barthel Index (BI) is one of the most widely utilized tools for assessing functional independence in activities of daily living. Most existing BI studies used populations with specific diseases (e.g., Alzheimer’s and stroke) to test prognostic factors of BI scores; however, the generalization of these findings was limited when the target populations varied.


The aim of the present study was to utilize electronic health records (EHRs) and data mining techniques to develop a generic procedure for identifying prognostic factors that influence BI score changes among community-dwelling elderly.


Longitudinal data were collected from 113 older adults (81 females; mean age = 84 years, SD = 6.9 years) in Hong Kong elderly care centers. Visualization technologies were used to align annual BI scores with individual EHRs chronologically. Linear mixed-effects (LME) regression was conducted to model longitudinal BI scores based on socio-demographics, disease conditions, and features extracted from EHRs.


The visualization presented a decline in BI scores changed by time and health history events. The LME model yielded a conditional R2 of 84%, a marginal R2 of 75%, and a Cohen’s f2 of 0.68 in the design of random intercepts for individual heterogeneity. Changes in BI scores were significantly influenced by a set of socio-demographics (i.e., sex, education, living arrangement, and hobbies), disease conditions (i.e., dementia and diabetes mellitus), and EHRs features (i.e., event counts in allergies, diagnoses, accidents, wounds, hospital admissions, injections, etc.).


The proposed visualization approach and the LME model estimation can help to trace older adults’ BI score changes and identify the influencing factors. The constructed long-term surveillance system provides reference data in clinical practice and help healthcare providers manage the time, cost, data and human resources in community-dwelling settings.

Peer Review reports


Aging reduces older adults’ physical and cognitive capacities and further affects their basic activities of daily living (ADL) [1]. Long-term clinical surveillance of older adults’ functional independence in ADL has become necessary in community-based elderly care [2, 3]. Several reliable assessment scales have been developed for monitoring functional changes over time [4, 5]. The Barthel Index (BI) is one of the most commonly used scales for measuring functional independence status and especially for assessing improvements during rehabilitation [6]. The BI scale measures respondents’ capability in 10 activities (e.g., feeding, bathing, dressing, etc.), with a total score ranging from 0 to 100 [7]. Lower BI score is associated with greater future disability, longer time and greater care needs for recovery [8]. The BI scale had well-established validity and reliability, with Cohen κ, ranging from good (0.61–0.80) to very good (0.81–1.00), and internal consistency (Cronbach α), ranging from good (0.80–0.89) to excellent (0.93) [9, 10].

Previous studies have reported some factors affecting BI score changes in specific populations, such as patients with Alzheimer’s disease [11], heart failure [12], stroke [3, 6, 13,14,15], cancer, and tumor [16]. These studies presented that BI scores could be affected by heterogeneous demographic information (e.g. age, sex), measurement time [6, 17], and multiple prognostic factors (e.g. psychological factors and social support factors) [1, 14, 18]. However, in long-term surveillance, aging will bring in physical frailty and lead to a variety of health conditions that can affect older adults’ functional status and further affect their BI scores [4, 19]. Such heterogeneity limited the generalization of the results from prospective studies that only used one type of disease cohort.

To ensure the validity of the BI assessments in a general setting among community-dwelling elderly, it is important to choose the professional researcher, nurse, care giver, or therapist for the data collection. However, assessing the BI score at each occurrence of different diseases would increase the workload for healthcare providers in the practical service [20]. Moreover, the efficiency of the traditional BI calculation was limited, particularly for various populations with different diseases, e.g., speech disorders (including dysphasia), depression, or cognitive function, leading to insufficiently sensitivity. This issue could be addressed by using together with other scales/datasets in the long-term assessment [20], for example, time-varying electronic health records (EHRs) data that included possible important prognostic factors such as depression, medical comorbidities [6].

BI scores have usually been assessed annually and recorded in EHRs at nursing homes; meanwhile, EHRs are comprised of records of individual health history, such as diagnosis of disease, medical care, vaccine injections, length of stay in hospitals, etc. [21]. As the health history events occurred irregularly and unequally among individuals, the events in the EHRs provided different resolutions with the routinely assessed BI scores [22]. To fix these issues, integrated visualization is needed to pre-process the data and extract valuable information from individual EHRs [23]. Previous studies have been conducted to visualize different types of data (e.g., health events and durations in health history) following the timelines [24, 25]. However, the visualization between health histories in individual EHRs and the trajectory of the longitudinal BI scores was not well established. Further, the time varying features from aligned visualization of individual EHRs could depict the progression of health histories but have not been widely used in the association with BI scores [6].

Thus, the present study was designed to utilize EHRs and data mining techniques to develop a generic procedure for identifying prognostic factors that influence BI score changes among community-dwelling elderly. In addition, to visualize the individual EHRs and monitor the trajectories of BI scores, we extracted features from long-term EHRs and conducted statistical models to handle the heterogeneity among individuals. We utilized linear mixed-effects (LME) regression to examine whether annually assessed BI scores in geriatric residents were associated with socio-demographics, disease conditions, and extracted features in EHRs. We also compared the fitting performance of competitive models and used the modeling results to identify the factors that had statistically significant effects on the change of BI scores.



This retrospective study included 113 participants aged 65 or above from Hong Kong elderly care centers. The EHRs of the participants between 06 August 2005 and 06 July 2016 were retrieved as the dataset. All the EHRs contained: a) more than one completed BI assessment period; b) more than two repeated BI scores; and c) additional records of socio-demographics and health histories.

The BI total score was utilized in the present study as it has a good (0.80–0.89) to excellent (0.93) internal consistency in previous studies (e.g., rehabilitation settings) [10, 20]. It ensured the full utilization of its numeric information [6] and overcome the ordinal non-hierarchical nature of the section scores in BI scales [20]. Registered nurses in the tested nursing homes performed the BI evaluation for each individual at various time points. The mean time between two BI assessments was 315.5 days, with the median as 336.0 days (IQR 301.0 to 340.0 days). A total of 605 observations of BI scores were included, covering the 492 assessment periods and with a mean of 5.4 assessment times (SD = ±1.9) per participant.

In the EHRs, the participants’ baseline characters were comprised of BI scores, socio-demographics (i.e., age, sex, marriage, religion, education, living arrangements, hobbies), and disease conditions (see Table 1). Health history in EHRs included allergy remarks, acute accidents, diagnosis of disease, revisit records, injection records, hospital admissions and discharges, medical cares, specialized nursing, wound care, and off-home records for periods away from the nursing home. We categorized the participants into two groups: the “active” cohorts, including 55 people who were alive at the end of the data collection time; and the “inactive” group, including 58 residents who left the nursing centers before the end of follow-up for specific reasons (e.g., died or moved to other facilities). This study was approved by the Research Ethics Committee of City University of Hong Kong (reference no.: 2–1-201510_01).

Table 1 Baseline characteristics

Data processing

The EHRs were restructured to aggregate various types of individual-linked data [26], which yielded flexible formulae matching the needs and authorities of specific retrospective cohorts [27]. In structured EHRs, individual health histories were captured in chronological order based on the individual assessment period of BI scores. The utilization and application of the structured EHRs lay in two directions: first, for the individual visualization, we employed the integrated plots to align the longitudinal BI scores with the acute events in health histories chronologically; second, for the influencing factor detection, we implemented the LME model to depict the association between longitudinal BI scores and prognostic factors [6], including socio-demographics, disease conditions, and extracted features from EHRs.

Seventeen disease conditions were retrieved from the diagnosis records in EHRs. The diseases with the top frequency of occurrence rate (above 10%) were hypertension, cataract, dementia, diabetes mellitus, anaemia, pneumonia, urinary tract infection, stroke, cerebrovascular accident (CVA), atrial fibrillation (AF), depression, and heart failure. Other diseases that were relevant to the change of BI scores were also listed, including Parkinson [19], Alzheimer [28], traumatic [29], sclerosis [30, 31], and epilepsy [32].

The extracted features from EHRs included 20 variables from observational records in health histories. At each assessment period of BI scores, we calculated the count of occurrences for time-to-event data and calculated the length of time for the duration data [33, 34]. The count of time-to-event data came from observational records regarding allergies, accidents, diagnoses, medicines, special medical cares, hospital admissions, injections, revisits, and off-home. The duration data described the length of stay in hospitals and the length of dates in taking medicines. Additional 9 features were the counts of the vaccine injection records.

Data analysis

Univariate analysis was conducted to compare the BI scores among disease groups and a significant difference in BI score indicated heterogeneities among individuals. Student’s unpaired t-test was used to detect significant disease conditions in the group mean BI scores.

A linear mixed effect (LME) model with random intercepts was used to capture the heteroscedasticity for individuals [6, 35], and obtain the fixed effects to interpret variables of 46 prognostic factors. The full inclusions of the covariates in the LME model were 17 socio-demographics, 9 disease conditions with the significant difference in the Student’s unpaired t-test, and 20 health history features from structured EHRs.

To depict the correlation between different kinds of prognostic factors and BI scores, we implemented the stepwise regression in a linear model (LM) versus the LME model. We adopted several goodness-of-fit criteria, including R2, AIC, BIC, and log-likelihood, to evaluate the model performance. A consistent result in LM and LME models showed that the full inclusion of covariates yielded the best performance.

To identify the factors influencing the change of BI scores, we conducted a χ2 test with corresponding p values for the full inclusion of prognostic factors in the LME model. A summary of the significant effects in the LME coefficients was provided to determine the influencing factors of BI score changes. All analyses were carried out using the R program (version 3.6.2). The LME model was implemented with the “lmer()” function in “lme4” package [36]. Significance was set at p < 0.05.


Visualization of BI scores and health events

Figure 1 shows the developed visualization approach by aligning individuals’ EHRs with their BI scores in a longitudinal way. In details, the occurrence of time-to-event data was presented in the top four bar-plot panels, with the bar height denoting the event counts. The following were the time-duration panels, with the width and the depth of the color representing the duration and frequency of health history events, respectively.

Fig. 1

Individual visualizations for (a) one participant in the “active” group (top) and (b) one participant in the “inactive” group (bottom). The end of the follow-up period in BI scores were marked by a dashed line in (b). (A: “accidents”; D: “diagnosis”; R: “revisits”; I: “injections”; H: “hospital admissions”; M: “medical cares”; O: “off-home”; S: “special nursing”; and W: “wounds”)

Univariate analysis in disease groups

Table 2 shows the results of the Student’s unpaired t-test for group means of BI scores. There were significant differences in BI scores in the cohorts of cataract, dementia, diabetes mellitus, anaemia, pneumonia, urinary tract infection, Parkinson, Alzheimer, and sclerosis (all p values < 0.05).

Table 2 Univariate analysis of the BI records in disease groups

Model comparisons and sensitivity analysis

Table 3 presents the comparison of the stepwise regression of covariates in LM and LME models. In Model-6, the full inclusion of the variables, including socio-demographics, disease conditions, and extracted features in EHRs, yielded the best performance according to all terms of the evaluation criteria. Comparing with Model-3 in the LM model, the formula of Model-6 in the LME model obtained a marginal R2 of 0.75 with fixed effects and a conditional R2 of 0.84 including individual random effects. When including EHR features, the Model-6 yielded a Cohen’s f2 of 0.68, which indicates a large effect size according to Cohen’s (1988) guidelines [37].

Table 3 Model comparison results of evaluation criteria

Influential factors identified by LME models

Table 4 showed the results of the LME model with the full inclusion of covariates. The random intercepts were among individuals and the fixed effects included socio-demographics, disease conditions, and extracted features in EHRs. The baseline of BI score was 80.6 (SD = 30.3). The Chi-square test results showed that the following factors with significant influence on BI score changes: socio-demographics (i.e., sex, education, living arrangement, hobbies of partying and watching the TV), disease conditions (i.e., dementia and diabetes mellitus), and health history features (i.e., counts of events in allergies, diagnoses, accidents, wounds, hospital admissions, and vaccine injection of pneumococcal and panenza) (all p values < 0.05).

Table 4 Coefficients in LME models with significance test results

Magnitude of significant effects

The results showed significant effects of socio-demographics, including sex, education, living arrangement, hobbies of partying and watching the TV, on BI score changes. In terms of BI score, males had an average of 19.5 points (SD = 7.3, p = 0.01) larger than females. Education status (p < 0.001) and living arrangements (p = 0.02) showed significant effects on individuals’ BI scores. Individual who had hobbies of partying and watching the TV had 17.3 points (SD = 8.0, p = 0.03) increased and 16.5 points (SD = 5.4, p = < 0.001) decreased on the BI scores separately.

For disease conditions (see Table 2), we found that dementia and diabetes mellitus showed a significant effect of − 22.4 points (SD = 6.7, p = < 0.001) and 16.8 points (SD = 6.9, p = 0.01) on BI scores separately.

There were negative effects of health history features, which included counts of events in allergies, diagnoses, accidents, and wounds. Over 90% of cases in the allergy events belonged to drug allergy, and remaining cases were from food allergens. Generally, an allergy event reported 26.8 points’ (SD = 6.3, p = < 0.001) decrease in the BI scores. Over 96% of accident records were fall events. An averaged accident event caused 2.8 points (SD = 1.3, p = 0.03) decline in the total BI scores. The diagnosis events and wound events were recorded according to the follow-up time. With per count increased, the diagnosis events and wound events had − 3.2 points (SD = 0.5, p = < 0.001) and − 9.2 points (SD = 2.4, p = < 0.001) changes on the BI scores, separately.

An increased count in the hospital events, containing a period with admissions and discharges, had a positive effect on BI score changes, with an average of 1.5 points (SD = 0.6, p = 0.02) increased. Individuals vaccinated with PANENZA for the pandemic A/H1N1 influenza outbreak had an average of 33.6 points (SD = 14.1, p = 0.02) increased in the BI score changes. Over 68% individuals had a regular injection of the Pneumo vaccine, showing a negative effect of 9.4 points (SD = 3.7, p = 0.01) decrease on the BI score changes over time.


We integrated EHRs and data mining techniques to detect the influencing factors of BI score changes among the community-dwelling elderly in Hong Kong. Our visualization provided an effective way to display the longitudinal BI scores with the individual health history. The influencing factors of BI score changes included socio-demographics (i.e., sex, education, living arrangement, and hobbies), disease conditions (i.e., dementia and diabetes mellitus), and the extracted features in EHRs (i.e., event counts in allergies, diagnoses, accidents, wounds, hospital admissions, injections, etc.).

The contributions are threefold: 1) the present findings fulfill the research gap of data mining in EHRs and study of longitudinal BI scores; 2) our study highlights the association between features extracted from EHRs and the BI score changes; 3) the LME model is able to support timely detection of the causes of BI score changes. The presented approaches can be generalized to other data that have similar structures as we utilized herein.


In the visualization plots (see Fig. 1), we examined the patterns of health events to understand the clinical features that affected BI scores. Notable increments in the counts of accidents and diagnoses of disease were found as BI scores decreased. Additionally, in the assessment period with lower BI scores, the duration of medical care and hospital admissions were longer than that in the period with higher BI scores. In line with the observed association in the visualizations, features such as the event count in accidents, diagnosis, and hospital admissions were also identified as influencing factors of BI score changes in the LME model (Table 4).

When applying individual EHRs into long-term functional assessment, the declines in BI score trajectories may be due to different effects from the features in EHRs, and further be affected by the sub-groups in age, socio-demographics, and disease conditions. The trajectories of declines in BI scores could be described by the progression of individual EHRs. Once we identified the influencing factors from the features in EHRs, we can predict the functional status from the different aspects in individual EHRs, scheduling additional and necessary care for the recovery [8, 20]. Similar insights were also employed in the comprehensive assessment system in hospital settings. Indeed, this technique has been used previously to demonstrate the association of cognitive decline in healthy older adults [22, 38].

Influencing factors

For socio-demographics, there was a significant gender effect on BI scores, i.e., females had much lower BI scores than males, consistent with previous studies [4,5,6]. One possible reason was that in our present study, the female group had relative larger age (mean = 83.6) than the male group (mean = 82.8) at the first time of BI score assessment (i.e., age at study entry). It indicates that our female subjects’ physical capability might be worse than that of males, leading to poorer BI performance. Our results showed non-significant effect of age in the LME model with an inclusion of EHR features. On the one hand, during the follow-up period, the functional status declined with age naturally in older adults [6], which led to the trajectory of BI scores decreasing gradually over time. On the other hand, the functional status might be changed during an acute event for each individual. The EHR features, such as disease progression, hospitalization, and medical care dominated the change of BI scores at that period. Thus, when including the EHR features in the LME model, the age is not significant in both unadjusted and adjusted models. In addition, we showed that education and living arrangements had significant effects on the change of BI scores. These two variables were commonly used in the prediction of BI scores [11] and functional dependence [3]. Moreover, older adults with hobby “partying” had significant positive effects on BI scores, while groups with hobby “TV” showed significant negative effects. This might be due to that people in the party were more likely to improve their performance in ADL, further preventing their BI scores from deteriorating.

For disease conditions, it is worth mentioning that our significant negative effect of dementia was consistent with previous studies [5, 39]. A short-term follow-up study showed that the BI scores had significantly different values in patients with or without dementia during hospitalization and after discharge [39]. For another significant disease, Murcia et al. (2010) supported that the distribution of diabetes mellitus had significant differences according to BI scores [5]. For EHR features, it was likely that older adults at frailty are vulnerable to acute events [4, 19]. Over 96% of accident records were fall events. It has been shown that people with low BI scores were identified as a high-risk group for falling during an in-patient stay [28]. Our study revealed such an association in both in-patient and out-patient periods, including older adults with higher BI scores. Moreover, older adults who had diagnosis records were identified as high-risk groups with certain diseases, such as infection, heart disease, and dehydration [4, 19]. Consequently, significant influencing factors in extracted features of EHRs also act as explanatory variables for monitoring the BI score changes, such as progression of allergies, wounds, diseases. Thus, using the counts of allergies, wounds, and diagnosis to monitor the change of BI scores is appropriate in community-dwelling older adults with different disease conditions.

The LME model

The stepwise procedure and model comparison analysis showed the strengths of full inclusion variables in the LME model. Unlike the categorical variables in demographics and disease conditions, which affected the baseline of BI scores in different subgroups, health history features in EHRs varied along with time, influencing the BI scores chronologically. The inclusion of features in EHRs significantly improved the modeling likelihood in both LM and LME models. Furthermore, the employed random effect of ID information enhanced the interpretation ability of individual BI scores in the LME model, which was superior to the original least squares. The LME approach takes account of the correlation between repeated measurements on the same individual. It is also used in a short-term study of regular assessed longitudinal BI scores [6].

Based on the significant association between the BI scores and EHR features, there are two aspects for the practical use of the accurate prediction of BI scores, sequentially obtained in the current and the next assessment period. In the LME approach, the EHR features acted as novel adjustments, and the BI estimates are more accurate over a wide range of different disease groups. For instance, even in the same subgroup with specific socio-demographical variables and disease conditions, each point increase in the diagnosis counts might diminish 3 points decrease in the expected (or estimated) annual BI scores. Individuals with different value of the EHR features will receive different care services. Moreover, a period with increasing EHR features might report a severe change of the BI scores. This information implies an additional requirement of the BI assessment at that period. The surveillance system allows the management of time, cost, data, and people in the care teams.

The strengths of our study lie in two application scenarios. In hospital settings, as the BI scores decreased with increasing age gradually, a steep decline of BI scores for the patient was highly associated with an acute event. Consequently, incorporating BI scores in individual EHRs in both inpatient and outpatient cares improves the services in the clinical practice. In community settings, the BI scores were assessed routinely in the annual or bi-annual periods. A significant decline of BI scores indicates severe frailty in personal functional status at that period. In this circumstance, change of the BI scores over the assessment period may be a powerful predictive tool for decreased abilities for activities of daily living, which call for intervention to provide greater care and facilitate recovery [20].

There were some limitations in the present study. First, our findings were derived from one population, i.e., Hong Kong older adults in nursing homes. It remains unclear whether the results can be generalized to other populations, which merits more studies. Even so, the visualization approach that we developed can help to visualize the datasets with similar structure as we utilized, regardless of the study population. Second, the number of participants in our study was relatively small, although the LME model handles the heterogeneity of longitudinal BI scores well for even smaller study groups. As the LME model makes full use of the data, in future work, we could harness a larger sample size in other regions, and we can potentially consider more covariates such as regional or local factors without dimensionality problems. Third, we did not examine inter-observer variations of the longitudinal BI scores. However, previous studies reported the Barthel scale with good (κ = 0.62) to near perfect (κ = 0.99) inter-observer reliability with sample size ranging from 55 to 122 [9, 20].


The present study proposed a visualization approach to correlate individual EHRs with BI scores chronologically. The LME model revealed some influencing factors for BI score changes from the perspectives of socio-demographics, disease conditions and extracted features in EHRs, among a sample of community-dwelling elderly in Hong Kong. The present findings could provide reference data on BI to facilitate elderly care providers in practical decision making and early interventions.

Availability of data and materials

The datasets generated and/or analyzed during the current study are not publicly available due to Institutional Review Board related matters but are available from the corresponding author on reasonable request.



Barthel Index


Electronic health records


Linear mixed-effects


Urinary tract infection


Cerebrovascular accident


Atrial fibrillation


  1. 1.

    Lekan DA, Wallace DC, McCoy TP, Hu J, Silva SG, Whitson HE. Frailty assessment in hospitalized older adults using the electronic health record. Biol Res Nurs. 2017;19:213–28.

    Article  Google Scholar 

  2. 2.

    Ho HK, Matsubayashi K, Wada T, Kimura M, Kita T, Saijoh K. Factors associated with ADL dependence: a comparative study of residential care home and community-dwelling elderly in Japan. Geriatr Gerontol Int. 2002;2:80–6.

    Article  Google Scholar 

  3. 3.

    Tveiten A, Ljøstad U, Mygland Å, Naess H. Functioning of long-term survivors of first-ever intracerebral hemorrhage. Acta Neurol Scand. 2014;129:269–75.

  4. 4.

    Rozzini R, Sabatini T, Cassinadri A, Boffelli S, Ferri M, Barbisoni P, et al. Relationship between functional loss before hospital admission and mortality in elderly persons with medical illness. J Gerontol - Ser A Biol Sci Med Sci. 2005;60:1180–3.

    Article  Google Scholar 

  5. 5.

    Murcia J, Llorens P, Sánchez-Payá J, Reus S, Boix V, Merino E, et al. Functional status determined by Barthel index predicts community acquired pneumonia mortality in general population. J Inf Secur. 2010;61:458–64.

    Google Scholar 

  6. 6.

    Musa KI, Keegan TJ. The change of Barthel index scores from the time of discharge until 3-month post-discharge among acute stroke patients in Malaysia: a random intercept model. PLoS One. 2018;13:1–13.

    CAS  Google Scholar 

  7. 7.

    Mahoney FI, Barthel DW. Functional evaluation: the Barthel index: a simple index of independence useful in scoring improvement in the rehabilitation of the chronically ill: Md. State Med. J. US: Medical and Chirurgical Faculty of the State of Maryland; 1965. p. 61–5.

  8. 8.

    Huybrechts KF, Jaime CJ. The Barthel index and modified Rankin scale as prognostic tools for long-term outcomes after stroke: a qualitative review of the literature. Curr Med Res Opin. 2007;23:1627–36.

  9. 9.

    Duffy L, Gajree S, Langhorne P, Stott DJ, Quinn TJ. Reliability (inter-rater agreement) of the barthel index for assessment of stroke survivors: systematic review and meta-analysis. Stroke. 2013;44:462–8.

    Article  Google Scholar 

  10. 10.

    Hartigan I, O’Mahony D. The Barthel index: comparing inter-rater reliability between nurses and doctors in an older adult rehabilitation unit. Appl Nurs Res. 2011;24:e1–7.

    Article  PubMed  Google Scholar 

  11. 11.

    Kamiya M, Osawa A, Kondo I, Sakurai T. Factors associated with cognitive function that cause a decline in the level of activities of daily living in Alzheimer’s disease. Geriatr Gerontol Int. 2018;18:50–6.

    Article  Google Scholar 

  12. 12.

    Kinugasa Y, Kato M, Sugihara S, Hirai M, Yamada K, Yanagihara K, et al. Geriatric nutritional risk index predicts functional dependency and mortality in patients with heart failure with preserved ejection fraction. Circ J. 2013;77:705–11.

    CAS  Article  Google Scholar 

  13. 13.

    Kong KH, Lee J. Temporal recovery of activities of daily living in the first year after ischemic stroke: a prospective study of patients admitted to a rehabilitation unit. NeuroRehabilitation. 2014;35:221–6.

    Article  Google Scholar 

  14. 14.

    McNaughton H, Weatherall M, Taylor W, McPherson K. Factors influencing rate of Barthel index change in hospital following stroke. Clin Rehabil. 2001;15:422–7.

    CAS  Article  Google Scholar 

  15. 15.

    Sangha H, Lipson D, Foley N, Salter K, Bhogal S, Pohani G, et al. A comparison of the Barthel index and the functional Independence measure as outcome measures in stroke rehabilitation: patterns of disability scale usage in clinical trials. Int J Rehabil Res. 2005;28:135–9.

    Article  Google Scholar 

  16. 16.

    Goldstein DP, Sklar MC, de Almeida JR, Gilbert R, Gullane P, Irish J, Brown D, Higgins K, Enepekides D, Xu W, Su J. Frailty as a predictor of outcomes in patients undergoing head and neck cancer surgery. The Laryngoscope. 2020;130(5):E340–5.

  17. 17.

    Agrawal S, Luc M, Winkowski F, Lindner K, Agrawal AK, Wozniak M, et al. Predictors of mortality in older patients admitted to a geriatric hospital. Geriatr Gerontol Int. 2019;19:70–5.

    Article  Google Scholar 

  18. 18.

    Ryg J, Engberg H, Mariadas P, Pedersen SGH, Jorgensen MG, Vinding KL, et al. Barthel index at hospital admission is associated with mortality in geriatric patients: a danish nationwide population-based cohort study. Clin Epidemiol. 2018;10:1789–800.

    Article  Google Scholar 

  19. 19.

    Watanabe K, Umegaki H, Huang C, Arakawa Martins B, Asai A, Kanda S, et al. Association between dysphagia risk and unplanned hospitalization in older patients receiving home medical care. Geriatr Gerontol Int [Internet]. 2019;19:977–81.

  20. 20.

    Quinn TJ, Langhorne P, Stott DJ. Barthel index for stroke trials: development, properties, and application. Stroke. 2011;42:1146–51.

    Article  Google Scholar 

  21. 21.

    Wongsuphasawat K, Gómez JAG, Plaisant C, Wang TD, Ben S, Taieb-Maimon M. LifeFlow: visualizing an overview of event sequences. Conf Hum Factors Comput Syst - Proc. 2011:1747–56.

  22. 22.

    Mathias JS, Agrawal A, Feinglass J, Cooper AJ, Baker DW, Choudhary A. Development of a 5 year life expectancy index in older adults using predictive mining of electronic health record data. J Am Med Inform Assoc. 2013;20:118–24.

    Article  Google Scholar 

  23. 23.

    An J, Lu X, Duan H. Integrated visualization of multi-modal electronic health record data. 2nd Int Conf Bioinform Biomed Eng iCBBE. 2008;2008:640–3.

    Google Scholar 

  24. 24.

    Plaisant C, Mushlin R, Snyder A, Li J, Heller D, Shneiderman B. LifeLines: using visualization to enhance navigation and analysis of patient records. Proc AMIA Symp. 1998;08:76–80.

    Google Scholar 

  25. 25.

    Wang TD, Plaisant C, Quinn AJ, Stanchak R, Shneiderman B, Murphy S. Aligning temporal data by sentinel events: discovering patterns in electronic health records. Conf Hum Factors Comput Syst - Proc. 2008:457–66.

  26. 26.

    Jensen PB, Jensen LJ, Brunak S. Mining electronic health records: towards better research applications and clinical care. Nat rev genet [internet]. Nat Publ Group. 2012;13:395–405.

    CAS  Article  Google Scholar 

  27. 27.

    Marschollek M, Gövercin M, Rust S, Gietzelt M, Schulze M, Wolf KH, et al. Mining geriatric assessment data for in-patient fall prediction models and high-risk subgroups. BMC Med Inform Decis Mak. 2012;12:1–6.

  28. 28.

    Fisher L, Lieberman MA. A longitudinal study of predictors of nursing home placement for patients with dementia: the contribution of family characteristics. Gerontologist. 1999;39:677–86.

    CAS  Article  Google Scholar 

  29. 29.

    Hong I, Lim Y, Han HS, Hay CC, Woo HS. Application of the Korean version of the modified Barthel index: development of a keyform for use in clinical practice. Hong Kong J Occup Ther. 2017;29:39–46.

    Article  Google Scholar 

  30. 30.

    Kidd D, Howard R, Losseff N, Thompson A. The benefit of inpatient neurorehabilitation in multiple sclerosis. Clin Rehabil. 1995;9:198–203.

    Article  Google Scholar 

  31. 31.

    Houlden H, Edwards M, McNeil J, Greenwood R. Use of the Barthel index and the functional Independence measure during early inpatient rehabilitation after single incident brain injury. Clin Rehabil. 2006;20:153–9.

    Article  Google Scholar 

  32. 32.

    Jungehulsing GJ, Heuschmann PU, Holtkamp M, Schwab S, Kolominsky-Rabas PL. Incidence and predictors of post-stroke epilepsy. Acta Neurol Scand. 2013;127:427–30.

    Article  Google Scholar 

  33. 33.

    Bandyopadhyay S, Wolfson J, Vock DM, Vazquez-Benitez G, Adomavicius G, Elidrisi M, et al. Data mining for censored time-to-event data: a Bayesian network model for predicting cardiovascular risk from electronic health record data. Data Min Knowl Disc. 2015;29:1033–69.

    Article  Google Scholar 

  34. 34.

    Ranganath R, Perotte A, Elhadad N, Blei DM. The survival filter: joint survival analysis with a latent time series. Uncertain Artif Intell - Proc 31st Conf UAI. 2015;2015:742–51.

    Google Scholar 

  35. 35.

    Henderson R. Joint modelling of longitudinal measurements and event time data. Biostatistics. 2000;1:465–80.

    CAS  Article  Google Scholar 

  36. 36.

    Bates D, Maechler M, Bolker B, Walker S, Christensen RHB, Singmann H, et al. Package ‘lme4’. Convergence. 2015;12:2.

    Google Scholar 

  37. 37.

    Selya AS, Rose JS, Dierker LC, Hedeker D, Mermelstein RJ. A practical guide to calculating Cohen’s f 2, a measure of local effect size, from PROC MIXED. Front Psychol. 2012;3:1–6.

    Article  Google Scholar 

  38. 38.

    Baker E, Iqbal E, Johnston C, Broadbent M, Shetty H, Stewart R, et al. Trajectories of dementia-related cognitive decline in a large mental health records derived patient cohort. PLoS One. 2017;12:e0178562.

  39. 39.

    Martín J, Padierna A, Anton-Ladislao A, Moro I, Quintana JM. Predictors of mortality during hospitalization and 3 months after discharge in elderly people with and without dementia. Aging Ment Health. 2019;23:1057–65.

    Article  PubMed  Google Scholar 

Download references


We thank Yan Oi Tong Ltd. for providing the data that was used. We thank Dr. Wenbin Wu, Geriatrics Department, Beijing Hospital, National Center of Gerontology, Institute of Geriatric Medicine, Chinese Academy of Medical Sciences, Beijing 100730, China. for his professional comments on the utility of our methods.


This work was partly supported by the Hong Kong RGC Theme-Based Research Scheme [No. T32–102-14 N], CityU Grant [No. 9610406; No. 9610404] and PolyU Grant [No. P0036146]. The funding body played no role in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.

Author information




Conceptualization: HP, HW, YZ, JC; Data curation: HP, HW, EL; Formal analysis: HP, YZ; Funding acquisition: KLT, FC, HW; Writing-original draft: HP, HW, YZ; Writing-review & editing: HW, YZ, XL; All authors have read and approved the final manuscript.

Corresponding author

Correspondence to Hailiang Wang.

Ethics declarations

Ethics approval and consent to participate

This study was approved by the Research Ethics Committee of City University of Hong Kong (reference no.: 2–1-201510_01). All the data used in the study was anonymized before its use.

Consent for publication

Not applicable.

Competing interests

The authors have declared that no competing interests exist.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Pan, H., Zhao, Y., Wang, H. et al. Influencing factors of Barthel index scores among the community-dwelling elderly in Hong Kong: a random intercept model. BMC Geriatr 21, 484 (2021).

Download citation


  • Barthel index
  • Influencing factors
  • Electronic health records
  • Linear mixed effects model