Skip to main content

Physical performance reference values for Japanese oldest old: a SONIC study



The oldest old, defined as those aged 90 or over, is now the fastest-growing population sector. This study aimed to determine reference values for several physical performance measures (PPMs) among 90-year-olds using internationally standardized measurements and to clarify the characteristics of these indices by comparing their results for 90-year-olds with those for older people 70 and 80.


We used the Septuagenarians, Octogenarians, and Nonagenarians Investigation with Centenarians (SONIC) study data from 2010 to 2018. The study subjects were 70, 80, and 90-year-olds in the target area eligible to participate in the venue. Excluding those certified for long-term care, the final number of eligible persons is 70s cohort 1000 (2010), 80s cohort 973 (2011), and 90s cohort 690. 90s cohort only consisted of three survey waves: 2012, 2015, and 2018. We used hand grip strength and score on the Short Physical Performance Battery (SPPB) for our physical performance measurements. In addition, we statistically analyzed sex and age differences.


The simple mean ± standard deviation (SD) for the 90-year-old respondents were in men, 24.1 ± 5.4 kg in hand grip strength, 0.80 ± 0.22 m/s in usual gait speed, 17.2 ± 6.73 s in 5times chair stand, 5.89 ± 4.42 s in tandem balance, and 8.3 ± 2.2 in SPPB respectively and in women, 14.4 ± 4.0 kg in hand grip strength, 0.72 ± 0.20 m/s in usual gait speed, 17.8 ± 7.89 s in 5times chair stand, 4.72 ± 4.35 s in tandem balance, and 7.5 ± 2.4 in SPPB, respectively. For all PPMs, the age 90 cohort was statistically significantly different from the age 70 and 80 cohorts (all trends P <  0.001). Hand grip strength decreased with a similar gradient with age cohort increase of 10 years for both sexes. In contrast, SPPB lower limb score showed a larger drop between the age 80 and 90 cohorts than between the age 70 and 80 cohorts. We also constructed sex-specific appraisal standards according to quintiles.


Our study yielded inclusive sex-specific reference values and appraisal standards for major physical performance measures not certified as requiring long-term care, community-dwelling, oldest old Japanese. The characteristics of age-related decline in physical performance differed between the upper and lower extremity assessments.

Peer Review reports


The decline in physical performance with aging is a critical issue for older adults because such declines limit older citizens’ activities of daily living (ADLs) and instrumental activities of daily living (IADLs). These limitations in turn inhibit independence and autonomy, which decreases overall quality of life. In particular, in the population of the oldest old, it takes them a great deal longer to recover from illness and fracture than it does younger people [1], so it is important to detect declines in physical performance early and offer appropriate interventions, such as exercise and therapy.

In the United States and Japan, where the aging populations have grown, age 90 years or older has been proposed as the defining age for the oldest old. The defined age is based on the increase in the number of oldest old people and their current status [2, 3]. In developed countries with aging societies, the number of residents in both countries who are over age 90 is rapidly increasing, and this trend is expected to continue and spread to developing countries [4].

In previous surveys of the oldest old, researchers have used various methods to assess their physical performance [5,6,7,8]. Some researchers have used objective measurements such as gait speed and hand grip strength that are used with the younger elderly to evaluate the oldest old, and some measure functioning based on survey scores related to IADL performance [9]. Many if not most studies of the oldest old have been conducted in Europe and the United States [5,6,7,8, 10,11,12], and because the subjects of these studies were Europeans, who are taller and heavier than Japanese and other Asians, it is difficult to use them as a reference for comparison with Asians. Therefore, the Asian Working Group for Sarcopenia (AWGS) advocated establishing separate cutoff values that take into account the body size of Asians from the EUROPE criteria [13] and establishing cutoffs for physical performance that match the current status of aging in each country [14, 15]. In addition to the above research deficiencies, other researchers tend to bundle their results for nonagenarians (people in their 90s) with results for study participants aged 80 or 85 or older [16]. Even in studies of people aged 90 and older, researchers do not make comparisons with other age groups or examine their characteristics, and only some report on physical performance indicators in 90-year-olds [17,18,19,20]. These various gaps in the literature mean that there are no published reference values for physical performance measures (PPMs) in the age group of 90 and above, which manifests in a few ways. For instance, most researchers assume that the oldest old are frail or weak [14, 21, 22], but some could set unduly high goals for this group and recommend excessive exercise and treatment. Without effective reference values, there is no accumulation of evidence on preventive and therapeutic interventions for the growing population of the oldest old.

The Georgia Centenarian Study compared centenarians in Tokyo and Georgia and found that those in Tokyo had lower physical functioning than those in Georgia, with the researchers identifying differences in medical care, living conditions, disabilities, and health status between the two countries as reasons for the differences in functioning [23]. Further examining the subject will require evaluating the physical functioning in the oldest members of society including those aged 90. As the latter group continues to increase in populations worldwide, establishing reference points for functioning in the oldest old will be critical for assessing whether pathological physical declines occur.

According to Cress et al., the state of physical function in the oldest old is highly variable [22]. Therefore, to properly assess the physical function of the oldest old, combining multiple indicators that take into account the ceiling or floor effect of the outcome is desirable, rather than just a single physical function assessment. In their life course approach to healthy aging, Kuh et al. stated that age-related changes in objective indicators and their association with other constructs are significant [24]. They advocate the need for longitudinal cohort studies using standardized measures to examine such associations. However, no studies in Japan have adequately assessed the physical functions of the oldest old using internationally established physical assessment methods and examined the results.

Objectives of this study

Based on the above, the first objective of this study was to propose reference PPMs using internationally comparable physical function assessment methods specifically for individuals aged 90 or above, who traditionally have been grouped with younger elderly populations. The second objective was to compare the reference values for age 90 with those for ages 70 and 80 and to clarify the differences and characteristics of this age group from other age groups of older adults.



The participants in this study were community-dwelling, independently living people who participated in a study of health and longevity called the Septuagenarians, Octogenarians, Nonagenarians Investigation with Centenarians (SONIC), an ongoing prospective cohort study of older people as part of the Centenarian Study initiated in June 2010 [25]. The SONIC study aims to assess the characteristics of a general population sample of older people consisting of three control groups of different ages (70s, 80s, and 90s) for a centenarian cohort and to clarify the factors for health, longevity, and psychological well-being. We conducted this study in two main regions of Japan, eastern and western (Tokyo metropolitan area and Hyogo Prefecture, respectively). Both regions include urban and rural areas.

The data collection for each age cohort was performed in different years because of a large number of participants recruited. The participants were recruited from residential registries and contacted by postal mail.

The SONIC study used a narrow age-range cohort design. Inclusion criteria were older adults whose ages fall within a range of 3 years of each cohort (70 80 90). The first-wave survey began in 2010 for the 70s cohort, 2011 for the 80s cohort, and 2012 for the 90s cohort. Each cohort includes individuals whose ages fall within a range of 3 years (e.g., 69–71 for a 70-year-old cohort). Due to the small number of participants only in the 90s cohort, participants of the 90s were newly recruited and added to the 90s cohort in the second (2015) and the third (2018) waves.

In the first wave (2010 ~ 2012), there were no restrictions on the conditions for participation. However, in the second (2015) and the third (2018) waves, the conditions for participation were set to exclude those who were certified for requiring long-term care or could not walk independently without assistance.

Of the 4307 eligible participants in the 70s cohort, 3307 who did not consent to an on-site survey were excluded, resulting in 1000 on-site participants in the 70s cohort (479 men and 521 women). Of the 5391 eligible participants in the 80s cohort, 4418 who did not consent to the on-site survey were excluded, resulting in 973 on-site participants in the 80s cohort (391 men and 410 women). Of the 4134 eligible participants in the 90s cohort, 3327 who did not consent to the on-site survey were excluded, resulting in 807 on-site participants in the 90s cohort (385 men and 404 women).

We used the 90s cohort (88–91), the oldest old, as the primary target group of this study. Of the 807 on-site participants aged 90 years, we excluded 18 who could not complete all physical function tests, leaving 789 participants from the different survey waves: 264 in 2012 (119 men and 145 women), 264 in 2015 (131 men and 133 women), and 261 in 2018 (135 men and 126 women). After we excluded 89 people who had been certified as requiring long-term care and 10 who were missing care insurance certification data, we were left with 690 study participants (344 males and 346 females) to establish physical performance reference values. Figure 1 presents a flow chart of the details of study participant selection.

Fig. 1
figure 1

Flow chart of the details of the study participant selection

Methods for assessing physical performance

Building on earlier research challenges and limitations, we measured physical performance in a population aged 90 and above using scores for hand grip strength measured by a Smedley YD-100 hand grip dynamometer (Yagami Ltd., Tokyo, Japan) and the SPPB as the basis. Since these two methods have been used in many previous studies of the oldest old, the results of this study can be comparable to those of previous studies. Researchers have used a wide mix of items to evaluate physical performance in older adults [26,27,28]. Among the measures, researchers in Europe and the United States have used the SPPB to evaluate physical performance in frail elderly populations [29, 30]. Even the oldest old can perform it safely, and it has been validated to predict a wide range of indicators such as mortality and nursing care status [31, 32]. In addition, hand grip strength and gait speed are widely used measures for evaluating physical performance regardless of region or age, including in evaluating sarcopenia, and frail in older adults [18, 27, 29]. Researchers have used hand grip strength, in particular, to evaluate muscle strength even in age groups other than the elderly; it has been shown to be the most predictive and valid indicator for all-cause mortality [33].


The SPPB measures balance, gait, strength, and endurance on a scale of 0 to 12. A higher total SPPB score indicates higher lower limb function. Researchers use SPPB as one of the screening tools for functional status, such as sarcopenia [34]. Balance is measured as standing balance holding time with the feet aligned horizontally, semi-vertically (semi-tandem), and vertically (tandem). For this study, we measured gait following Guralnik’s original SPPB gait measurement method [29]. We used a static start method in which we measured usual gait speed from the starting line within a frame of 8 ft. (2.44 m) without an acceleration period. Lower limb strength including endurance is assessed by measuring the time from sitting in a chair to standing for five repetitions. For the chair stand test, we used a chair at the Japanese standard height of 40 cm. We followed the original SPPB scoring criteria [29].

Hand grip strength

In this study, we used the measurement method recommended by the American Society of Hand Therapists [35], whereby participants were required to sit, rotate their shoulders inward to a neutral position, bend their elbows to 90°, place their forearms in a neutral position, and dorsiflex their wrists between 0° and 30°. We measured hand grip strength twice with the participant’s dominant hand up to 1/10th of a kilogram. If the participant could not use the dominant hand to grasp, whether because of a fracture or for other reasons, we used the non-dominant one. For maximum hand grip strength, we used the stronger of the two measurements.

Assessing health-related information

We measured height and weight at the site and calculated body mass index (kg/m2). We collected other information, medical history, self-rated health, and drinking and smoking history, using self-administered questionnaires.

ADLs and IADLs

There are two methods for evaluating functioning in the elderly. Basic ADLs include walking, moving, eating, and bathing, and higher-order IADLs include managing transportation, meal preparation, medication, and finance. We used the Barthel Index [36], which is a widely used measure for assessment and rehabilitation, to assess the study participants’ basic ADL functioning. The Barthel Index scale allows evaluation of functional independence in 10 ADLs (feeding, grooming, bathing, dressing, bowel and bladder care, toilet use, and mobility [ambulation, transfers, and stair climbing]). The responses of the 10-item BI are scored on five levels ranging from 0 to a maximum of 5, 10, or 15, with a total score of 100.

To measure IADLs, we used the Tokyo Metropolitan Institute of Gerontology Index of Competence (TMIG-IC). Lawton systematized seven activity skill levels required for the elderly to live independently in the community. Based on those skills, Koyano et al. developed the TMIG-IC; now, it is widely used for measuring IADLs in Japan [37]. There are 13 yes-no questions with 1 point for each yes, for a score range of 0 to 13 points; higher scores indicate higher functioning.

Statistical analysis

We used descriptive statistics to characterize the study population by means and standard deviations. We analyzed differences between men and women using the unpaired T-test, chi-squared test, and Mann-Whitney U test and calculated PPM means and standard deviations (SDs) for all 90-year-olds and each sex. To compare linear trends in the PPM means within the age cohort, we used one-way analyses of variance by sex. We used quintiles of each PPM to construct appraisal standards according to sex for age 90, which are criteria for determining whether the decline in physical function in the oldest old in question is a standard decline due to aging or a pathological decline. We set significance at p < .05 and performed all statistical analyses using IBM SPSS Statistics version 25.


Table 1 summarizes the characteristics of the 90-year-old study participants. All PPMs were significantly higher in men than women, except for the 5times chair stand test. Men had significantly higher ADL scores than women, but there were no statistically significant differences between men and women on the IADLs.

Table 1 Characteristics of all study participants

Table 2 shows the number of participants unable to perform each PPM and those who had difficulty with the five-times sit-to-stand test, which took more than 20 s. The percentages of unusable data were 2.1% (n = 17 participants), 6.7% (n = 17), 19.7% (n = 158), and 6.6% (n = 53) for hand grip strength, usual gait speed, sit-to-stand test, and tandem balance, respectively. The number of people who took more than 20 s to stand up from a chair five times was 168, or 20.8% of the total. The lowest and highest rates of missing data were for hand grip strength and the 5times chair stand test, respectively.

Table 2 Numbers of participants with unable data for each variable

Tables 3 and 4 present unweighted simple PPM means and SDs according to age group in men and women, respectively. For all PPMs, the age 90 cohort was statistically significantly different from the age 70 and 80 cohorts (all trends P <  0.001). In women, all PPMs showed a significant decreasing trend as the age cohort increased. In contrast, there were no statistically significant differences in 5times chair stand, tandem balance, or SPPB total score between male 70- and 80-year-olds.

Table 3 Descriptive statistics for physical performance measures according to age group (men)
Table 4 Descriptive statistics for physical performance measures according to age group (women)

Hand grip strength decreased equitably with a similar slope (value of the difference between ages 70–80 and 80–90: M 4.79, 4.03; W 2.84, 2.67) with an increasing 10-year age cohort for both sexes. In contrast, usual gait speed, tandem balance, and SPPB total score (value of the difference each between ages 70–80 and 80–90: M 0.18, 2.26; W 0.76, 2.57) showed a larger drop between the 80–90 age cohort than the drop between the 70–80 age cohort. Finally, Table 5 shows the PPM quintiles for men and women of age 90, excluding those who were certified as requiring long-term care. We calculated means for each physical function assessment by sex for the 90-year-old subjects, used these means as reference values, and constructed sex-specific evaluation criteria according to the quintiles. In addition, hand grip strength and SPPB total score show ADL and IADL values according to each quintile (Table 5). Although we found a ceiling effect in tandem balance in men, all other PPMs had an approximately symmetrical distribution. The means and SDs for each PPMs were as follows: Hand grip strength 24.1 ± 5.4 kg, usual gait speed 0.80 ± 0.22 m/s, five-times chair stand 17.2 ± 6.73 s, tandem balance 5.89 ± 4.42 s, and SPPB 8.3 ± 2.2 respectively, in men and hand grip strength 14.4 ± 4.0 kg, usual gait speed 0.72 ± 0.20 m/s, five-times chair stand 17.8 ± 7.89 s, tandem balance 4.72 ± 4.35 s, and SPPB 7.5 ± 2.4, respectively, in women. Both ADLs and IADLs decreased at quintile 1 (Lowest) in hand grip strength. In SPPB, Both ADLs and IADLs showed a stepwise decrease from quintile 3 (Normal) to 1. All of these declines were statistically significant (P <  0.001).

Table 5 Quintiles of physical performance measures for 90 years old


Main findings

The results of this study showed that hand grip strength decreased at a similar slope in each age group as the age group increased. In contrast, the total SPPB score, an indicator of lower limb muscle strength, showed a greater drop between ages 80 and 90 than between ages 70 and 80. This more significant drop suggests a rapid decline in physical function, especially in the lower limbs, during the transition from age 80 to age 90, the age of the oldest old.

Hand grip strength indicates a decline in total muscle strength, which gradually declines with aging. On the other hand, SPPB scores include not only muscle strength of the lower limbs but also the state of functional aspects such as gait and balance in the assessments. Therefore, it is possible that the SPPB is also influenced by the frequency of daily activities (standing up, moving). The rapid decline in lower limb function at age 90 is assumed to be due to much less outdoor mobility, such as a decrease in going out and participation in organizations.

In Japan, the SPPB has not been actively utilized in assessing older adults. The reason for this is that the original SPPB scoring cannot assess differences in lower limb function due to the ceiling effect, which often results in a perfect score, especially when older adults living in the community are targeted [38]. The results of this study showed that the means of the 70- and 80-year-olds were not normally distributed, with each score exceeding 10 and being close to the perfect score. In contrast, the SPPB at age 90 was normally distributed. Below the SPPB quintile three or lower, both ADL and IADL scores show a statistically significant decrease. This significant decrease suggests that at the age of 90, the decline in daily living function may be inferred by the SPPB value. Therefore, we suggest that SPPB is suitable for assessing the oldest old in Japan.

Table 6 presents a comparison of the findings from earlier large-scale cohort studies of physical functioning in elderly populations. These researches follow; the BFC80+ [39], the Leiden 85-Plus [40], the Tokyo Metropolitan Institute of Gerontology-Longitudinal Interdisciplinary Study on Aging (TMIG-LISA) 6 cohort study [16], the Newcastle 85-Plus Study [41], the BFC80+, the 90+ Study [17], the validity 90+ Study [18], the Danish 1905 cohort survey [19], and Genetics of Healthy Aging (GEHA) [20]. In the following sections, we use these previous studies to compare and discuss the results of this study.

Table 6 Assessment of physical performance in previous studies of the oldest old (85 years and older)

Possible causes for value differences

When comparing the results of different studies evaluating physical functions, it is important to use the same measurement methods. The differences between our results and those of previous studies might be attributed to the differences in measurement methods. In the following sections, we discuss possible causes for differences between studies in the evaluated PPMs and propose research directions for future studies focusing on the oldest old.

Hand grip strength

In this study, hand grip strength decreased as age increased for both men and women, which was consistent with earlier findings among the elderly in Japan [42]. Compared to previous studies conducted in Europe, we had a higher value than in the Newcastle 85-Plus Study and lower than in the Leiden 85-Plus Study. These variations reflect the inconsistency in research findings on the oldest old age 90 and over. Hand grip strength varies depending on standing versus sitting, upper limb position, and forearm posture, so appropriate comparisons require similar measurement methods. In populations other than solely older adults, hand grip strength in Japan is often measured in a standing position with the elbow extended. There are few measurement data on older adults in a sitting position [43].

In examinations of differences in hand grip strength by posture, researchers found greater hand grip strength in the standing position than in people who were seated [35]. In previous studies, some researchers measured grip while standing, and some did not mention the measurement posture. These differences in measurement methods could have affected the results of comparing studies.

It is often difficult for oldest old people over age 90 to hold a standing position. In fact, about 20% of the 90-year-old participants in the SONIC study needed some kind of assistance to stand up and hold the standing position, and it was difficult to measure their ability to stand up from a chair. Not only the oldest old but also those who cannot hold a standing position are expected to have weaker grip (muscle) strength than those who can hold a standing position without holding anything. Therefore, a participant’s ability to hold the standing position is a prerequisite for measuring hand grip strength, which can cause selection bias; studies might show high findings because the authors selected people with high muscle strength. Similarly, some investigators measured hand grip strength in an unstable state in which the subject held the balance with one hand while standing. In the Leiden 85-Plus, where the hand grip strength was greater than in the present study, 14.8% of the subjects could not be measured, and it is possible that those results were influenced by selection bias because people who could not stably hold the standing position were excluded from the measurement.

In short, the effect of measurement position on 90-year-olds is significant due to their physical vulnerability, and therefore, when comparing PPMs, it is important to consider the measurement method (such as measurement position and frequency) more strictly than is generally necessary with older adults. The oldest old often have difficulty maintaining a standing position and are at greater risk of falling, which suggests that it is safer and more stable to measure hand grip strength of the oldest old in a sitting posture.

Gait speed

This is the first study to report walking speed by gender in Japanese subjects aged 90 years who were not certified as needing long-term care using an internationally standardized measurement method. Researchers in Japan commonly use a dynamic start with an acceleration period before the measurement distance, whereas Western researchers commonly use a static start without an acceleration period. The dynamic start is faster because it reduces the influence of the slow acceleration period. Because studies on Japanese older adults, including the TMIG-LISA 6 cohort study, used dynamic start [16], it is not possible to accurately compare Japanese findings with findings from overseas studies. However, with the present study, we used the static start method, and thus, we consider that our findings are comparable with those from Western studies.

For the usual gait speed per second in this study, the rates for men (0.8 m/s) and women (0.72) were faster than the overall gait speeds in the Leiden 85-Plus (all: 0.52 m/s) [40] and BFC80+ [39] (age 85; M: 0.7 m/s, W: 0.5 m/s), which included participants at a younger average age than the subjects in the present study. The usual gait speeds we found were faster than those in the 90+ Study [17] (ages 90–94; M: 0.66, W: 0.52), which used older participants than ours, and those in the Danish 1905 cohort survey (ages 92–93; M: 0.64, W: 0.52), whose participants were also older.

The Leiden 85-Plus (age 89) and BFC80+ (age 85), both study cohorts were younger than the SONICs, measured maximum gait speed, while the other studies (90+ Study, Danish 1905) measured usual gait speed. Maximum gait speed is generally faster than usual speed. Our findings for usual gait speed were faster than the maximum gait speeds in other studies, indicating a faster gait speed in the oldest Japanese individuals than the speed of the oldest old in the West.

Research suggests that differences in mobility, including walking speed, are attributed to differences in lifestyle [44, 45]. Usual gait speed is highly sensitive in predicting ADL disability in people over 75 years old [43]. In the case of this study, we assumed that the differences in walking speed from findings in other studies were attributable to those people in Japan who tend to live a tatami mat lifestyle.; Japanese houses still have tatami flooring. In Japan, some older adults live on tatami mats. If older adults live in a room with tatami flooring, they sit on the tatami for meals and sleep in the tatami lifestyle. They regularly stand up and sit down, strengthening their lower limbs daily. It is necessary to investigate whether similar cultural factors and lifestyle features affect ADLs and other measurements of functioning in the oldest old in different countries.

In addition, 90-year-olds are often unable to participate in surveys outside homes due to their declining physical functioning. We adopted a gait distance of 2.44 m in this study because we considered that the oldest old citizens who were in institutions or could not come to the survey site could measure their gaits at the places where they lived. Since the advent of the COVID-19 pandemic in 2020, group surveys conducted in large venues have been severely restricted to prevent infection, and such restrictions are expected to continue in the future. Because the SPPB can be measured at home or in a facility with limited space, we considered it appropriate for measuring physical function in 90-year-olds even during the pandemic.

Chair stand and chair height

The time in this study for the five-times sit-and-stand test (M: 15.9 and W: 16.3 × the upper limit of quartile 3 compared with the 90+ Study) was faster than those in the validity 90+ Study (M: 18.0, W: 20.0) for the same age group and those in the 90+ Study (M: 16.1, W: 16.7). In addition, the differences between men and women in the results of this study were small, and there was no statistically significant gender difference in the results in the 90-year-old reference value (3rd normal quintile). In addition, women in the SONIC study were faster in the 70-year-old age cohort.

It is possible that the chair’s height had an advantage for Japanese women, who are shorter than those in Europe and the United States and men. One of the factors that affect standing behavior is the height of the chair used for measurement [46]. In the SONIC study, a Japanese standard chair height of 40 cm was used for both men and women, and because women are shorter than men, a chair of the same height would make it relatively easier for women to stand up and harder for men. This relative ease of standing could be why there was little difference between men and women on the 5times chair stand test.

It is also possible that the differences in chair heights used between the studies in Europe, the United States, and Japan could have affected the rise times in this study in addition to the differences. However, we could not verify this proposition because researchers in the previous studies did not include the heights of the chairs used. Therefore, accurately comparing rising data internationally requires noting and considering differences in the chair heights used.

Chair standing itself as a screening criterion

In this study, the proportion of participants who failed or were unable to perform the five-times sit-and-stand test was the largest among the tests of physical performance assessment: 1.6% in the age 70 cohort, 4.7% in age 80, and about 20% in age 90. In other words, the percentage of 90-year-olds who were unable to perform the test five times was much higher than that of other age cohorts.

Furthermore, compared between PPMs, the proportion of 90-year-olds who could not complete the chair stand test failure (19.7%) was larger than those for hand grip strength (2.1%) and normal gait speed (6.7%) among the other tests. In addition, 20.9% of the subjects took longer than 20 s to perform the test (Table 1). This means that at the age of 90, the test of standing up from a chair proves to be remarkably difficult.

In the Danish cohort survey of nonagenarians, 61% of men and 50% of women were able to stand up without using their hands [19]; in other words, 39% of men and half of women could not stand up without using their hands. Similarly, in the 90+ Study, 32% of all participants failed to complete the five-times sit-and-stand test. These results indicate significantly higher proportions of people who have difficulty standing up without using their hands in the oldest old (90 years and older) than in younger old age groups.

Guralnik originally gave an SPPB score of 0 for failures to complete the five-times sit-stand and a score of 1 to the lowest 25% of those who completed the test [29]. Based on the results of this study and of previous studies, it is possible that about one-quarter of the oldest old are unable to rise from a chair without holding on to something, which corresponds to the lower approximately 25% of the total oldest old population to the criterion for 1 point on the original SPPB. Therefore, the five-times sit-to-stand test could be a screening test for lower limb muscle strength. Indeed, the AWGS, in its 2019 criteria for the diagnosis of sarcopenia, allows general practices and facilities to use the test if they do not have dual-energy X-ray absorption or bioelectrical impedance analysis [15].

Researchers in a study of older adults in Japan used a test called the Frail 10-Second Chair Stand Test to measure the number of times a person could stand up in 10 s without using their hands in a similar way to the SPPB and found that it was independently associated with quadriceps strength and TUG test score, which represents dynamic balance [47]. These findings suggest that the chair stand test can be used to assess overall lower extremity muscle strength even when used alone in the oldest old over 90 years of age (in Japan).

Study strengths and limitations

Study characteristics

The first distinguishing feature of this study is that we conducted it with older adults in Asia. Asian countries other than Japan are in the transition stage to aging societies, and the numbers of the oldest old are small and of little interest. Our study assessed the physical functional status of the oldest old within Japan specifically for the first time. The second distinguishing feature of our study is that we treat age 90 as a discrete group, not just as members of groups age 80 or 85 and above. As we have seen, there are currently no reference values for assessing the physical performance of nonagenarians. With this background, we included 90-year-olds as an independent age cohort.

The main strength of this study is that we had access to a large sample size of the oldest old over 90 years of age one country in Asia, where there are still few oldest old people; generally, there are fewer oldest old men than women, but rates in this study were similar: 48.8% for men and 51.2% for women. Furthermore, the results of this study can be easily compared with other studies because we assessed physical function using internationally standardized measurement methods. At present, there are no representative data for Asia’s oldest old, including in Japan, and which is why we believe our results can be used as the best current reference values for the oldest old people in Asian countries.

The main limitation of this study was selection bias. The subjects in this study were 90-year-olds who were not certified as needing long-term care who were otherwise eligible to participate. However, according to the 2015 census, approximately 50% of Japanese aged 90 and older are certified as needing long-term care in an institution [48]. Therefore, we assume that the participants in this study were relatively healthy community-dwelling 90-year-olds. The distribution of physical functions across all 90-year-olds can be estimated by adding assessments that include those certified as requiring long-term care. In addition, as mentioned in the discussion, we grouped the three populations from different participation years, so it is necessary to consider the rejuvenation phenomenon separately among these populations. Finally, due to the cross-sectional design of this study, it was impossible to make causal inferences about age or sex differences in PPMs.


This study analyzed sex-specific reference values and appraisal standards for five PPMs in nondisabled, community-dwelling, Japanese oldest old (age 90 and over). This study indicates that a rapid physical performance decline occurs in the lower limbs during the transition from age 80 to age 90. Furthermore, this study indicates that the SPPB and the 5times chair stand test are practical physical function assessment methods that can be utilized in Japanese oldest old. Although absolute physical performance varies among populations, the age and sex differences in PPMs could be standard across Asian countries. The characteristics of these results in PPMs can be broadly shared with Japan and Asian countries.

Availability of data and materials

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.



The Asian Working Group for Sarcopenia


Activities of Daily Living


Instrumental Activities of Daily Living


Genetics of Healthy Aging


Physical performance measures


Standard deviation


The Septuagenarains, Octogenarians, Nonagenarians Investigation with Centenarians study


The Short Physical Performance Battery


The Tokyo Metropolitan Institute of Gerontology Index of Competence


The Tokyo Metropolitan Institute of Gerontology-Longitudinal Interdisciplinary Study on Aging


The Timed up and go


  1. Covinsky KE, Palmer RM, Fortinsky RH, Counsell SR, Stewart AL, Kresevic D, et al. Loss of Independence in activities of daily living in older adults hospitalized with medical illnesses: increased vulnerability with age. J Am Geriatr Soc. 2003;51:451–8.

    Article  PubMed  Google Scholar 

  2. He W, Muenchrath MN. 90+ in the United States: 2006–2008 American Community Survey Reports; 2011.

    Google Scholar 

  3. Ouchi Y, Rakugi H, Arai H, Akishita M, Ito H, Toba K, et al. Redefining the elderly as aged 75 years and older: proposal from the joint Committee of Japan Gerontological Society and the Japan geriatrics society. Geriatr Gerontol Int. 2017;17(7):1045–7.

    Article  PubMed  Google Scholar 

  4. United Nations, Department of Economic and Social Affairs PD. World Population Ageing 2017 2017. Available from:

    Google Scholar 

  5. Paganini-Hill A, Kawas CH, Corrada MM. Lifestyle factors and dementia in the oldest-old: the 90 + study. Alzheimer Dis Assoc Disord. 2016;30(1):21–6.

    Article  PubMed  PubMed Central  Google Scholar 

  6. Jylhä M, Enroth L, Luukkaala T. Trends of functioning and health in nonagenarians: the vitality 90+ study. Annu Rev Gerontol Geriatr. 2013;33(1):313–32.

    Article  Google Scholar 

  7. Kjær AA, Siren A, Seestedt MH, Fridberg T, Casier F. Cohort profile: the Danish longitudinal study of ageing (DLSA). Int J Epidemiol. 2019;48(4):1050–1050G.

    Article  PubMed  Google Scholar 

  8. Skytthe A, Valensin S, Jeune B, Cevenini E, Balard F, Beekman M, et al. Design, recruitment, logistics, and data management of the GEHA (Genetics of healthy ageing) project. Exp Gerontol. 2011;46(11):934–45.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Hao Q, Dong B, Yang M, Dong B, Wei Y. Frailty and cognitive impairment in predicting mortality among oldest-old people. Front Aging Neurosci. 2018;10:295.

    Article  PubMed  PubMed Central  Google Scholar 

  10. Vaes B, Pasquet A, Wallemacq P, Rezzoug N, Mekouar H, Olivier P-A, et al. The BELFRAIL (BF C80 +) study: a population-based prospective cohort study of the very elderly in Belgium. BMC Geriatr. 2010;10:39 Available from:

    Article  PubMed  PubMed Central  Google Scholar 

  11. Bootsma-Van Der Wiel A, Van Exel E, De Craen AJM, Gussekloo J, Lagaay AM, Knook DL, et al. A high response is not essential to prevent selection bias: results from the Leiden 85-plus study. J Clin Epidemiol. 2002;55:1119–25.

    Article  PubMed  Google Scholar 

  12. Collerton J, Barrass K, Bond J, Eccles M, Jagger C, James O, et al. The Newcastle 85+ study: biological, clinical and psychosocial factors associated with healthy ageing: study protocol. BMC Geriatr. 2007;7:14.

    Article  PubMed  PubMed Central  Google Scholar 

  13. Cruz-Jentoft AJ, Bahat G, Bauer J, Boirie Y, Bruyère O, Cederholm T, et al. Sarcopenia: revised European consensus on definition and diagnosis. Age Ageing. 2019;48(1):16–31.

    Article  PubMed  Google Scholar 

  14. Chen LK, Liu LK, Woo J, Assantachai P, Auyeung TW, Bahyah KS, et al. Sarcopenia in Asia: consensus report of the Asian working group for sarcopenia. J Am Med Dir Assoc. 2014;15(2):95–101.

    Article  PubMed  Google Scholar 

  15. Chen L-K, Woo J, Assantachai P, Auyeung T-W, Chou M-Y, Iijima K, et al. Asian working group for sarcopenia: 2019 consensus update on sarcopenia diagnosis and treatment. J Am Med Dir Assoc. 2020;21(3):300–7.

    Article  PubMed  Google Scholar 

  16. Seino S, Shinkai S, Fujiwara Y, Obuchi S, Yoshida H, Hirano H, et al. Reference values and age and sex differences in physical performance measures for community-dwelling older Japanese: a pooled analysis of six cohort studies. PLoS One. 2014;9(6):e99487.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  17. Bullain SS, Corrada MM, Shah BA, Mozaffar FH, Panzenboeck M, Kawas CH. Poor physical performance and dementia in the oldest old: the 90+ study. Arch Neurol. 2013;70(1):107–13.

    Google Scholar 

  18. Tiainen K, Hurme M, Hervonen A, Luukkaala T, Jylhä M. Inflammatory markers and physical performance among nonagenarians. J Gerontol A Biol Sci Med Sci. 2010 Jun;65(6):658–63.

    Article  PubMed  CAS  Google Scholar 

  19. Nybo H, Gaist D, Jeune B, McGue M, Vaupel JW, Christensen K. Functional status and self-rated health in 2,262 nonagenarians: the Danish 1905 cohort survey. J Am Geriatr Soc. 2001;49(5):601–9.

    Article  CAS  PubMed  Google Scholar 

  20. Cevenini E, Cotichini R, Stazi MA, Toccaceli V, Palmas MG, Capri M, et al. Health status and 6 years survival of 552 90+ Italian sib-ships recruited within the EU project GEHA (GEnetics of healthy ageing). Age (Omaha). 2014;36(2):949–66.

    Article  CAS  Google Scholar 

  21. Passarino G, Montesanto A, De Rango F, Garasto S, Berardelli M, Domma F, et al. A cluster analysis to define human aging phenotypes. Biogerontology. 2007;8(3):283–90.

    Article  CAS  PubMed  Google Scholar 

  22. Cress ME, Gondo Y, Davey A, Anderson S, Kim SH, Poon LW. Assessing physical performance in centenarians: Norms and an extended scale from the Georgia centenarian study. Curr Gerontol Geriatr Res. 2010;2010:6.

    Google Scholar 

  23. Martin P, Gondo Y, Arai Y, Ishioka Y, Woodard JL, Poon LW, et al. Physical, sensory, and cognitive functioning among centenarians: a comparison between the Tokyo and Georgia centenarian studies. Qual Life Res. 2018;27(11):3037–46.

    Article  PubMed  Google Scholar 

  24. Kuh D, Karunananthan S, Bergman H, Cooper R. A life-course approach to healthy ageing: Maintaining physical capability. Proc Nutr Soc. 2014;73(2):237–48.

    Article  PubMed  PubMed Central  Google Scholar 

  25. Gondo Y, Masui Y, Kamide K, Ikebe K, Arai Y, Ishizaki T. SONIC Study, A Longitudinal Cohort Study of the Older People as Part of a Centenarian Study. In: Pachana NA, editor. Encyclopedia of Geropsychology. Singapore: Springer; 2017.

  26. Richardson P. The Timed & go. J Am Geriatr Soc. 1991;39(2):142–8.

    PubMed  Google Scholar 

  27. Cooper R, Kuh D, Cooper C, Gale CR, Lawlor DA, Matthews F, et al. Objective measures of physical capability and subsequent health: a systematic review. Age Ageing. 2011 Jan;40(1):14–23.

    Article  PubMed  Google Scholar 

  28. Fried LP, Tangen CM, Walston J, Newman AB, Hirsch C, Gottdiener J, et al. Frailty in older adults: evidence for a phenotype. Vol. 56, Journal of Gerontology: Medical sciences Copyright. 2001. Available from:

    Google Scholar 

  29. Guralnik JM, Simonsick EM, Ferrucci L, Glynn RJ, Berkman LF, Blazer DG, et al. A short physical performance battery assessing lower extremity function: association with self-reported disability and prediction of mortality and nursing home admission. J Gerontol Med Sci. 1994;49(2):85–94.

    Article  Google Scholar 

  30. de Fátima Ribeiro Silva C, Ohara DG, Matos AP, Pinto AC, Pegorari MS. Short physical performance battery as a measure of physical performance and mortality predictor in older adults: a comprehensive literature review. Int J Environ Res Public Health. 2021;18(20):10612.

    Article  PubMed  PubMed Central  Google Scholar 

  31. Guralnik JM, Ferrucci L, Pieper CF, Leveille SG, Markides KS, Ostir GV, et al. Lower extremity function and subsequent disability: consistency across studies, predictive models, and value of gait speed alone compared with the short physical performance battery. J Gerontol A Biol Sci Med Sci. 2000;55(4):221–31.

    Article  Google Scholar 

  32. Pavasini R, Guralnik J, Brown JC, di Bari M, Cesari M, Landi F, et al. Short physical performance battery and all-cause mortality: systematic review and meta-analysis. BMC Med. 2016;14(1):215.

    Article  PubMed  PubMed Central  Google Scholar 

  33. Wu Y, Wang W, Liu T, Zhang D. Association of Grip Strength with Risk of all-cause mortality, cardiovascular diseases, and Cancer in community-dwelling populations: a Meta-analysis of prospective cohort studies. J Am Med Dir Assoc. 2017;18(6):551.e17–35.

    Article  Google Scholar 

  34. Cruz-Jentoft AJ, Baeyens JP, Bauer JM, Boirie Y, Cederholm T, Landi F, et al. Sarcopenia: European consensus on definition and diagnosis. Age Ageing. 2010;39(4):412–23.

    Article  PubMed  PubMed Central  Google Scholar 

  35. Roberts HC, Denison HJ, Martin HJ, Patel HP, Syddall H, Cooper C, et al. A review of the measurement of grip strength in clinical and epidemiological studies: towards a standardised approach. Age Ageing. 2011;40:423–9.

    Article  PubMed  Google Scholar 

  36. Liu W, Unick J, Galik E, Resnick B. Barthel index of activities of daily living: item response theory analysis of ratings for long-term care residents. Nurs Res. 2015;64(2):88–99.

    Article  PubMed  Google Scholar 

  37. Koyano W, Shibata H, Nakazato K, Haga H, Suyama Y. Measurement of competence: reliability and validity of the TMIG Index of Competence. Arch Gerontol Geriatr. 1991;13(2):103–16 Available from: [cited 1 Jun 2012].

    Article  CAS  PubMed  Google Scholar 

  38. Makizako H, Hiroyuki S. The modified version of the short physical performance battery for community-dwelling Japanese older adults. Rigakuryohogaku. 2017;44(3):197–206.

    Google Scholar 

  39. Legrand D, Adriaensen W, Vaes B, Matheï C, Wallemacq P, Degryse J. The relationship between grip strength and muscle mass (MM), inflammatory biomarkers and physical performance in community-dwelling very old persons. Arch Gerontol Geriatr. 2013;57(3):345–51.

    Article  CAS  PubMed  Google Scholar 

  40. Ling CHY, Taekema D, De Craen AJM, Gussekloo J, Westendorp RGJ, Maier AB. Handgrip strength and mortality in the oldest old population: the Leiden 85-plus study. Can Med Assoc J. 2010;182(5):429–35.

    Article  Google Scholar 

  41. Granic A, Mendonça N, Sayer AA, Hill TR, Davies K, Adamson A, et al. Low protein intake, muscle strength and physical performance in the very old: the Newcastle 85+ study. Clin Nutr. 2018;37(6):2260–70.

    Article  PubMed  PubMed Central  Google Scholar 

  42. Kamide N, Kamiya R, Nakazono T, Ando M. Reference values for hand grip strength in Japanese community-dwelling elderly: a meta-analysis. Environ Health Prev Med. 2015;20(6):441–6.

    Article  PubMed  PubMed Central  Google Scholar 

  43. Shinkai S, Watanabe S, Kumagai S, Fujiwara Y, Amano H, Yoshida H, et al. Walking speed as a good predictor for the onset of functional dependence in a Japanese rural community population. Age Ageing. 2000;29(5):441–6.

    Article  CAS  PubMed  Google Scholar 

  44. Aoyagi K, Ross PD, Nevitt MC, Davis JW, Wasnich RD, Hayashi T, et al. Comparison of performance-based measures among native Japanese, Japanese-Americans in Hawaii and Caucasian women in the United States, ages 65 years and over: a cross-sectional study. BMC Geriatr. 2001;1:1–7.

    Article  Google Scholar 

  45. Yoshinaga S, Shiomitsu T, Kamohara M, Fujii Y, Chosa E, Tsuruta K. Lifestyle-related signs of locomotive syndrome in the general Japanese population: a cross-sectional study. J Orthop Sci. 2019;24(6):1105–9.

    Article  PubMed  Google Scholar 

  46. Kuo YL. The influence of chair seat height on the performance of community-dwelling older adults’ 30-second chair stand test. Aging Clin Exp Res. 2013;25(3):305–9.

    Article  PubMed  Google Scholar 

  47. Murata S, Otao H, Murata J, Horie J, Onituka M, Yokoyama T, et al. Relationship between the 10-second chair stand test (frail CS-10) and physical function among the frail elderly. Rigakuryoho Kagaku. 2010;25(3):431–5 Available from:

    Article  Google Scholar 

  48. Ministry of Health, Labour and Welfare (2015). Overview of Survey of Long-term Care Benefit Expenditures: Accessed 28 Aug 2021

    Google Scholar 

Download references


We are grateful to all SONIC participants who participated in these studies. We sincerely appreciate all staff involved in the SONIC study.


This study was supported in part by grants-in-aid from the Ministry of Education, Culture, Sports, Science, and Technology of Japan (Y.G.: 18 K18456, 17H02633, 26310104, 24653194, 21330152).

Author information

Authors and Affiliations



KM, SY, and YG: concept, designed, and developed the overall research plan. KM, YG, YM, SY, YY, KI, YA, MK, KK, HA and TI carried out data collection. KM and YG conducted analysis and KM,YY,and YG interpretation of data. KM, and YG wrote and had responsibility for the final contents of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Kiyoaki Matsumoto.

Ethics declarations

Ethics approval and consent to participate

The SONIC study was approved by the Institutional Review Board of Osaka University Graduate School of Medicine, Dentistry, and Human Sciences (Osaka, Japan) and the Tokyo Metropolitan Geriatric Hospital and Institute of Gerontology (Tokyo, Japan). This Research was performed in accordance with the Declaration of Helsinki. Informed consent was obtained from all study participants on site prior to starting the survey. Participation of this study was voluntary. Was voluntary. We explained the aims, methods, sources of funding, any possible conflicts of interest, institutional affiliations of the researcher, the anticipated benefits and potential risks of the study and the discomfort it may entail, post-study provisions. And we gave all participants that they had the right to refuse to participate in the study or to withdraw consent to participate at any time without reprisal. Participants have been informed of the research, and they have given their written consent. Their anonymity has been guaranteed.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Matsumoto, K., Gondo, Y., Masui, Y. et al. Physical performance reference values for Japanese oldest old: a SONIC study. BMC Geriatr 22, 748 (2022).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: