Skip to main content
  • Research article
  • Open access
  • Published:

The choice of self-rated health measures matter when predicting mortality: evidence from 10 years follow-up of the Australian longitudinal study of ageing



Self-rated health (SRH) measures with different wording and reference points are often used as equivalent health indicators in public health surveys estimating health outcomes such as healthy life expectancies and mortality for older adults. Whilst the robust relationship between SRH and mortality is well established, it is not known how comparable different SRH items are in their relationship to mortality over time. We used a dynamic evaluation model to investigate the sensitivity of time-varying SRH measures with different reference points to predict mortality in older adults over time.


We used seven waves of data from the Australian Longitudinal Study of Ageing (1992 to 2004; N = 1733, 52.6% males). Cox regression analysis was used to evaluate the relationship between three time-varying SRH measures (global, age-comparative and self-comparative reference point) with mortality in older adults (65+ years).


After accounting for other mortality risk factors, poor global SRH ratings increased mortality risk by 2.83 times compared to excellent ratings. In contrast, the mortality relationship with age-comparative and self-comparative SRH was moderated by age, revealing that these comparative SRH measures did not independently predict mortality for adults over 75 years of age in adjusted models.


We found that a global measure of SRH not referenced to age or self is the best predictor of mortality, and is the most reliable measure of self-perceived health for longitudinal research and population health estimates of healthy life expectancy in older adults. Findings emphasize that the SRH measures are not equivalent measures of health status.

Peer Review reports


Self-rated health (SRH) is a widely used measure for health status in public health and epidemiological research due to strong associations with other subjective and objective measures of well-being, health outcomes and mortality [e.g. [1]]. The multidimensional concept of health that is encapsulated within a single global SRH response is considered by World Health Organisation (WHO[2]) and the Euro-REVES 2 [3] project to be one of the best indicators of health at the individual and population level. Both of these organisations have extensively investigated the relationship between global SRH and health outcomes and recommended the measure to estimate policy relevant data on aspects of public health such as healthy life expectancy and mortality [4].

The most commonly used SRH measure has a global or current reference point (i.e. how would you rate your health in general/at the present time?). A comparative reference point is also often used to anchor the assessment, such as comparing current health to previous health (self-comparative), or same-aged peers (age-comparative). All these forms of the SRH item are in use in health surveys around the world as an indicator of older adults' years lived in good health [e.g. [3]]. Despite their extensive use, and the robust relationship between poor ratings of SRH and major health outcomes, there is scant research that has compared how these SRH measures perform in older adult populations; in particular it has not been established how the SRH measures compare in their relationship with mortality.

In the few studies that have compared the association between SRH items and mortality, mixed results have been reported. Manderbacka, Kareholt, Martikainen and Lundberg [5] found the predictive quality of global and age-comparative SRH items was dependent on gender, with the age-comparative item being a better predictor of mortality for males than females in simultaneous models. In contrast, Vuorisalmi, Lintonen and Jylhä [6] reported both an age-comparative and global comparative item was a stronger predictor of mortality for males than females. Even less attention has been paid to the self-comparative item, and the temporal reference point that is used in the few studies has been broad (i.e. ranging from 5 to 10 years previous). As the interval over which participants gauge their health expands, so does the possibility that these retrospective reports may be erroneous. For example, Bath [7] found that a global SRH item was more robust in predicting mortality than a self-comparative measure that asked older adults to compare their present health to health five years previously. In a study that compared a global, age-comparative and self-comparative (10 years previous) SRH items it was found that the predictive quality of the self-comparative item became less robust for males after longer follow up periods, whereas none of the SRH items were predictive of mortality in adjusted models for females after 3 and 7.5 year follow-ups [8].

The inconsistent findings regarding the predictive quality of the different SRH items, and the effect that gender may have on this SRH-mortality relationship, warrants further investigation. It is also not clear the extent that these SRH measures predict mortality independently of other potential associated changes, as not all studies have accounted for other known mortality risks, such as demographic variables, health and health behaviours. Furthermore, no study has accounted for the high correlation between the SRH measures by simultaneously modelling the three SRH measures together.

A further consideration is whether the SRH-mortality relationship is constant over time. There is a growing body of evidence that a dynamic evaluation of self-rated health, that is, taking into account the potential time-varying nature of SRH across time, may provide a more authentic depiction of the relationship between subjective health assessments and mortality [9]. Cross-sectional and longitudinal studies suggest SRH does not remain stable across the lifespan [1013]; therefore using a single occasion measurement of SRH to predict mortality may not take into account the biopsychosocial interactions of health and the lifespan health trajectory [14, 15]. The advantage of modelling the relationship between time-varying SRH ratings and mortality is that it enables the modelling of time-varying predictors. Previous studies have found that time-varying measures of Global SRH are a superior predictor of mortality compared to a single fixed measure [1518]. However, no study to date has investigated how dynamic changes in age-comparative or self-comparative SRH may relate to mortality risk.

The aim of this study is to fill current gaps within the literature regarding the sensitivity of time-varying SRH measures with different reference points to predict mortality in an older adult sample. The unique contributions of the study are (1) the dynamic evaluation of the relationship between three time-varying SRH measures (global, age-comparative and self-comparative) on mortality risk, (2) the identification of the unique impact of each SRH measure by controlling for time-dependent and time-varying measures of biopsychosocial factors known to increase mortality risk in older adults [1517], and (3) the comparison of the independent and concurrent relationship between the three SRH measures on mortality in order to account for the correlation of these measures and further determine their unique predictive quality. While the literature suggests gender differences in the relationship between SRH measures and mortality [e.g. [5, 7]], and age-group differences in SRH ratings [e.g. [13, 19]], it is not clear if men and women exhibit the same association between SRH and mortality at different ages. Therefore, interactions between SRH items and gender and age-groups are also investigated in order to more fully assess these dimensions of difference.



The Australian Longitudinal Study of Ageing (ALSA) has been fully described elsewhere [20]. In brief, households with residents over 70 years were identified from the South Australian Electoral Database. The sample was stratified by area, gender, and 5-year cohort groups (70-74, 75-79, 80-84, and ≥ 85) [21]. Males were over-sampled to ensure sufficient numbers in follow up. Of the 2,705 eligible residents 1,477 (55%) agreed to be interviewed. Spouses (>65 years) and co-residents (>70 years) of the sample were also asked to participate, which brought the total number of participants at baseline to 2,087 community dwelling and residential care individuals. The patterns of health care utilization in the final sample were found to be similar to that of the general Australian population [20].

Data collection began in 1992. Baseline and waves 3 (24 months from baseline), 6 (96 months) and 7 (120 months) data consisted of a comprehensive two-hour home interview including questions on demographic, health, medical, psychosocial, and physical status. Waves 2 (12 months from baseline), 4 (36 months), and 5 (60 months) were conducted via short telephone interviews and addressed changes in biopsychosocial factors since last measurement period. Data from all seven waves were included in the current study. Baseline ages ranged from 65 to 103 years of age (mean age = 78.14 years, SD = 6.68). In the final wave of data collected the remaining 489 participants were aged between 75 to 102 years (mean age = 84.94 years, SD = 4.90). Reasons for non-response at wave 7 were due to death (58.8% of the baseline participants), participants unable to be contacted (2.0%), participants who had moved out of scope of the study (2.3%), and those that refused to be interviewed (13.5%). Table 1 displays the descriptive characteristics for the final sample across all waves.

Table 1 Descriptive Statistics for all Waves, and Non-Survivors at Wave 1


Mortality status

Mortality status was established through searches of official death certificates conducted by the Epidemiological Branch of the Department of Health in South Australia and were confirmed by the South Australian Births, Deaths and Marriage bureau [22]. Matching the ALSA respondents to death information was conducted using full name, date of birth, and last known address. During the period from baseline to wave 7 1,228 deaths were recorded (70.9% of sample; 58.3% males). As expected there were significant differences between survivors and non-survivors on predictor variables and covariates. Compared to survivors, non-surviving respondents were more likely to have poorer global (χ2 (4) = 113.69, p < .001), age- (χ2 (2) = 51.29, p < .001), and self-comparative (χ2 (2) = 25.63, p < .001) SRH at baseline. They were also more likely to be male (χ2 (1) = 55.65, p < .001), be older at baseline (t(1731) = 22.49, p < .001), smoke (χ2 (2) = 12.54, p = .002), have an income less than $12 000 (χ2 (4) = 32.05, p < .001), less than 14 years of education (χ2 (1) = 8.58, p = .003), be living in an institution (χ2 (1) = 48.15, p < .001), and not have a partner (χ2 (1) = 39.49, p < .001). In addition non-survivors were in poorer health at baseline as they were more likely to have more medical conditions (t(1731) = 2.65, p = .008), be on more medications (t(1730) = 8.73, p < .001), have greater functional disability (ADL's - t(1731) = 6.29, p < .001; IADL's - t(1731) = 8.27, p < .001), poorer cognitive functioning (MMSE; t(1702) = -9.36, p < .001), and more depressive symptoms (CES-D; t(1715) = 6.90, p < .001). Table 1 displays the descriptive characteristics for the non-survivors at baseline and the full sample across all waves.

Self-rated health (SRH)

Global SRH was measured with the question "How would you rate your overall health at the present time?" (1-'excellent' to 5-'poor'). Age-comparative SRH was measured in response to the question "Would you say your health is better (1), about the same (2) or worse (3) than most people your age?" Self-comparative SRH was worded "Is your health now better (1), about the same (2) or not as good (3) as it was 12 months ago?" Global and self-comparative SRH was measured at all seven waves whilst the age-comparative item was measured at baseline, waves 3, 6, and 7. SRH ratings were reverse coded so that the highest score was equivalent to the most positive health rating.


Demographic variables included gender, age, community versus residential dwelling, partner status, annual income, and number of years of education. The education question asked participants how old they were when they left school, with possible responses ranging from 1 = never went to school, 2 = under fourteen years, 3 = fourteen years, 4 = fifteen years, to 7 = eighteen or more years. The education variable was dichotomised at the median age category to reflect ≤ 14 years versus ≥ 15 years education [20, 22].

Physical and functional health and medications

Participants were asked at baseline, wave 3, 6, and 7 if they had been diagnosed and were currently suffering from a heart condition, cancer, or diabetes. In addition participants were shown a prompt card that listed another 38 medical conditions including arthritis, diabetes, and gallstones, and asked to indicate if they suffered from these as well as list any other conditions they had. The total number of conditions currently suffered was summed to create a continuous aggregate score of number of conditions. At baseline, and waves 3 and 6 participants were asked to nominate, and show the interviewer the container, for prescribed and non-prescribed mediations they were currently taking. Number of medications was summed to reflect a continuous variable of total number of medications for each respondent.

Functional status was assessed using the Activities for Daily Living (ADL) and the Instrumental Activities of Daily Living (IADL) measures [23] at all seven waves. The ADL measures difficulties bathing, dressing, eating, using the toilet, and getting around or away from home. IADL questions include ten activities regarding housework, meal preparation, money management and the use of public transport. Scores are coded (0 = "no difficulty" and 1 = "difficulty") and summed so that higher scores indicated greater functional disability.

Smoking status

Smoking Status measured current and past smoking of cigarettes, pipe or cigars at baseline. The items were coded to reflect (1) current smoker, (2) ex-smoker and (3) never smoked.

Depressive symptoms

Depressive symptoms were measured at baseline and waves 3, 6 and 7, using the Centre for Epidemiology Depression Scale (CES-D), a 20-item questionnaire designed for use in community-based epidemiological studies [24]. A four-point scale was used to assess how an individual felt in the last week, with answers extending from rarely or none of the time (0) to most of the time (3). Summed scores ranged from 0 to 60 with a higher score indicative of more depressive symptoms. The scale had a high level of internal consistency with a Cronbach's alpha coefficient of .85.

Cognitive functioning

Cognitive functioning was measured at baseline and waves 3, 6 and 7, with the Mini-Mental State Examinations scale [MMSE: [25]]. The scale assesses orientation to place and time, attention and calculation, and memory recall [20]. The MMSE has been shown to have satisfactory reliability and construct validity and displays a high degree of sensitivity for moderate to severe cognitive impairment [26].

Statistical analysis

Participants with over 25% observable data missing (n = 354 (16.9%)) were removed from the data set [27] leaving a final sample of 1,733 (52.6% males). This criteria was used as Byrne [27] has shown that model and fit estimates are comparable between a complete data set and one with up to 25% data loss when a full information maximum likelihood imputation method is used. Participants removed from the data set were more likely to be male (χ2 (1) = 15.84, p < .001), older at baseline (t(2085) = 7.95, p < .001), have a greater number of problems with ADL's (t(2085) = 3.34, p = .001), and IADL's (t(2085) = 3.32, p = .001), and be taking more medications (t(2085) = -4.51, p < .001).

Of the final remaining sample 21.3% had < 5% missing data over the seven waves, 14.9% had 6 to 10% missing, 6.8% had 11 to 15% missing, 13.9% had 16 to 20% missing, and 43.1% had >21% missing. As expected, sensitivity analysis revealed that there was a .96 probability (area under the Receiver Operating Characteristic (ROC) curve: Standard Error = .005) that a randomly chosen participant in the final sample who had died over the follow up period would have a greater percentage of missing data compared to a randomly chosen survivor. The missing values for the remaining sample were imputed with the maximum likelihood approach of the Expectation Maximization (EM) algorithm method [28]. The EM method uses all available data and alternates the iterative algorithm between estimating missing values from observed responses and parameter estimates and maximises the likelihood for the subsequent full data [29].

Cox regression models were used to analyse the effect of time-varying predictors and covariates on mortality risk (Singer & Willet, 2003). Number of years from baseline interview until death or censorship was the measure for time used in the models. The Cox Regression is a partial likelihood method of estimation which takes into account the number and rank order of deaths in the sample. Because of a 'conditioning argument' [[30]; p.520] within the partial likelihood method no assumptions are made regarding the shape of the baseline hazard function, therefore only the effect of the predictors and covariates are evaluated. The great advantage to using this Cox Regression approach is that a model can be fitted regardless of the baseline hazard function complexity. Singer and Willet [30] describe the probability of the event (mortality) risk is modeled as:

where the time-invariant predictor or covariate in a model is represented by X 1i and the time-varying predictor or covariate is represented by X 2ij. h(t ij )/h 0(t j ) represents an individual's (i) mortality hazard ratio (HR) at time t j and is therefore a product of the baseline hazard function h 0, and the individuals true risk score at a given time (i.e. the antilog of each raw coefficient - β 1 X 1i + β 2 X 2ij ). To deal with the unbalanced data (i.e. not all variables are observed at every wave - see Table 1) we carried forward the most recent value of each time-varying predictor to the next wave if it was missing at that wave [30]. Singer and Willet argue that this approach is particularly appropriate to account for the shortfall of predictor information for categorical data when there are complex patterns of temporal variation of observations.

SRH items were treated as ordinal variables with the reference category designated as the most positive rating (i.e. 'excellent' for global; and 'better' for age-and self-comparative). Covariates were computed to reflect time-varying values over the observation periods with the exception of the time-invariant covariates (gender, education, income and smoking status). Income and smoking status were included as categorical variables and therefore baseline measures were used for ease of interpretation. For time-invariant covariates the HR represents the effect of a one-unit change in the related predictor on the raw hazard of mortality over the 10 years of follow-up. For the time-varying predictors and covariates, the HR represents the weighted average of short-term mortality risk across the 10 years follow-up (i.e. the mean of the risks from baseline to first measurement period plus second measurement to third measurement period and so on) [31]. For the categorical predictors the interpretation is essentially the same, except that the HR represents the difference in the risk of mortality compared to the reference group.

The addition of gender and age-group interaction terms into the ordinal models resulted in an additional 32 parameters. To ensure models remained parsimonious [32] SRH items were not identified as categorical in the separate interaction models allowing for an interaction effect to be identified. A significant interaction term resulted in adjusted ordinal models run by group to ascertain group differences.


Associations between SRH measures and mortality

Prior to the Cox regression models the relationship between the three SRH measures was investigated. As expected, correlations between the global, age-comparative and self-comparative SRH measures across the seven waves were all significant at p < .05. Correlations between global and age-comparative SRH were moderate, ranging from .271 (p < .000) at wave 7 to .471 (p < .001) at baseline. Similar correlations were found between global and self-comparative SRH, ranging from .278 (p < .001) at baseline to .475 (p < .001) at wave 4. Correlations were smaller between age-comparative and self-comparative SRH, ranging from .138 at wave 7 to .221 at baseline.

Table 2 shows the unadjusted associations between SRH items and mortality as well as the net effects of SRH items on mortality. 'Poor' global SRH increased the unadjusted risk of mortality by 4.71 times, compared to 'excellent' ratings (model 1). 'Worse' age-comparative SRH increased mortality risk by 2 times compared to 'better' ratings (model 2). 'Not as good' self-comparative ratings increased the mortality risk by 1.23 times compared to 'better' ratings (model 3).

Table 2 Hazard Ratios (and 95% Confidence Intervals) Comparing Time-Varying Self-Rated Health (SRH) Models Predicting Mortality (N = 1733)

When global SRH was placed in the same models as age-comparative (model 4) and self-comparative (model 5) SRH the relationship between 'worse' age-comparative and 'not as good' self-comparative ratings and mortality became non-significant. Model 6 revealed that 'worse' age-comparative and 'not as good' self-comparative ratings independently predicted mortality when placed in the same model. In model 7, after accounting for shared variance of all three SRH items, poor global SRH was revealed as the strongest independent predictor of mortality. These results confirm that the global SRH measure accounts for the relationships between the comparative SRH measures and mortality.

Table 3 shows the models adjusted for demographic and health risk factors. In the independent models, after accounting for other mortality risk factors, 'poor', 'fair' or 'good' global SRH ratings and 'worse' age-comparative ratings over time indicate a significant increase in mortality risk for older adults compared to the most positive ratings. In contrast, 'same' self-comparative ratings significantly reduced the mortality risk compared to 'better' over time. In the final full model, all three SRH items are entered to account for overlap between these measures. This model shows that the relationship between the three SRH measures and mortality remains relatively unchanged from the independent models in Table 3. The most notable difference is the reduction in hazard ratio from 3.37 to 2.83 for 'poor' global SRH. This suggests that a poor global rating reflects both age and self comparison processes to some degree.

Table 3 Hazard Ratios and 95% Confidence Intervals for Adjusted Self-Rated Health Models Predicting Mortality; the Australian Longitudinal Study of Ageing, 1992 - 2004 (N = 1733)

Gender and age by SRH item interactions

As described above separate models investigated gender and age interactions with SRH. The gender by SRH interaction term was not significant (HR = 0.99, 95% CI: 0.98, 1.01). However, a significant age by SRH interaction term was found (HR = 1.00, 95% CI: 1.00, 1.001). To investigate the interaction effect separate adjusted models for each age-group were conducted (see Table 4). Age-groups were categorised into young-old (65 to 74 years), old-old (75 to 84 years) and oldest-old (85+ years) at baseline, as defined in the gerontological literature [33].

Table 4 Hazard Ratios and 95% Confidence Intervals for Concurrent SRH Adjusted Age-Group Models Predicting Mortality; the Australian Longitudinal Study of Ageing, 1992 - 2004

The young-old age-group model revealed that "poor" global and "worse" age-comparative ratings significantly predicted mortality. For the old-old adults age-comparative and self-comparative SRH did not independently predict mortality. Similarly, only 'poor' global ratings were found to significantly predict mortality for the oldest-old age-group. Averaging 'same' self-comparative health resulted in a significant reduction in mortality risk for young-old and oldest-old adults compared to 'better' ratings over time.


Our results indicate that the three SRH items do not have comparable relationships with mortality. These results build on previous findings that SRH measures with different reference points are not interchangeable measures of subjective health for older adults [5, 34]. To our knowledge this is the first time the predictive nature of three commonly used SRH items have been compared over time in a dynamic evaluation model. This comparison revealed that, overall, global SRH was the strongest predictor of mortality when taking into account the time-varying nature of the ratings across time, whilst the weakest association was with the self-comparative item.

These findings are contrary to some previous studies that have found that an age-comparative SRH item is a more robust predictor of mortality than a global item for males in a similar age-group [5], or that the three SRH items had similar predictive qualities (55 to 85 year old sample) [8]. However for most previous studies the SRH-mortality associations have been modelled separately for gender [e.g. [58]]. The contrasting methodology that was used in the current study revealed non-significant interaction terms for gender in the models, indicating that the relationship between the different SRH measures and mortality was not significantly different for males and females,. Hence separate models for men and women were not justified.

Furthermore, our findings expand the literature because the comparison of the three SRH items was investigated through a dynamic model of SRH and mortality, using time-varying SRH ratings to predict mortality rather than ratings at a single point in time. By modelling the mortality hazard using time-varying predictors and covariates we assessed the cumulative short-term effects of SRH ratings on mortality over a 10 year follow-up period. While well-suited to the global SRH data, the findings indicate that this dynamic evaluation model may not be as suitable for investigating comparative SRH items' association with mortality. In support of this premise additional analysis (not shown here) compared the mortality risk for single measurement SRH ratings (baseline and most recent observation prior to death). These results revealed that the most recent observation of poor self-comparative SRH (rating health as "not as good" as 12 months prior) held a similar association with mortality risk (HR = 1.67; 95% CI = 1.38-2.01) as the most recent 'poor' global SRH rating (HR = 1.66; 95% CI = 1.18-2.34) after adjusting for all SRH measures and other mortality risk factors. In contrast, rating health as worse than others their own age did not significantly predict mortality in the long term (baseline model - HR = 1.14; 95% CI = 0.89-1.46), or on a shorter follow-up period (most-recent observation model - HR = 1.03; 95% CI = 0.83-1.28). These results tentatively suggest that age-comparative SRH is not a good indicator of mortality risk for older adults, whereas considering health to have declined in the past 12 months (self-comparative SRH) may be as good a short term indicator of mortality as poor global SRH.

The difference in predictive quality of these SRH measures is most likely due to the fundamental nature of anchoring the health evaluation to a particular reference point, such as peers or own past health. For example, the age-comparative item may enhance health assessments due to a self-protective 'social downgrading' process [35], whereas the forced temporal aspect of the self-comparative item elicits more negative ratings as it makes recent negative changes in health more salient [36]. Etiologically speaking, self-perceived temporal decline in health could be argued to be a good indicator of imminent mortality risk, as the further analysis above has demonstrated. However, with advanced health decline older adults may perceive that their health cannot get any worse, thus they may begin to rate their health as 'the same' as previous, placing an inherent limitation to the self-comparative item for providing unique mortality information over time. This limitation of the self-comparative SRH measure may also explain our seemingly counterintuitive findings that 'same' self-comparative ratings are protective of mortality risk compared to 'better' ratings. For example, an individual who rates their health as 'better' than the previous year could be reflecting on their experience of recent health issues that may subsequently increase mortality risk. The contrast between the proportion of the current sample who rated their health as "better than others their own age" and "not as good as twelve months ago", along with the small to medium correlations found between the SRH measures, supports the notion that the reference point invokes specific comparison processes which can bias health assessments [14, 37], making them less predictive of mortality over time.

Idler and Benyamini [14] and Jylhä [38] argue that the robust SRH-mortality relationship found in global SRH is most likely due to complex, dynamic human judgements that include contextual evaluation frameworks where past and current health is considered along with future health expectations. In the few studies that have compared the determinants of global and comparative SRH items, the global measure has been found to be the most inclusive measure of subjective health in terms of its associations with other factors of health [37, 39]. For example, Eriksson et al. [39] found that physical, functional and mental health, health behaviours, and psychosocial factors (such as social support), held significantly stronger associations with global SRH compared to an age-comparative measure. Similarly, the global measure has previously been shown to be the most comprehensive SRH item for the ALSA sample used here [40]. Taken together, these findings suggest that the strong association observed between global SRH and mortality is due to the global measure reflecting an all-encompassing evaluation of health compared to the other SRH items.

In the current study the utility of the SRH items to predict mortality was dependent on age. In particular, the two comparative measures did not provide unique information of mortality risk in adults over 75 years of age. It has been argued that the age-comparative item is not appropriate to use in older populations, or samples with a large age range, due to its sensitivity to age [6, 34, 38]. The current findings support this notion of age-sensitivity and extend it to the self-comparative item. Further research is needed to clarify whether these comparison effects extend to other age groups or are merely cohort effects.

Whilst the focus of the current study was on the impact of the varying SRH measures on mortality, and the majority of the covariate relationships with mortality are as expected, there are findings in the models that are note worthy. For example, we found a significant protective effect for number of medical conditions. We suggest that the counterintuitive relationship between number of medical conditions and mortality found here are a product of the large number of non-life threatening conditions that were included, such as cataracts, gout, hernia, ingrown toenails, and migraines. The conditions included in the aggregate variable were not weighted here for life-threat, as previous research has suggested that this does not necessarily improve model fit [41], however the combination of stage of disease and comorbidity was found by these authors to increase mortality risk. Together with the current findings it is suggested that future research is needed to ascertain the best way to measure and weight comorbidity in relation to mortality risk.

The major strength of our study is that the large amount of longitudinal data (up to seven waves spanning 12 years) allowed for comprehensive health models to be tested in a time-varying approach. Furthermore, the mortality relationship of each SRH item was established by directly comparing the items whilst accounting for shared variance with other SRH items and health covariates. A limitation of the data set is the unbalanced data collection (i.e. not all measures were observed at each wave) and the different modes of data collection (face-to-face versus telephone interview). Whilst, previous research has supported the reliability of telephone interviewing and the strong correlation between this mode with face-to-face interviewing in established samples [e.g. [42, 43]], the unbalanced data may require care in the interpretation of results. Singer and Willet [30] argue that the method of imputing the time-varying predictors when they are not observed, by carrying forward the most recent value (as was used here, see Statistical Analysis section), is most likely to result in a conservative estimate. It should also be noted that the small number of oldest-old adults at baseline remaining in the wave 6 and 7 samples could limit our findings, as small sample sizes may result in reduction of statistical power to detect significant effects [29]. Baseline selection effects of this age group must also be taken into account [44], along with possible sample attrition due to causes other than mortality. For example, the significant differences in characteristics of the excluded participants (due to missing data, see Statistical Analysis section above) suggest a possible bias as the oldest-old, males, and those with increased functional difficulties and number of medications were less likely to be included in the final sample. However, the finding that 'poor' global SRH predicted mortality for the oldest-old age group in the adjusted models, as did being male and having increased ADL's, suggests our results are more likely to be an underestimation of effects. Therefore the relationship between time-varying SRH and mortality risk may in fact be stronger than is indicated here.


In conclusion, global, age-comparative and self-comparative SRH items embody unique, age-sensitive, associations with mortality over time. Researchers should exercise caution when pooling or harmonising SRH items as they are not comparable measures of health. Future research investigating the potential for time-varying, and even change in, SRH measures to predict other major health outcomes, such as functional disability and health care utilisation, may extend the application of SRH items for indicators of population health.

The age sensitivity of the comparative SRH measures suggests they should be used with caution in older adult populations, particularly if used for predicting mortality. Furthermore, the usefulness of tracking age- and self-comparative measures to predict mortality is limited by the anchoring of the evaluation to the reference point. In contrast, 'poor' global is a robust predictor of mortality across age groups over time, indicating that this is the most reliable measure of self-perceived health for older adults.


  1. Johnson RJ, Wolinsky FD: The structure of health status among older adults: disease, functional limitation, and perceived health. J Health Soc Behav. 1993, 34: 105-121. 10.2307/2137238.

    Article  CAS  PubMed  Google Scholar 

  2. World Health Organisation, Statistics Netherland: Health interview surveys: Towards international harmonization of methods and instruments. 1996, Copenhagen: WHO Regional Office for Europe (WHO Regional Publications, European Series, no 58)

    Google Scholar 

  3. Robine J-M, Jagger C, Romieu I: Selection of a coherent set of health indicators for the European Union. Phase II: final report. 2002, Montpellier, France: Euro-REVES

    Google Scholar 

  4. Robine J-M, Jagger C, Euro-REVES 2 Group: Creating a coherent set of indicators to monitor health across Europe. Eur J Public Health. 2003, 13: 6-14. 10.1093/eurpub/13.suppl_1.6.

    Article  PubMed  Google Scholar 

  5. Manderbacka K, Kareholt I, Martikainen P, Lundberg O: The effect of point of reference on the association between self-rated health and mortality. Soc Sci Med. 2003, 56: 1447-1452. 10.1016/S0277-9536(02)00141-7.

    Article  PubMed  Google Scholar 

  6. Vuorisalmi M, Lintonen T, Jylha M: Global self-rated health data from a longitudinal study predicted mortality better than comparative self-rated health in old age. J Clin Epidemiol. 2005, 58: 680-687. 10.1016/j.jclinepi.2004.11.025.

    Article  PubMed  Google Scholar 

  7. Bath PA: Differences between older men and women in the self-rated health-mortality relationship. Gerontologist. 2003, 43: 387-

    Article  PubMed  Google Scholar 

  8. Deeg DJH, Kriegsman DMW: Concepts of self-rated health: Specifying the gender difference in mortality risk. Gerontologist. 2003, 43: 376-

    Article  PubMed  Google Scholar 

  9. Lyyra T-M, Leskinen E, Jylhä M, Heikkinen E: Self-rated health and mortality in older men and women. A time-dependent covariate analysis. Arch Gerontol Geriatr. 2009, 48: 14-18. 10.1016/j.archger.2007.09.004.

    Article  PubMed  Google Scholar 

  10. Franks P, Gold MR, Fiscella K: Sociodemographics, self-rated health, and mortality in the US. Soc Sci Med. 2003, 56: 2505-2514. 10.1016/S0277-9536(02)00281-2.

    Article  PubMed  Google Scholar 

  11. Harris JR, Pedersen NL, Stacey C, McClearn GE, Nesselroade JR: Age differences in the etiology of the relationship between life satisfaction and self-rated health. J Aging Health. 1992, 4: 349-368. 10.1177/089826439200400302.

    Article  Google Scholar 

  12. Svedberg P, Lichtenstein P, Pedersen NL: Age and sex differences in genetic and environmental factors for self-rated health: A twin study. J Gerontol. 2001, 56B: S171-

    Article  Google Scholar 

  13. McCullough ME, Laurenceau J-P: Gender and the natural history of self-rated health: A 59-year longitudinal study. Health Psychol. 2004, 23: 651-655. 10.1037/0278-6133.23.6.651.

    Article  PubMed  Google Scholar 

  14. Idler EL, Benyamini Y: Self-rated health and mortality: A review of twenty-seven community studies. J Health Soc Behav. 1997, 38: 21-10.2307/2955359.

    Article  CAS  PubMed  Google Scholar 

  15. Han B, Phillips C, Ferrucci L, Bandeen-Roche K, et al: Change in self-rated health and mortality among community-dwelling disabled older women. Gerontologist. 2005, 45: 216-

    Article  PubMed  Google Scholar 

  16. Strawbridge WJ, Wallhagen MI: Self-rated health and mortality over three decades: Results from a time-dependent covariate analysis. Res Aging. 1999, 21: 402-416. 10.1177/0164027599213003.

    Article  Google Scholar 

  17. Ferraro KF, Kelley-Moore JA: Self-rated health and mortality among black and white adults: Examining the dynamic evaluation thesis. J Gerontol. 2001, 56B: S195-

    Article  Google Scholar 

  18. Svardsudd K, Tibblin G: Is quality of life affecting survival? The study of men born in 1913. Scand J Prim Health Care. 1990, 1: 55-60.

    CAS  Google Scholar 

  19. Dening TR, Chi L-Y, Brayne C, Huppert FA, Paykel ES, O'Connor DW: Changes in self-rated health, disability and contact with services in a very elderly cohort: A 6-year follow-up study. Age Ageing. 1998, 27: 23-10.1093/ageing/27.1.23.

    Article  CAS  PubMed  Google Scholar 

  20. Andrews GR, Clark M, Luszcz MA: Successful aging in the Australian Longitudinal Study of Aging: applying the MacArthur model cross-nationally. J Soc Issues. 2002, 58: 749-765. 10.1111/1540-4560.00288.

    Article  Google Scholar 

  21. Giles LC, Metcalf PA, Glonek GF, Luszcz M, Andrews GR: The effects of social networks on disability in older Australians. J Aging Health. 2004, 16: 517-538. 10.1177/0898264304265778.

    Article  PubMed  Google Scholar 

  22. Giles LC, Glonek GF, Luszcz M, Andrews GR: Effect of social networks on 10 year survival in very old Australians: The Australian longitudinal study of aging. J Epidemiol Community Health. 2005, 59: 574-579. 10.1136/jech.2004.025429.

    Article  PubMed  PubMed Central  Google Scholar 

  23. Fillenbaum GG: Multidimensional functional assessment for older adults: The Duke Older Americans Resources and Services procedures. 1988, Hillsdale, New Jersey: L. Erlbaum Associates

    Google Scholar 

  24. Radloff L: The CES-D scale: A self-report depression scale for research in the general population. J App Psychol Aging. 1977, 3: 385-401.

    Google Scholar 

  25. Folstein MF, Folstein SE, McHugh PR: "Mini-mental state": A practical method for grading the cognitive state of patients for the clinician. J Psychiatr Res. 1975, 12: 189-198. 10.1016/0022-3956(75)90026-6.

    Article  CAS  PubMed  Google Scholar 

  26. Tombaugh TN, McIntyre NJ: The mini-mental state examination: A comprehensive review. J Am Geriatr Soc. 1992, 40: 922-935.

    Article  CAS  PubMed  Google Scholar 

  27. Byrne BM: Structural equation modeling with AMOS: Basic concepts, applications, and programming. 2001, Mahwah, New Jersey: Lawrence Erlbaum Associates

    Google Scholar 

  28. Schafer JL, Graham JW: Missing Data: Our View of the State of the Art. Psychol Methods. 2002, 7: 147-177. 10.1037/1082-989X.7.2.147.

    Article  PubMed  Google Scholar 

  29. Fitzmaurice GM, Laird NM, Ware JH: Applied Longitudinal Analysis. 2004, Hoboken, New Jersey: John Wiley & Sons Inc

    Google Scholar 

  30. Singer JD, Willett JB: Applied Longitudinal Data Analysis: Modeling Change and Event Occurrence. 2003, New York: Oxford University Press

    Book  Google Scholar 

  31. Dekker FW, de Mutsert R, van Diijk PC, Zoccali C, Jager K: Survival analysis: time-dependent effects and time-varying risk factors. Kidney Int. 2007, 74: 994-997. 10.1038/ki.2008.328.

    Article  Google Scholar 

  32. Cohen J, Cohen P, West SG, Aiken LS: Applied Multiple Regression/Correlation Analysis for the Behavioural Sciences. 2003, Mahwah, New Jersey: Lawrence Erlbaum Associates Inc, 3

    Google Scholar 

  33. Neugarten B: Age groups in American society and the rise of the young-old. Ann Am Acad Pol Soc Sci. 1976, 415: 187-198. 10.1177/000271627441500114.

    Article  Google Scholar 

  34. Vuorisalmi M, Lintonen T, Jylha M: Comparative vs global self-rated health: Associations with age and functional ability. Aging Clin Exp Res. 2006, 18: 211-

    Article  PubMed  Google Scholar 

  35. Suls J, Marco CA, Tobin S: The role of temporal comparison, social comparison, and direct appraisal in the elderly's self-evaluations of health. J App Soc Psychol. 1991, 21: 1125-1144. 10.1111/j.1559-1816.1991.tb00462.x.

    Article  Google Scholar 

  36. Klauer T, Ferring D, Filipp S: "Still stable after all this...?" Temporal comparison in coping with severe and chronic disease. Int J Behav Dev. 1998, 22: 339-355. 10.1080/016502598384405.

    Article  Google Scholar 

  37. Manderbacka K, Lundberg O: Examining points of reference of self-rated health among Swedish oldest old. Arch Gerontol Geriatr. 1996, 23: 47-60. 10.1016/0167-4943(96)00707-8.

    Article  CAS  PubMed  Google Scholar 

  38. Jylhä M: What is self-rated health and why does it predict mortality? Towards a unified conceptual model. Social Science & Medicine. 2009, 69: 307-316.

    Article  Google Scholar 

  39. Eriksson I, Unden A-L, Elofsson S: Self-rated health. Comparisons between three different measures. Results from a population study. Int J Epidemiol. 2001, 30: 326-333. 10.1093/ije/30.2.326.

    Article  CAS  PubMed  Google Scholar 

  40. Sargent-Cox KA, Anstey KJ, Luszcz M: Determinants of self-rated health items with different points of reference: Implications for health measurement of older adults. J Aging Health. 2008, 20: 739-761. 10.1177/0898264308321035.

    Article  PubMed  Google Scholar 

  41. Yancik R, Wesley MN, Ries LA, Havlik RJ, Long S, Edwards BK, Yatels JW: Comorbidity and age as predictors of risk for early mortality of male and female colon carcinoma. Cancer. 1998, 82: 2123-2134. 10.1002/(SICI)1097-0142(19980601)82:11<2123::AID-CNCR6>3.0.CO;2-W.

    Article  CAS  PubMed  Google Scholar 

  42. Fenig S, Levav I, Kohn R, Yelin N: Telephone vs face-to-face interviewing in a community psychiatric survey. Am J Public Health. 1993, 83: 896-898. 10.2105/AJPH.83.6.896.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  43. Rohde P, Lewinsohn P, Seeley M: Comparability of telephone and face-to-face interviews in assessing axis I and II disorders. Am J Psychiatry. 1997, 154: 1593-1598.

    Article  CAS  PubMed  Google Scholar 

  44. Hofer SM, Sliwinski MJ: Design and analysis of longitudinal studies of aging. Handbook of the Psychology of Aging. Edited by: Birren JE, Schaie KW. 2006, San Diego: Academic Press, 15-37. full_text.

    Chapter  Google Scholar 

Pre-publication history

Download references

Author information

Authors and Affiliations


Corresponding author

Correspondence to Kerry A Sargent-Cox.

Additional information

Competing interests

This study was funded by the South Australian Health Commission, the Australian Rotary Health Research Fund, the US National Institute of Health (Grant No. AG 08523-02) and the National Health and Medical Research Council (NHMRC; Grant No.229936). KJA is supported by NHMRC Fellowship No.366756.

Authors' contributions

Author KSC designed the current study, conducted the literature review, data analyses and interpretation of the data, and drafted the article. Author's KJA and MAL contributed to the draft and supervised the study's analytical strategy. All authors have read and approved the final manuscript.

Kerry A Sargent-Cox, Kaarin J Anstey and Mary A Luszcz contributed equally to this work.

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Sargent-Cox, K.A., Anstey, K.J. & Luszcz, M.A. The choice of self-rated health measures matter when predicting mortality: evidence from 10 years follow-up of the Australian longitudinal study of ageing. BMC Geriatr 10, 18 (2010).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: