Psychometric properties of four fear of falling rating scales in people with Parkinson’s disease
BMC Geriatrics volume 14, Article number: 66 (2014)
Fear of falling (FOF) is commonly experienced in people with Parkinson’s disease (PD). It is a predictor of recurrent falls, a barrier to physical exercise, and negatively associated with health-related quality of life. A variety of rating scales exist that assess different aspects of FOF but comprehensive head-to-head comparisons of their psychometric properties in people with PD are lacking. The aim of this study was to evaluate the psychometric properties of four FOF rating scales in people with PD. More specifically, we investigated and compared the scales’ data completeness, scaling assumptions, targeting, and reliability.
The FOF rating scales were: the Falls Efficacy Scale-International (FES-I), the Swedish FES (FES(S)), the Activities-specific Balance Confidence scale (ABC), and the modified Survey of Activities and Fear of Falling in the Elderly (mSAFFE). A postal survey was administered to 174 persons with PD. Responders received a second survey after two weeks.
The mean (SD) age and PD duration of the 102 responders were 73 (8) and 7 (6) years, respectively. ABC had worse data completeness than the other scales (6.9 vs. 0.9–1.3% missing data). All scales had corrected item-total correlations exceeding 0.4 and showed acceptable reliabilities (Cronbach’s alpha and Intraclass Correlation Coefficient (ICC) >0.80) but only FES-I had ICC >0.90. The standard errors of measurement ranged from 7% (FES-I) to 12% (FES(S)), and the smallest detectable differences ranged from 20% (FES-I) to 33% (FES(S)) of the total score ranges. ABC and FES(S) had substantially more outliers than mSAFFE and FES-I (10 and 15 vs. 3 and 4, respectively) when the two test occasions were compared.
When assessing FOF in people with PD, the findings in the present study favoured the choice of FES-I or mSAFFE. However, FES-I was the only scale with ICC >0.90 which has been suggested as a minimum when using a scale for individual comparisons.
Parkinson’s disease (PD) is a common neurodegenerative disorder that affects balance, and people with PD fall more often than age-matched healthy controls [1, 2]. Fear of falling (FOF) is commonly experienced [3, 4]; it is a predictor of recurrent falls, a barrier to physical exercise, and negatively associated with health-related quality of life. It is therefore important to detect and follow the progress of FOF in people with PD, and FOF should be considered a crucial endpoint for interventions [4, 7]. High quality rating scales assessing FOF are important in both clinical practice and research. When choosing a rating scale, one has to consider which aspects the scale should cover as well as its psychometric properties (e.g., data completeness, scaling assumptions, targeting, and reliability). Increased knowledge of the psychometric properties of FOF rating scales will facilitate the interpretation of data obtained from the scales.
A variety of rating scales exist that are said to assess different aspects of FOF. The Falls Efficacy Scale-International (FES-I) assesses concerns about falling and is recommended by the Prevention of Falls Network Europe (ProFaNE). FES-I was developed by combining and modifying items from three other scales: the original FES that assesses fall-related self-efficacy, the Activities-specific Balance Confidence scale (ABC) that assesses balance confidence, and the Survey of Activities and Fear of Falling in the Elderly (SAFE) that assesses activity level, FOF, and activity restriction. SAFE has later been modified into a shorter version (mSAFFE) that taps activity avoidance due to the risk of falling.
In a recent study, we compared the content validity of FES-I, the Swedish FES (FES(S)), ABC and mSAFFE by linking them to the International Classification of Functioning, Disability and Health. The linking process showed that all four scales mainly focus on FOF in relation to mobility. The ABC almost exclusively focuses on mobility, whereas the other rating scales cover a more diverse set of activities, such as self-care (FES-I, FES(S) and mSAFFE) and activities concerning community, social, and civic life (FES-I and mSAFFE).
As psychometric properties, such as validity and reliability, are sample dependent, specific studies are needed to determine the psychometric properties of FOF rating scales in PD. One previous Swedish study has assessed the psychometric properties of FES(S) and mSAFFE in PD with satisfactory results. Four studies have assessed the psychometric properties of ABC in PD [16–19]. However, three of the ABC studies have limited PD samples (n = 19 to 37) [16–18] and three are based on a limited set of psychometric analyses [16, 18, 19]. To our knowledge, no study has assessed the psychometric properties of FES-I in people with PD. Thus, a comprehensive head-to-head comparison of psychometric properties of FOF rating scales in people with PD is warranted and will facilitate choosing a FOF rating scale for clinical practice and research in PD.
The aim of this study was to evaluate the psychometric properties of FES-I, FES(S), ABC and mSAFFE in people with PD. More specifically, we investigated and compared the scales’ data completeness, scaling assumptions, targeting, and reliability.
This postal survey study was sent to 174 persons with PD. It included socio-demographic and disease-related questions, as well as four FOF rating scales which were administered twice (hereafter referred to as t1 and t2), two weeks apart.
Participants and sample size
Participants were recruited from two outpatient hospital clinics in southern Sweden and included individuals who had had a clinically confirmed PD diagnosis (ICD-10: G 20.9) for at least one year. Exclusion criteria were difficulties reading and writing Swedish, clinically confirmed Alzheimer’s disease, dementia, or cognitive or medical problems of a severity that was assumed to restrict giving informed consent or participating in the study. Moreover, individuals who were completely bedridden or wheelchair bound were excluded since most items in the FOF rating scales refer to walking ability. A PD-specialized nurse at each of the outpatient clinics and one of the authors (SBJ) screened the medical records of all PD patients who had visited the two clinics during the past 14 months (n = 275). Fifty-nine persons (39% female) were excluded based on the exclusion criteria. Their mean (SD) age and PD duration were 76 (8) and 10 (6) years, respectively. In addition, 42 persons did not meet the inclusion criterion of a PD diagnosis of at least one year. A total of 174 possible participants remained, which was considered the final sample.
To reach a ‘good sample size’ according to recommendations for methodological quality and test-retest reliability analysis, we aimed at 50 to 99 participants with FOF total scores at both t1 and t2. Based on previous postal surveys in people with PD [21, 22], we anticipated a response rate of approximately 65% at t1. Some additional dropouts were expected at t2, as well as some internal missing responses on the FOF rating scales.
All participants gave their written informed consent. The study was conducted in accordance with the Helsinki Declaration and was approved by the Regional Ethics Review Board in Lund, Sweden (Dnr 2013/118).
All 174 possible participants were mailed the following: information about the study, an informed consent form, socio-demographic and disease-related questions, the four FOF rating scales (FES-I, FES(S), ABC, and mSAFFE), and a pre-stamped return envelope. A reminder was sent after two weeks to non-responders. Responders received a second survey after about two weeks, and a reminder was sent one week later to non-responders.
The internal order of the FOF rating scales was altered to minimize the risk that the ordering affected data completeness. Four different arrangements were used so that the scales appeared an equal number of times as the first, second, third, and fourth scale. Although the order of scales was altered, the original order of items within the scales remained unchanged.
Socio-demographic and disease-related questions
Current mobility when completing the survey at t1 and t2 was self-rated as: good (i.e., parkinsonian “on” state), good but hyperkinetic, or bad (i.e., parkinsonian “off” state). The survey at t1 included demographic questions (e.g., PD duration and living arrangements), as well as single-item questions targeting self-rated PD severity (response options: mild, moderate, or severe), self-rated general health (scored 1–5; higher = worse, inspired by the general health question in the Short Form–36), activities of daily living (Parkinson’s disease Activities of Daily Living Scale; PADLS), and freezing of gait (item 3 of the self-administered version of the Freezing of Gait Questionnaire; FOGQsa). Both PADLS and FOGQsa have been shown to be valid and reliable in people with PD [24, 25]. An open-ended question targeted the presence of diseases or health-related problems other than PD. Dichotomous questions (Yes/No) targeted the following: dyskinesia; fluctuations with periods of increasing PD symptoms; FOF; activity avoidance due to the risk of falling; unsteadiness while walking; unsteadiness during turning in walking/standing; use of walking aid or personal support while walking indoors and outdoors, respectively; previous falls and/or near falls during the past six months. A fall was defined as “an event in which the respondent came to rest on the ground, floor, or lower level” (definition adopted from ProFaNE). A near fall was defined as “a fall initiated but arrested by support from a wall, railing, or other person, etc.”. Finally, participants were asked whether they had responded to the survey themselves (with or without assistance in reading and/or writing).
The four FOF rating scales
The FES-I assesses concerns about falling. Respondents answer how concerned they are about the possibility of falling in relation to 16 different activities. Response categories are: not at all, somewhat, fairly, or very concerned (scored 1 to 4, respectively). The total score ranges from 16 to 64 (higher = worse). The Swedish translated FES-I was used in this study.
The FES(S) assesses fall-related self-efficacy. Respondents answer how confident they are in performing 13 different activities without falling. Response categories range from 0 (not confident at all) to 10 (completely confident). The total score ranges from 0 to 130 (higher = better).
The ABC assesses balance confidence. Respondents answer how confident they are that they would not lose their balance or become unsteady when performing 16 different activities. In this study, a Swedish translated and culturally adapted version of the ABC was used. The cultural adaptation implies that items related to stepping onto or off escalators are changed to traveling by bus (L. Lundin-Olsson, unpublished material, written personal communication, June 20, 2012). Response categories range from 0 (no confidence) to 10 (completely confident). The total score is the mean value of the 16 items, transformed into percentage, i.e., ranges from 0 to 100% (higher = better).
The mSAFFE assesses activity avoidance due to the risk of falling in relation to 17 different activities. Response categories are: never, sometimes, or always avoid (scored 1 to 3, respectively). The total score ranges from 17 to 51 (higher = worse). The Swedish translated mSAFFE was used in this study (L. Lundin-Olsson, unpublished material, written personal communication, June 20, 2012).
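The scoring rules of the four scales can be illustrated with a minimal Python sketch. The function names and the example responses are hypothetical and serve only as an illustration; the study itself used SPSS. Missing item responses are represented as None, and a total score is returned only when every item has been answered.

```python
from typing import Optional, Sequence


def sum_score(items: Sequence[Optional[int]]) -> Optional[int]:
    """Summed total score (FES-I, FES(S), mSAFFE); None if any item is missing."""
    if any(value is None for value in items):
        return None
    return sum(items)


def abc_percent(items: Sequence[Optional[int]]) -> Optional[float]:
    """ABC total: mean of the 0-10 item scores expressed on a 0-100% range."""
    if any(value is None for value in items):
        return None
    return 10.0 * sum(items) / len(items)


# Hypothetical responses, for illustration only.
fes_i = [1] * 16                       # "not at all concerned" on all 16 items
msaffe = [3] * 17                      # "always avoid" on all 17 items
abc = [5] * 16                         # mid-range confidence on all 16 items
fes_s_incomplete = [10] * 12 + [None]  # one of 13 items left blank

print(sum_score(fes_i))             # 16, the minimum FES-I score
print(sum_score(msaffe))            # 51, the maximum mSAFFE score
print(abc_percent(abc))             # 50.0
print(sum_score(fes_s_incomplete))  # None, no total score without all items
```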
The analyses were performed using the IBM SPSS Statistics 21.0 software and were based on four parts: i) data completeness, ii) scaling assumptions, iii) targeting, and iv) reliability. Data completeness and reliability (except Cronbach’s alpha) were based on data from both t1 and t2. Scaling assumptions, targeting and Cronbach’s alpha were based on t1 data only. The relationships between the rating scales were determined by calculating the Pearson’s correlation coefficients (r) between the scales, based on t1 data.
Data completeness of the four rating scales was determined by calculating the percentage of missing data for items and total scores [15, 31]. No imputation was done, i.e., a total score required absence of any missing item responses.
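As an illustration of this analysis, the following Python sketch computes the percentage of missing responses per item and the number of respondents who obtain a total score. The data layout (one row per respondent, None for a missing response), the function names and the toy data are assumptions made for the example.

```python
from typing import List, Optional

# Rows are respondents, columns are items; None marks a missing response.
Responses = List[List[Optional[int]]]


def percent_missing_per_item(data: Responses) -> List[float]:
    """Percentage of respondents with a missing response, for each item."""
    n_respondents = len(data)
    n_items = len(data[0])
    return [
        100.0 * sum(row[j] is None for row in data) / n_respondents
        for j in range(n_items)
    ]


def n_with_total_score(data: Responses) -> int:
    """Number of respondents with no missing items (no imputation)."""
    return sum(all(value is not None for value in row) for row in data)


# Hypothetical three-item scale answered by four respondents.
toy = [
    [1, 2, 3],
    [2, None, 4],
    [1, 1, 1],
    [None, None, 2],
]
print(percent_missing_per_item(toy))  # [25.0, 50.0, 0.0]
print(n_with_total_score(toy))        # 2
```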
Scaling assumptions were explored to examine the legitimacy of summing item scores to generate total scale scores, according to a series of criteria [15, 31]. That is, mean scores, SDs, and distribution of item response option frequency should be roughly parallel across items. Also, corrected item-total correlations should exceed 0.4, indicating that items measure the same underlying construct and contain a similar proportion of information concerning FOF [15, 31].
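A corrected item-total correlation is the correlation between an item and the sum of the remaining items of the same scale. A minimal sketch in Python, assuming complete cases only and using hypothetical function names and toy data:

```python
import math
from typing import List, Sequence


def pearson_r(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation coefficient (assumes both inputs vary)."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def corrected_item_total(data: List[List[int]]) -> List[float]:
    """Correlate each item with the total score of all OTHER items."""
    n_items = len(data[0])
    correlations = []
    for j in range(n_items):
        item = [row[j] for row in data]
        rest = [sum(row) - row[j] for row in data]  # total minus the item itself
        correlations.append(pearson_r(item, rest))
    return correlations


# Hypothetical complete responses: four respondents, three items.
toy = [[1, 1, 2], [2, 2, 2], [3, 3, 4], [4, 3, 4]]
for number, r in enumerate(corrected_item_total(toy), start=1):
    print(f"item {number}: r = {r:.2f}, meets criterion: {r > 0.4}")
```

Removing the item from the total avoids inflating the correlation by correlating the item with itself.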
Targeting refers to whether the rating scales’ score distributions can adequately represent the true level of FOF in the sample. This was evaluated by studying the rating scales’ score distribution, skewness, and floor and ceiling effects. Mean total scores should be close to the scales’ midpoint, total scores should span the full range, skewness should be within ±1 [15, 32], and floor and ceiling effects (the percentage of respondents receiving the minimum and maximum possible scores, respectively) should not exceed 15–20% [15, 33].
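The targeting criteria can be sketched in Python as follows. The skewness estimator shown is the adjusted Fisher-Pearson coefficient; that this matches the estimator used in the study is an assumption, and the function names and example scores are hypothetical.

```python
import math
from typing import Sequence, Tuple


def floor_ceiling(scores: Sequence[float], min_score: float,
                  max_score: float) -> Tuple[float, float]:
    """Percentage of respondents at the minimum and maximum possible score."""
    n = len(scores)
    floor = 100.0 * sum(s == min_score for s in scores) / n
    ceiling = 100.0 * sum(s == max_score for s in scores) / n
    return floor, ceiling


def sample_skewness(scores: Sequence[float]) -> float:
    """Adjusted Fisher-Pearson skewness (assumed estimator; needs n >= 3)."""
    n = len(scores)
    mean = sum(scores) / n
    sd = math.sqrt(sum((s - mean) ** 2 for s in scores) / (n - 1))
    return n / ((n - 1) * (n - 2)) * sum(((s - mean) / sd) ** 3 for s in scores)


# Hypothetical FES-I total scores (possible range 16 to 64).
scores = [16, 16, 20, 28, 35, 40, 44, 52, 60, 64]
floor, ceiling = floor_ceiling(scores, 16, 64)
print(floor, ceiling)  # 20.0 10.0
print(abs(sample_skewness(scores)) < 1)  # criterion: skewness within +/-1
```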
Reliability is a measure of the random error associated with scale scores and the reproducibility of scores. This was assessed in several ways. The internal consistency was examined by means of Cronbach’s alpha. The test-retest reliability was studied in terms of one-way random, single measures Intraclass Correlation Coefficient (ICC1,1) with absolute agreement definition of concordance. Cronbach’s alpha and ICC >0.75 or >0.80 are considered acceptable for group level [35, 36], while ICC >0.90 has been suggested as a minimum when using scales for individual comparisons [36, 37]. The standard error of measurement (SEM) and the smallest detectable difference (SDD) were calculated for each scale. Due to differences in scoring ranges between the scales, SEM and SDD values were also expressed as percentages of the possible scoring ranges, to facilitate comparisons.
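The reliability indices can be sketched in Python as shown below. Cronbach’s alpha and the one-way random, single measures ICC follow their standard definitions. For SEM and SDD, the sketch assumes the conventional formulas SEM = SD × √(1 − ICC) and SDD = 1.96 × √2 × SEM; these are assumptions, not taken from the text above, although the second agrees with the reported ratio between SDD% and SEM% (approximately 2.8). All function names and input values are hypothetical.

```python
import math
from statistics import mean, variance
from typing import List, Sequence, Tuple


def cronbach_alpha(data: List[List[float]]) -> float:
    """Cronbach's alpha from complete item responses (rows = respondents)."""
    k = len(data[0])
    item_variances = [variance([row[j] for row in data]) for j in range(k)]
    total_variance = variance([sum(row) for row in data])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


def icc_1_1(t1: Sequence[float], t2: Sequence[float]) -> float:
    """One-way random, single measures ICC for two test occasions."""
    n, k = len(t1), 2
    grand_mean = mean(list(t1) + list(t2))
    subject_means = [(a + b) / 2 for a, b in zip(t1, t2)]
    # Between-subjects and within-subjects mean squares (one-way ANOVA).
    bms = k * sum((m - grand_mean) ** 2 for m in subject_means) / (n - 1)
    wms = sum((a - m) ** 2 + (b - m) ** 2
              for a, b, m in zip(t1, t2, subject_means)) / (n * (k - 1))
    return (bms - wms) / (bms + (k - 1) * wms)


def sem_and_sdd(sd: float, icc: float) -> Tuple[float, float]:
    """Assumed conventional formulas; not confirmed by the source text."""
    sem = sd * math.sqrt(1 - icc)
    sdd = 1.96 * math.sqrt(2) * sem
    return sem, sdd


# Hypothetical total scores at two occasions for five respondents.
t1 = [20, 31, 44, 52, 60]
t2 = [22, 30, 41, 55, 58]
print(round(icc_1_1(t1, t2), 3))

# Hypothetical SD and ICC, for illustration only.
sem, sdd = sem_and_sdd(sd=10.0, icc=0.91)
print(round(sem, 2), round(sdd, 2))
```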
The mean difference (đ) in scale scores between t1 and t2 and the 95% CI around đ were calculated. If the 95% CI includes 0, there are no systematic differences between t1 and t2. The number of outliers for each rating scale was calculated (an outlier was defined as a participant with a difference between t1 and t2 outside the first or third quartile ± 1.5 × interquartile range). Finally, test-retest data were plotted and visually inspected in the form of Bland-Altman graphs (the individual differences between t1 and t2 were plotted against the individual mean of t1 and t2). Since these graphs did not contribute any information beyond the numerical analyses, they are not presented here.
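A minimal Python sketch of these two checks is given below. Several details are assumptions made for the example: differences are taken as t2 minus t1, the 95% CI uses a normal approximation rather than the t distribution, and quartiles are computed with the "inclusive" method of the standard library, which may differ slightly from the method used by SPSS. Function names and data are hypothetical.

```python
import math
from statistics import NormalDist, mean, quantiles, stdev
from typing import List, Sequence, Tuple


def mean_difference_ci(t1: Sequence[float], t2: Sequence[float],
                       level: float = 0.95) -> Tuple[float, float, float]:
    """Mean of the individual differences (t2 - t1) with an approximate CI."""
    diffs = [b - a for a, b in zip(t1, t2)]
    d_bar = mean(diffs)
    standard_error = stdev(diffs) / math.sqrt(len(diffs))
    z = NormalDist().inv_cdf(0.5 + level / 2)  # normal approximation
    return d_bar, d_bar - z * standard_error, d_bar + z * standard_error


def find_outliers(diffs: Sequence[float]) -> List[float]:
    """Differences below Q1 - 1.5*IQR or above Q3 + 1.5*IQR."""
    q1, _, q3 = quantiles(diffs, n=4, method="inclusive")
    iqr = q3 - q1
    lower, upper = q1 - 1.5 * iqr, q3 + 1.5 * iqr
    return [d for d in diffs if d < lower or d > upper]


# Hypothetical total scores at t1 and t2.
t1 = [10, 12, 14, 16]
t2 = [11, 11, 15, 15]
d_bar, low, high = mean_difference_ci(t1, t2)
print(d_bar, low < 0 < high)  # a CI including 0 suggests no systematic change

# Hypothetical individual differences with one extreme value.
print(find_outliers([-2, -1, 0, 0, 1, 1, 2, 15]))  # [15]
```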
Of the 174 possible participants, 63 persons did not respond and 6 explicitly expressed that they did not want to participate; they (n = 69; 54% women) had a mean (SD) age of 77 (9) years. One hundred and five persons returned the first postal survey, but three surveys were not answered by the person with PD and were therefore excluded. This resulted in 102 included participants and a conservative response rate of 59%. Ninety-seven persons responded to the second survey (t2). Basic demographic data and participant characteristics are presented in Table 1. A majority (n = 60) of the participants stated that they had one or more diseases or health-related problems, apart from their PD. The most common problems were cardiovascular (n = 22) and musculoskeletal (n = 22). Current mobility at t1 was rated as good (i.e., parkinsonian “on” state) by 48 participants, good but hyperkinetic by 17, and bad (i.e., parkinsonian “off” state) by 35 participants (2 missing responses). Corresponding mobility ratings at t2 were: good (“on”) 52 participants, good but hyperkinetic 15, and bad (“off”) 23 participants (7 missing responses).
Relationship between the scales
The correlations (r) between the four FOF rating scales ranged from 0.80 to 0.93 (P < 0.001); the weakest correlation was found between mSAFFE and ABC and the strongest between mSAFFE and FES-I.
One of the 102 participants left FES-I completely blank and another person left both FES(S) and ABC blank. Four additional persons misunderstood the ABC: three persons responded by writing “X” instead of specifying a digit after the items, and the fourth person supplied double digits on each item, resulting in uninterpretable responses. The number of participants who obtained a total score at t1 was: ABC, n = 82; mSAFFE, n = 86; FES(S), n = 90; and FES-I, n = 92. The overall mean percentages of missing responses were: FES-I, 0.9%; FES(S), 1.0%; mSAFFE, 1.3%; and ABC, 6.9% (those who left the scales completely blank are not included in these numbers). The number of participants who obtained a total score at t2 was: ABC, n = 79; FES(S), n = 85; mSAFFE, n = 86; and FES-I, n = 90.
Item means and SDs, respectively, were roughly parallel for most items in each of the FOF scales. Some items of FES-I, ABC and mSAFFE had a larger proportion of participants that chose the worse response options, resulting in worse mean scores (i.e., more difficult items). These were: FES-I items 11 (Walk on slippery surface), 14 (Walk on uneven surface) and 15 (Walk up/down a slope), ABC items 6 (Stand on chair to reach) and 16 (Walk on icy sidewalks), and mSAFFE item 8 (Go out when it is slippery) (Tables 2, 3, 4, 5). A larger proportion of responders chose the best response option for FES-I item 3 (Preparing simple meals) and mSAFFE items 4 (Go to the doctor/dentist), 6 (Take a shower) and 12 (Walk around indoors) (data available on request). All four scales had corrected item-total correlations exceeding 0.4 (Tables 2, 3, 4, 5).
All four scales spanned almost the full range of possible scale scores and the scales’ mean scores were close to the scales’ midpoints (i.e., FES-I, 40; FES(S), 65; ABC, 50; mSAFFE, 34). Skewness was < ±1, and floor and ceiling effects were <20% for all four scales (Tables 2, 3, 4, 5).
The mean time between responses to the first and the second survey was 16.7 (SD 3.8, min-max 13–38) days. Reliability coefficients, SEM and SDD values for the four FOF scales are presented in Table 6. All scales had Cronbach’s alpha >0.90 and ICC >0.80, and one (FES-I) had ICC >0.90. The đ was close to 0 with CI including 0 for all four scales. There were 3 outliers in mSAFFE, 4 in FES-I, 10 in ABC, and 15 in FES(S).
This is the first comprehensive comparison of the psychometric properties of four commonly used FOF scales in people with PD. Our main findings were: ABC had markedly worse data completeness than the other scales, all scales showed acceptable reliability (Cronbach’s alpha and ICC >0.80) but only FES-I had ICC >0.90, and FES(S) and ABC had substantially more outliers than mSAFFE and FES-I when comparing t1 and t2.
Our sample consisted of more males than females, which is in agreement with prevalence studies of PD. The mean age and PD duration were 73 and 7 years, respectively, which correspond well with a previously reported mean age at symptom onset of 62 to 70 years. Our sample contained fewer fallers than previous studies [3, 44], whereas the prevalence of FOF in our sample (55%) was within a previously reported range (38–59%) [3, 4]. Self-reported PD severity ranged from mild to severe. The present sample thus seems fairly representative, although it needs to be noted that those with severe cognitive or medical problems were excluded.
Relationship between the scales
The four FOF rating scales correlated ≥0.80, which is not surprising since the content is similar. However, the scales are said to assess different aspects of FOF, i.e., concerns about falling, fall-related self-efficacy, balance confidence and activity avoidance due to the risk of falling [9, 11, 13, 30]. Previous studies have stated that these constructs are not interchangeable and that scale selection should be based on the specific construct of interest [3, 8]. Thus, more studies are needed to confirm the relationships between the different FOF scales.
ABC had a substantially higher proportion of missing data than the other scales (6.9 vs. 0.9–1.3%). Four persons completely misunderstood the ABC, implying that the instructions need to be clarified. It should, however, be noted that the Swedish version of the ABC was used in this study and the instructions might be perceived as more clear in the original ABC. To our knowledge, no previous study has presented data completeness for ABC in people with PD or other samples.
The percentage of missing data was highest (12.9%) for ABC items 14 and 15. These are the items that are culturally adapted in the Swedish version (changed from stepping on/off escalators into traveling by bus, L. Lundin-Olsson, written personal communication, June 20, 2012). The high proportion of missing data suggests that these items are difficult to understand or irrelevant to the participants. In fact, three participants had written supplementary comments, stating that they did not travel by bus. An additional 19 participants stated that they always avoided traveling by public transport according to item 15 of mSAFFE. While the original ABC includes instructions on how to respond to activities that the respondent does not engage in, the Swedish translated ABC does not. This might explain the high number of missing responses in these items. However, even if these two items are removed, missing data remains higher for ABC than for the other scales (6.0% vs. 0.9–1.3%).
While FES(S) items were roughly parallel, this was not the case for the other three scales. These findings are not unexpected since the various items within the scales are of different difficulty level. Moreover, items that were rated as more difficult by our sample have been rated as more difficult in previous PD studies as well as in older healthy populations [3, 9, 11, 17]. One could argue that some variation in item difficulty levels is preferable, since this results in a scale that is able to assess FOF in individuals with both low and high levels of FOF. Although classic test theory states that items within a scale should be “roughly parallel” to allow for a summed total score [15, 31], no guidelines exist that describe how rigid this judgement should be. Previous studies using the FOF scales studied here have, in fact, all used regular total scores [3, 9, 11, 13, 17, 18, 45].
All four scales seem fairly well targeted and met the criterion of floor and ceiling effects below 20%. FES(S) had 17.8% ceiling effect, which is higher than the other scales (4.9–10.5% floor/ceiling effect). A previous PD study found a lower ceiling effect of FES(S) (10.1–10.6%), but a higher floor effect of mSAFFE (18.3–19.4% vs. 10.5% in our study). No previous study has presented data on floor and ceiling effects on FES-I and ABC in people with PD.
All four scales had high internal consistency (Cronbach’s alpha >0.90) and acceptable test-retest reliability (ICC >0.80) [35, 36]. However, only FES-I had an ICC >0.90, which has been suggested as a minimum when using scales for individual comparisons [36, 37]. Our internal consistency results were consistent with previous reliability studies in PD [3, 16–19]. Test-retest reliability of FES(S) was lower than previously reported (ICC = 0.82 vs. ICC = 0.87). The situation was similar for mSAFFE (0.85 vs. 0.92). The ICC of ABC in the current study was in between the results of the two previous ABC studies that assessed test-retest reliability (0.86 vs. 0.79 and 0.94) [17, 18]. Such differences are to be expected, as psychometric properties are sample dependent.
SEM% for the four rating scales varied from 7 to 12%. This implies that a change in a mean score greater than 7 to 12% of the possible scoring range indicates a “real” change (above measurement error), when assessing FOF for a group of people with PD. SDD% were 20 to 33%, indicating that the smallest change in an individual’s FOF score that can be interpreted as a “real” change (above measurement error) should exceed 20 to 33% of the possible scoring ranges. FES-I had the lowest SEM% and SDD%, where a difference of at least 4 and 10 points indicated a “real” change on a group and individual level, respectively.
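The point values quoted for FES-I follow from its possible scoring range of 16 to 64, i.e., 48 points. The short Python sketch below shows the conversion using the rounded percentages reported above; the helper name is hypothetical, and rounding up to whole points is an assumption about how the quoted values were obtained.

```python
import math


def percent_to_points(percent: float, min_score: float, max_score: float) -> float:
    """Convert a percentage of the possible scoring range into scale points."""
    return percent / 100.0 * (max_score - min_score)


# FES-I: possible range 16 to 64; SEM% = 7 and SDD% = 20 as reported.
sem_points = percent_to_points(7, 16, 64)
sdd_points = percent_to_points(20, 16, 64)

# Rounding up gives the smallest whole-point change above the threshold.
print(round(sem_points, 2), math.ceil(sem_points))  # 3.36 4
print(round(sdd_points, 2), math.ceil(sdd_points))  # 9.6 10
```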
There is a variety of FOF scales, and it needs to be acknowledged that this psychometric comparison is not fully comprehensive since only four Swedish translated scales were included. We selected FES-I because it is recommended by ProFaNE, and its forerunners (or adaptations of them) since they are commonly used. More studies of other FOF rating scales are needed, as well as cross-national comparisons, to establish which of the available FOF rating scales is the best.
The postal survey study design means that all scales were self-administered, and it needs to be underlined that the present findings may not apply if the scales are administered as an interview. Furthermore, the study design does not enable us to determine either the responsiveness of the FOF scales or the minimal important differences. However, it has been argued that SEM is a reasonable approximation of the minimal important difference.
All four FOF scales showed acceptable internal consistency and test-retest reliability. ABC revealed insufficiencies in terms of data completeness, and ABC and FES(S) had many outliers when comparing t1 and t2. When assessing FOF in people with PD, the findings in the present study favoured the choice of FES-I or mSAFFE. However, FES-I was the only scale with ICC >0.9, which has been suggested when using a scale for individual comparisons.
Suarez H, Geisinger D, Suarez A, Carrera X, Buzo R, Amorin I: Postural control and sensory perception in patients with Parkinson’s disease. Acta Otolaryngol. 2009, 129: 354-360. 10.1080/00016480802495446.
Mak MK, Pang MY: Parkinsonian single fallers versus recurrent fallers: different fall characteristics and clinical features. J Neurol. 2010, 257: 1543-1551. 10.1007/s00415-010-5573-9.
Nilsson MH, Drake AM, Hagell P: Assessment of fall-related self-efficacy and activity avoidance in people with Parkinson’s disease. BMC Geriatr. 2010, 10: 78. 10.1186/1471-2318-10-78.
Grimbergen YA, Schrag A, Mazibrada G, Borm GF, Bloem BR: Impact of falls and fear of falling on health-related quality of life in patients with Parkinson’s disease. J Parkinsons Dis. 2013, 3: 409-413.
Mak MK, Pang MY: Fear of falling is independently associated with recurrent falls in patients with Parkinson’s disease: a 1-year prospective study. J Neurol. 2009, 256: 1689-1695. 10.1007/s00415-009-5184-5.
Ellis T, Boudreau JK, DeAngelis TR, Brown LE, Cavanaugh JT, Earhart GM, Ford MP, Foreman KB, Dibble LE: Barriers to exercise in people with Parkinson disease. Phys Ther. 2013, 93: 628-636. 10.2522/ptj.20120279.
Nilsson MH, Hariz GM, Iwarsson S, Hagell P: Walking ability is a major contributor to fear of falling in people with Parkinson’s disease: implications for rehabilitation. Parkinsons Dis. 2012, 2012: 713236.
Moore DS, Ellis R: Measurement of fall-related psychological constructs among independent-living older adults: a review of the research literature. Aging Ment Health. 2008, 12: 684-699. 10.1080/13607860802148855.
Yardley L, Beyer N, Hauer K, Kempen G, Piot-Ziegler C, Todd C: Development and initial validation of the Falls Efficacy Scale-International (FES-I). Age Ageing. 2005, 34: 614-619. 10.1093/ageing/afi196.
Tinetti ME, Richman D, Powell L: Falls efficacy as a measure of fear of falling. J Gerontol. 1990, 45: P239-P243. 10.1093/geronj/45.6.P239.
Powell LE, Myers AM: The activities-specific balance confidence (ABC) scale. J Gerontol A Biol Sci Med Sci. 1995, 50A: M28-M34. 10.1093/gerona/50A.1.M28.
Lachman ME, Howland J, Tennstedt S, Jette A, Assmann S, Peterson EW: Fear of falling and activity restriction: the survey of activities and fear of falling in the elderly (SAFE). J Gerontol B Psychol Sci Soc Sci. 1998, 53: P43-P50.
Yardley L, Smith H: A prospective study of the relationship between feared consequences of falling and avoidance of activity in community-living older people. Gerontologist. 2002, 42: 17-23. 10.1093/geront/42.1.17.
Bladh S, Nilsson MH, Carlsson G, Lexell J: Content analysis of 4 fear of falling rating scales by linking to the international classification of functioning, disability and health. PM R. 2013, 5: 573-582. 10.1016/j.pmrj.2013.01.006. e571
Hobart J, Cano S: Improving the evaluation of therapeutic interventions in multiple sclerosis: the role of new psychometric methods. Health Technol Assess. 2009, 13: 1-177.
Peretz C, Herman T, Hausdorff JM, Giladi N: Assessing fear of falling: can a short version of the activities-specific balance confidence scale be useful?. Mov Disord. 2006, 21: 2101-2105. 10.1002/mds.21113.
Dal Bello-Haas V, Klassen L, Sheppard MS, Metcalfe A: Psychometric properties of activity, self-efficacy, and quality-of-life measures in individuals with Parkinson disease. Physiother Can. 2011, 63: 47-57. 10.3138/ptc.2009-08.
Steffen T, Seney M: Test-retest reliability and minimal detectable change on balance and ambulation tests, the 36-item short-form health survey, and the unified Parkinson disease rating scale in people with parkinsonism. Phys Ther. 2008, 88: 733-746. 10.2522/ptj.20070214.
Lohnes CA, Earhart GM: External validation of abbreviated versions of the activities-specific balance confidence scale in Parkinson’s disease. Mov Disord. 2010, 25: 485-489. 10.1002/mds.22924.
Terwee CB, Mokkink LB, Knol DL, Ostelo RW, Bouter LM, de Vet HC: Rating the methodological quality in systematic reviews of studies on measurement properties: a scoring system for the COSMIN checklist. Qual Life Res. 2012, 21: 651-657. 10.1007/s11136-011-9960-1.
Bladh S, Nilsson MH, Hariz GM, Westergren A, Hobart J, Hagell P: Psychometric performance of a generic walking scale (Walk-12G) in multiple sclerosis and Parkinson’s disease. J Neurol. 2012, 259: 729-738. 10.1007/s00415-011-6254-z.
Nilsson MH, Bladh S, Hagell P: Fatigue in Parkinson’s disease: measurement properties of a generic and a condition-specific rating scale. J Pain Symptom Manage. 2013, 46: 737-746. 10.1016/j.jpainsymman.2012.11.004.
Ware JE, Sherbourne CD: The MOS 36-item short-form health survey (SF-36) I. Conceptual framework and item selection. Med Care. 1992, 30: 473-483. 10.1097/00005650-199206000-00002.
Hobson JP, Edwards NI, Meara RJ: The Parkinson’s disease activities of daily living scale: a new simple and brief subjective measure of disability in Parkinson’s disease. Clin Rehabil. 2001, 15: 241-246. 10.1191/026921501666767060.
Nilsson MH, Hariz GM, Wictorin K, Miller M, Forsgren L, Hagell P: Development and testing of a self administered version of the freezing of gait questionnaire. BMC Neurol. 2010, 10: 85. 10.1186/1471-2377-10-85.
Giladi N, Shabtai H, Simon ES, Biran S, Tal J, Korczyn AD: Construction of freezing of gait questionnaire for patients with Parkinsonism. Parkinsonism Relat Disord. 2000, 6: 165-170. 10.1016/S1353-8020(99)00062-0.
Lamb SE, Jorstad-Stein EC, Hauer K, Becker C: Development of a common outcome data set for fall injury prevention trials: the prevention of falls network Europe consensus. J Am Geriatr Soc. 2005, 53: 1618-1622. 10.1111/j.1532-5415.2005.53455.x.
Gray P, Hildebrand K: Fall risk factors in Parkinson’s disease. J Neurosci Nurs. 2000, 32: 222-228. 10.1097/01376517-200008000-00006.
Nordell E, Andreasson M, Gall K, Thorngren K-G: Evaluating the Swedish version of the falls efficacy scale-international (FES-I). Adv Physiother. 2009, 11: 81-87. 10.1080/14038190802318986.
Hellstrom K, Lindmark B: Fear of falling in patients with stroke: a reliability study. Clin Rehabil. 1999, 13: 509-517. 10.1191/026921599677784567.
Ware JE, Gandek B: Methods for testing data quality, scaling assumptions, and reliability: the IQOLA Project approach. International Quality of Life Assessment. J Clin Epidemiol. 1998, 51: 945-952. 10.1016/S0895-4356(98)00085-7.
Hobart JC, Riazi A, Lamping DL, Fitzpatrick R, Thompson AJ: Improving the evaluation of therapeutic interventions in multiple sclerosis: development of a patient-based measure of outcome. Health Technol Assess. 2004, 8: iii, 1-48.
McHorney CA, Tarlov AR: Individual-patient monitoring in clinical practice: are available health status surveys adequate?. Qual Life Res. 1995, 4: 293-307. 10.1007/BF01593882.
Schuck P: Assessing reproducibility for interval data in health-related quality of life questionnaires: which coefficient should be used?. Qual Life Res. 2004, 13: 571-586.
Shrout PE, Fleiss JL: Intraclass correlations: uses in assessing rater reliability. Psychol Bull. 1979, 86: 420-428.
Nunnally JC, Bernstein IH: Psychometric theory. 1994, New York: McGraw-Hill, 3rd edition.
Scientific Advisory Committee of the Medical Outcomes Trust: Assessing health status and quality-of-life instruments: attributes and review criteria. Qual Life Res. 2002, 11: 193-205. 10.1023/A:1015291021312.
Streiner DL, Norman GR: Health measurement scales: a practical guide to their development and use. 2008, Oxford; New York: Oxford University Press, 4th edition.
Terwee CB, Bot SD, de Boer MR, van der Windt DA, Knol DL, Dekker J, Bouter LM, de Vet HC: Quality criteria were proposed for measurement properties of health status questionnaires. J Clin Epidemiol. 2007, 60: 34-42. 10.1016/j.jclinepi.2006.03.012.
Lexell JE, Downham DY: How to assess the reliability of measurements in rehabilitation. Am J Phys Med Rehabil. 2005, 84: 719-723. 10.1097/01.phm.0000176452.17771.20.
Norman GR, Streiner DL: Biostatistics: the bare essentials. 2008, Shelton, Conn: People’s Medical Pub. House, 3rd edition.
Wirdefeldt K, Adami HO, Cole P, Trichopoulos D, Mandel J: Epidemiology and etiology of Parkinson’s disease: a review of the evidence. Eur J Epidemiol. 2011, 26 (Suppl 1): S1-S58.
Muangpaisan W, Mathews A, Hori H, Seidel D: A systematic review of the worldwide prevalence and incidence of Parkinson’s disease. J Med Assoc Thai. 2011, 94: 749-755.
Bloem BR, Hausdorff JM, Visser JE, Giladi N: Falls and freezing of gait in Parkinson’s disease: a review of two interconnected, episodic phenomena. Mov Disord. 2004, 19: 871-884. 10.1002/mds.20115.
Allen NE, Canning CG, Sherrington C, Lord SR, Latt MD, Close JC, O’Rourke SD, Murray SM, Fung VS: The effects of an exercise program on fall risk factors in people with Parkinson’s disease: a randomized controlled trial. Mov Disord. 2010, 25: 1217-1225. 10.1002/mds.23082.
King MT: A point of minimal important difference (MID): a critique of terminology and methods. Expert Rev Pharmacoecon Outcomes Res. 2011, 11: 171-184. 10.1586/erp.11.9.
The pre-publication history for this paper can be accessed here: http://www.biomedcentral.com/1471-2318/14/66/prepub
The authors wish to thank the participants for their cooperation, and Jeanette Härnberg, RN and Mia Olsson, RN for assistance with selection of possible participants. The study was accomplished within the Strategic Research Area MultiPark and the Centre for Ageing and Supportive Environments (CASE) at Lund University, Lund, Sweden. MHN is affiliated to the Swedish Parkinson Academy.
The authors declare that they have no competing interest.
SBJ, MHN and JL conceived and designed the study. SBJ performed data collection, analysed the data and drafted the initial manuscript. All authors participated in writing (and approved) the final version of the manuscript.