Inter-rater Reliability of the K-GMFM-88 and the GMPM for Children with Cerebral Palsy

Article information

Ann Rehabil Med. 2012;36(2):233-239
Publication date (electronic) : 2012 April 30
doi : https://doi.org/10.5535/arm.2012.36.2.233
Department of Rehabilitation Medicine, CHA Bundang Medical Center, CHA University, Seongnam 463-712, Korea.
Corresponding author: Minyoung Kim. Department of Rehabilitation Medicine, CHA Bundang Medical Center, CHA University, 351, Yatap-dong, Bundang-gu, Seongnam 463-712, Korea. Tel: +82-31-780-6281, Fax: +82-31-780-6206, kmin@cha.ac.kr
Received 2011 May 16; Accepted 2011 December 06.

Abstract

Objective

To examine the inter-rater reliability of the Korean version of the Gross Motor Function Measure (K-GMFM-88) and the Gross Motor Performance Measure (GMPM) based on video clips.

Method

The sample comprised 39 children (28 boys and 11 girls; mean age, 3.50±1.23 years) with cerebral palsy (CP). Two pediatric physical therapists assessed the children based on video recordings.

Results

For the K-GMFM-88, the intraclass correlation coefficient (ICC(3,1)) ranged from .978 to .995, and Spearman's correlation coefficient ranged from .916 to .997. For the GMPM, ICC(3,1) ranged from .863 to .929, and Spearman's correlation coefficient ranged from .812 to .885. When the children were stratified by functional level on the gross motor function classification system (GMFCS I-II vs. III-V), the ICCs were .982 and .994, respectively, for the K-GMFM-88 total score and .815 and .913, respectively, for the GMPM total score. There were good or high correlations between the subscales of the two measures (r=.762-.884).

Conclusion

The K-GMFM-88 and the GMPM are reliable tools for assessing the motor function of children with CP. The two measures are highly correlated, which further supports their reliability. Thus, it is advisable to use the K-GMFM-88 and the GMPM to assess gross motor function in children with CP.

INTRODUCTION

Cerebral palsy (CP) is a group of disorders affecting the development of movement and posture and causing activity limitations that are attributable to non-progressive disturbances in the developing fetal or infant brain.1 Among the functional domains impaired in children with CP, gross motor function matters the most for their general activities of daily life. Gross motor function in children with CP has been conceptualized as having two main features: function and performance.2 "Function" refers to the ability to accomplish a certain motor activity and does not necessarily imply quality of motor control. "Performance", on the other hand, refers to the quality of motor activity, or how well the child performs a certain activity. For example, the statement that a child can stand independently for 10 seconds describes the child's gross motor "function", whereas a description of "performance" refers to the degree of stability while he or she is standing.3

Previous studies attempting to establish the effectiveness of treatment modalities for children with CP have been limited by a lack of valid and reliable assessment tools capable of quantifying functional changes following interventions.4 To address this deficiency, the Gross Motor Function Measures Group, a group of researchers and therapists in Ontario, developed two assessment instruments to measure subtle but meaningful changes in the motor function and performance of children with CP. The resulting instruments, the gross motor function measure (GMFM) and the gross motor performance measure (GMPM), were designed to be used together.5 The reliability of the GMFM and the GMPM has been documented in the West.6-8 Russell et al.4 reported that the inter-rater reliability of the GMFM ranged from .87 to .99 across the five dimensions and was .99 for the total score. Thomas et al.9 found that the inter-observer reliability of the GMPM ranged from .78 to .86 for the five attributes.

To the authors' knowledge, no study has determined the reliability of the GMFM and the GMPM for Korean children with CP. When an observational assessment tool is used to measure clinical outcomes, it is important to establish the reliability of that tool.10 Several types of reliability tests are necessary for determining the stability, consistency, and dependability of scores for a specific instrument, of which inter-rater, intra-rater, and test-retest reliability are the most basic. Although all types of reliability are important, this study focused on determining the inter-rater reliability of the K-GMFM-88 and the GMPM.

MATERIALS AND METHODS

The sample comprised 39 children with CP (28 boys and 11 girls) who were admitted to the CHA Bundang Medical Center in Korea for intensive rehabilitation (Table 1). The inclusion criteria were a diagnosis of CP made by a physician specializing in pediatric rehabilitation and an age of 2 to 7 years. Children who had undergone orthopedic surgery within the previous six months were excluded because their motor performance was unlikely to be indicative of their typical motor ability. All the children and their mothers provided written consent for participation in this study, and we followed the principles of the Declaration of Helsinki.

Table 1

Summary Statistics for Subjects (N=39)

With the permission of the copyright owner, we translated the "Administration and Scoring Guidelines for the GMFM-88 and the GMFM-66" in the GMFM user's manual.4 The translation procedure followed the forward-backward-forward method. The GMPM comprises 20 items of the GMFM, and thus, we translated it based on the GMFM-88, which is a criterion-referenced observational measure for assessing gross motor function in children with CP. The GMFM-88 consists of 88 items grouped into five domains: (A) lying & rolling, (B) sitting, (C) crawling/kneeling, (D) standing, and (E) walking/running/jumping. Each item is scored on a four-point Likert-type scale (0-1-2-3); the higher the score, the better the gross motor function. We converted the raw score for each domain into a percentage of the maximum score for that domain. Each domain was equally weighted, and we calculated the total score by summing the percentages of the five domains and dividing the result by five. The total score for the GMFM-88 was thus based on the percentages for the five domains and was obtained once the subject had completed the measure.4
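The scoring rule described above (domain percentages averaged with equal weight) can be sketched as follows. This is a minimal illustration, not the authors' scoring software. The number of items per domain (17, 20, 14, 13, and 24) is an assumption taken from the published GMFM-88 score sheet, since the article states only the total of 88 items, and the item scores in the example are hypothetical.

```python
# Items per GMFM-88 domain. ASSUMPTION: these counts come from the published
# GMFM-88 score sheet; the article itself only states the total of 88 items.
ITEMS_PER_DOMAIN = {"A": 17, "B": 20, "C": 14, "D": 13, "E": 24}
MAX_ITEM_SCORE = 3  # each item is scored 0, 1, 2, or 3


def domain_percent(domain, item_scores):
    """Return the raw domain score as a percentage of the domain maximum."""
    expected = ITEMS_PER_DOMAIN[domain]
    if len(item_scores) != expected:
        raise ValueError(f"domain {domain} needs {expected} item scores")
    if any(score not in (0, 1, 2, 3) for score in item_scores):
        raise ValueError("item scores must be 0, 1, 2, or 3")
    return 100.0 * sum(item_scores) / (expected * MAX_ITEM_SCORE)


def gmfm88_total(scores_by_domain):
    """Return (per-domain percentages, equally weighted total percentage)."""
    percents = {
        domain: domain_percent(domain, scores_by_domain[domain])
        for domain in ITEMS_PER_DOMAIN
    }
    total = sum(percents.values()) / len(percents)
    return percents, total


# Hypothetical child: full marks in A and B, half of the maximum in C,
# and no items achieved in D or E.
example_scores = {
    "A": [3] * 17,
    "B": [3] * 20,
    "C": [3] * 7 + [0] * 7,
    "D": [0] * 13,
    "E": [0] * 24,
}
example_percents, example_total = gmfm88_total(example_scores)
print(example_percents, example_total)
```

Because each domain is weighted equally regardless of how many items it contains, a single item in a short domain such as standing moves the total more than a single item in a long domain such as walking/running/jumping.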

The GMPM is composed of 20 items selected from the GMFM through a consensus method for assessing the quality of movements in children with CP. The GMPM uses a subset of GMFM items from the following five domains: lying/rolling, crawling/kneeling, sitting, standing, and walking/running/jumping. Three of the 20 items are static (e.g., standing), whereas the remaining 17 are dynamic (e.g., hopping on one foot). For each GMPM item, three of the five possible attributes (alignment, coordination, dissociated movement, stability, and weight shift) are designated for assessment. Alignment refers to the adjustment of parts or segments of the body in relation to each other. Coordination is defined as the smooth and controlled use of movements in motor performance and takes into account the timing, velocity, direction, force, and amplitude of movements. Dissociated movements refer to isolated movements (e.g., the extension of the hip with the flexion of the knee). Stability refers to the active maintenance of a body position in the presence of disturbing forces. Finally, weight shift is defined as movement involving the transfer of the body's center of gravity. We assessed each attribute by using a five-point Likert-type scale ranging from "severely abnormal" (1) to "consistently normal" (5) and calculated the percent scores for the attributes and the total score (scale 0-100%). We scored all three attributes for each item simultaneously, based on the average performance in the three trials.11

We administered the K-GMFM-88 and the GMPM in a pediatric physical therapy room that was comfortable and familiar to the subjects, and the procedure was videotaped by two pediatric physical therapists. We used video recordings because detailed live scoring takes a long time and requires frequent breaks, and because video enables more accurate scoring. All children were assessed barefoot, without assistive devices. Administering and recording the tests took about 40 minutes for gross motor function and about 20 minutes for gross motor performance. To assess inter-rater reliability, two pediatric physical therapists (raters A and B) who were not involved in the video recording served as raters for the K-GMFM-88 and the GMPM. Both had more than six years of experience in the evaluation and treatment of children with CP. The two raters attended a one-week training workshop on the administration and scoring of the K-GMFM-88 and the GMPM, which included ten hours of manual education and ten hours of videotape scoring sessions. The training was guided by a senior pediatric physical therapist who was fully proficient in the assessment tools. The two raters independently viewed the video recordings of the 39 subjects and scored them by using the K-GMFM-88 and the GMPM over a one-week period without discussing the results.

We calculated the means and standard deviations (SDs) for each test. The K-GMFM-88 and the GMPM are ordinal measures presenting a number of response options that "order" the characteristics of interest from better to less skilled performance, and thus, we employed non-parametric statistics.4 We used intraclass correlation coefficients (ICC(3,1)) with 95% confidence intervals and Spearman's correlation coefficients to evaluate the inter-rater reliability of domain/attribute scores and total scores for the K-GMFM-88 and the GMPM. In addition, we employed the Wilcoxon signed-rank test to test for differences between the raters. Further, we calculated ICCs separately by functional classification (GMFCS level). Finally, we used Spearman's correlation coefficients to assess the relationship between the K-GMFM-88 and the GMPM. We conducted all the statistical analyses by using the SPSS software package for Windows (ver. 12.01). A p-value of less than 0.05 was considered significant.
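For readers unfamiliar with ICC(3,1), the following is a minimal sketch of how the coefficient can be obtained from a subjects-by-raters table, assuming the standard two-way mixed-effects, single-measure, consistency form: ICC(3,1) = (MS_subjects - MS_error) / (MS_subjects + (k - 1) MS_error), where k is the number of raters. The analyses in this study were run in SPSS; the function name and the example ratings below are hypothetical and serve only to illustrate the calculation.

```python
def icc_3_1(ratings):
    """ICC(3,1): two-way mixed effects, single measure, consistency.

    `ratings` is a list of rows, one row per subject, each row holding
    one score per rater (same rater order in every row).
    """
    n = len(ratings)                      # number of subjects
    k = len(ratings[0]) if n else 0       # number of raters
    if n < 2 or k < 2 or any(len(row) != k for row in ratings):
        raise ValueError("need at least 2 subjects and 2 raters, no gaps")

    grand_mean = sum(sum(row) for row in ratings) / (n * k)
    subject_means = [sum(row) / k for row in ratings]
    rater_means = [sum(row[j] for row in ratings) / n for j in range(k)]

    # Two-way ANOVA sums of squares (one observation per cell).
    ss_total = sum((x - grand_mean) ** 2 for row in ratings for x in row)
    ss_subjects = k * sum((m - grand_mean) ** 2 for m in subject_means)
    ss_raters = n * sum((m - grand_mean) ** 2 for m in rater_means)
    ss_error = ss_total - ss_subjects - ss_raters

    ms_subjects = ss_subjects / (n - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))
    return (ms_subjects - ms_error) / (ms_subjects + (k - 1) * ms_error)


# Hypothetical total scores (%) from two raters for five children.
example_ratings = [
    [62.0, 64.0],
    [35.5, 34.0],
    [80.0, 81.5],
    [47.0, 49.0],
    [91.0, 90.0],
]
print(round(icc_3_1(example_ratings), 3))
```

Because this form measures consistency rather than absolute agreement, a constant offset between raters (one rater always scoring a few points higher) does not lower the coefficient, which is one reason a separate test for systematic differences between raters, such as the Wilcoxon signed-rank test used here, is informative.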

RESULTS

Table 1 provides the characteristics of the 39 subjects. For the five domains and total score of the K-GMFM-88, ICCs ranged from .978 to .995, and Spearman's correlation coefficients ranged from .916 to .997. For the five attributes and total score of the GMPM, ICCs ranged from .863 to .929, and Spearman's correlation coefficients ranged from .812 to .885 (Table 2). There was no statistically significant difference between the two raters for any domain or for the total score of the K-GMFM-88. For the GMPM, all attributes except for "alignment" showed no difference between raters A and B (Table 3). For the children at GMFCS levels I and II, ICCs ranged from .898 to .982 for the K-GMFM-88 and from .757 to .830 for the GMPM. For the children at GMFCS levels III, IV, and V, ICCs ranged from .974 to .997 for the K-GMFM-88 and from .808 to .913 for the GMPM. Spearman's correlation coefficients for GMFCS levels I and II ranged from .875 to .991 for the K-GMFM-88 and from .649 to .758 for the GMPM. For GMFCS levels III-V, they ranged from .925 to .998 and from .715 to .838, respectively (Table 4). Table 5 shows the correlations between the K-GMFM-88 and the GMPM. All the subscale scores of the two measures showed good or high positive correlations (r=.762-.884).

Table 2

Inter-rater Reliability for the K-GMFM-88 and the GMPM

Table 3

Wilcoxon Signed-rank Test for the K-GMFM-88 and the GMPM

Table 4

Inter-rater Reliability for the K-GMFM-88 and the GMPM Classified by Functional Level

Table 5

Relationship between the K-GMFM-88 and the GMPM

DISCUSSION

We evaluated the inter-rater reliability of the K-GMFM-88 and the GMPM. To the authors' knowledge, this study is the first to consider a population of Korean children with CP to determine the reliability of the K-GMFM-88 and the GMPM simultaneously by using the same subjects and video recordings. The results indicate good reliability of the K-GMFM-88 and the GMPM.

Although there are several statistical methods for assessing inter-rater reliability, we employed the ICC and Spearman's correlation coefficient to provide an in-depth analysis of the inter-rater reliability of the K-GMFM-88 and the GMPM. We used the ICC to assess the degree of correspondence and agreement between the raters,12 and Spearman's correlation coefficient to measure the correlation. Spearman's correlation coefficient provides information on the level of the correlation as well as its direction. Watkins and Portney12 reported that an ICC≥.90 indicates high reliability, .75-.90 indicates good reliability, .50-.75 indicates moderate reliability, and ≤.50 indicates poor reliability. Meyer13 reported that a correlation coefficient r≥.8 indicates a high correlation, r=.6-.8 indicates a good correlation, r=.4-.6 indicates a moderate correlation, and r≤.4 indicates a poor correlation. In this study, the ICCs and Spearman's correlation coefficients for the K-GMFM-88 were high for all the domains and the total score. The reliability of the GMFM-88 in 317 children with CP aged 1 to 15 years has been reported to be very good.14 For the GMPM, the ICC for the total score has been reported as .96 (.69-.98),5 and the ICCs for the attributes as .78-.86.9 In the present study, the ICCs and Spearman's correlation coefficients for the total score and attributes of the GMPM ranged from good to high. We also assessed the difference in each score between the two raters and, through the Wilcoxon signed-rank test, found a difference in the "alignment" attribute of the GMPM, although the ICCs were high. Assessing "alignment" involves the observation of more than one body segment5 and requires a judgment based on multiple features at once, which might increase the chance of error or disagreement. Thus, raters should be familiar with the manual and scoring practices to minimize inter-rater inconsistencies.
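The interpretation bands cited above can be written as a small lookup, shown here as a sketch. The cut-offs are those attributed to Watkins and Portney (ICC) and Meyer (correlation coefficient) in the text; how a value falling exactly on a shared boundary (for example, an ICC of exactly .75) is assigned is an assumption of this sketch, because the cited ranges overlap at their end points. The function names are hypothetical.

```python
def interpret_icc(icc):
    """Label an ICC using the bands attributed to Watkins and Portney.

    ASSUMPTION: a value on a shared boundary goes to the higher band,
    except .50, which the text lists under 'poor' (<= .50).
    """
    if icc >= 0.90:
        return "high"
    if icc >= 0.75:
        return "good"
    if icc > 0.50:
        return "moderate"
    return "poor"


def interpret_correlation(r):
    """Label the strength of a correlation using the bands attributed to Meyer.

    The sign of r gives the direction; only its magnitude is labeled here.
    ASSUMPTION: same boundary handling as in interpret_icc.
    """
    magnitude = abs(r)
    if magnitude >= 0.8:
        return "high"
    if magnitude >= 0.6:
        return "good"
    if magnitude > 0.4:
        return "moderate"
    return "poor"


# Values reported in this study.
print(interpret_icc(0.978))          # lowest K-GMFM-88 ICC
print(interpret_icc(0.863))          # lowest GMPM ICC
print(interpret_correlation(0.649))  # lowest GMPM Spearman coefficient, GMFCS I-II
```

Applied to the values reported here, the lowest K-GMFM-88 ICC (.978) falls in the high band and the lowest GMPM ICC (.863) in the good band, which matches the description of the GMPM results as ranging from good to high.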

In the present study, inter-rater reliability did not vary according to the functional level of the subject for the five domains and total score of the K-GMFM-88. However, the ICC for the total score of the GMPM was lower for GMFCS I and II than for GMFCS III, IV, and V. Sorsdahl et al.15 examined the inter-rater reliability of the GMPM by using video recordings of children with CP and reported that the ICC for the total score of the GMPM was lower in independent ambulators, who often move at high speed, which is consistent with the results of the present study. We attempted to slow the subjects down, but the observation was limited by their innate movement patterns.

The reliability of a test refers to its ability to provide consistent results. A number of sources of variation may influence the reliability of results obtained from a measure. These sources include problems with the test itself, such as unclear administration guidelines or imprecise scoring systems. A lack of training can also lead to variation between raters. In this study, we provided the raters with intensive training to minimize such variation. In addition, structured video recordings and video scoring offered many advantages: we allowed the raters to watch the video recordings several times and to stop the recordings to review the scoring guidelines in the manuals. Thus, video observations may be better when the scoring is performed by trained and skilled raters.15

The correlation between the K-GMFM-88 and the GMPM ranged from .762 to .884, which suggests a considerable overlap between their constructs. This indicates that to examine the gross motor function of children with CP, we need to consider the level as well as the performance quality of their gross motor function.16 This study's main limitation was that we evaluated only the ICC. A high ICC indicates high relative reliability; however, it does not necessarily imply high absolute reliability. Relative reliability refers to the consistent ranking of scores for an individual within a group across repeated measurements, whereas absolute reliability requires small measurement errors. Thus, a more appropriate way to investigate the reliability of an instrument intended for use in a clinical setting may be to examine absolute reliability. In addition, high relative and absolute reliability may require intensive efforts under practical guidelines. Further research on the inter-rater reliability of the K-GMFM-88 and the GMPM should be conducted across multiple centers by pediatric therapists and clinicians with a wide range of clinical experience.
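To illustrate the distinction between relative and absolute reliability drawn above, one commonly used index of absolute reliability is the standard error of measurement, SEM = SD × sqrt(1 - ICC). This index was not computed in the present study. The sketch below is only an illustration, and the between-subject SD of 20 percentage points is a hypothetical value, not a result from this sample.

```python
import math


def standard_error_of_measurement(sd, icc):
    """SEM = SD * sqrt(1 - ICC), in the same units as the scores."""
    if sd < 0:
        raise ValueError("sd must be non-negative")
    if not 0.0 <= icc <= 1.0:
        raise ValueError("icc must lie between 0 and 1")
    return sd * math.sqrt(1.0 - icc)


# HYPOTHETICAL between-subject SD of 20 percentage points.
sem_when_icc_is_099 = standard_error_of_measurement(20.0, 0.99)  # about 2 points
sem_when_icc_is_084 = standard_error_of_measurement(20.0, 0.84)  # about 8 points
print(sem_when_icc_is_099, sem_when_icc_is_084)
```

The same ICC can therefore correspond to quite different measurement errors depending on how heterogeneous the sample is, which is why a high ICC alone does not establish that individual scores are precise enough for clinical decisions.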

CONCLUSION

The inter-rater reliability of the K-GMFM-88 and the GMPM was highly satisfactory in terms of total scores and subscores, indicating that they are reliable methods for assessing gross motor functional ability as well as the quality of movement in children with CP.

References

1. Rosenbaum P, Paneth N, Leviton A, Goldstein M, Bax M, Damiano D, Dan B, Jacobsson B. A report: the definition and classification of cerebral palsy April 2006. Dev Med Child Neurol Suppl 2007;109:8–14. 17370477.
2. Palisano RJ, Gracely EJ, Rosenbaum PL. Gross motor capability and performance of mobility in children with cerebral palsy: a comparison across home, school, and outdoor/community settings. Phys Ther 2004;84:419–429. 15113275.
3. Boyce WF, Gowland C, Rosenbaum PL, Lane M, Plews N, Goldsmith CH, Russell DJ, Wright V, Potter S, Harding D. The Gross motor performance measure: validity and responsiveness of a measure of quality of movement. Phys Ther 1995;75:603–613. 7604079.
4. Russell DJ, Rosenbaum PL, Avery LM, Lane M. Gross motor function measure (GMFM-66 & GMFM-88) user's manual 2002. 2nd ed. Hamilton: Gross Motor Measure Groups. p. 3–4.
5. Gowland C, Boyce WF, Wright V, Russell DJ, Goldsmith CH, Rosenbaum PL. Reliability of the Gross Motor Performance Measure. Phys Ther 1995;75:597–602. 7604078.
6. Bower E, McLellan DL, Arney J, Campbell MJ. A randomized controlled trial of different intensities of physiotherapy and different goal-setting procedures in 44 children with cerebral palsy. Dev Med Child Neurol 1996;38:226–237. 8631519.
7. Guyatt GH, Walter S, Norman G. Measuring change over time: assessing the usefulness of evaluative instruments. J Chronic Dis 1987;40:171–178. 3818871.
8. Mclaughlin JF, Bjornson KF, Astley SJ, Hays RM, Hoffinger SA, Armantrout EA, Roberts TS. The role of selective dorsal rhizotomy in cerebral palsy: critical evaluation of a prospective clinical series. Dev Med Child Neurol 1994;36:755–769. 7926327.
9. Thomas SS, Buckon CE, Phillips DS, Aiona MD, Sussman MD. Interobserver reliability of the gross motor performance measure: preliminary results. Dev Med Child Neurol 2001;43:97–102. 11221911.
10. Harris SR, Haley SM, Tada WL, Swanson MW. Reliability of observational measures of the Movement Assessment of Infants. Phys Ther 1984;64:471–477. 6709711.
11. Boyce W, Gowland C, Rosenbaum P, Hardy S, Lane M, Plews N, Goldsmith C, Russell D, Wright V, Potter S, et al. Gross motor performance measure manual 1998. Kingston: Queen's University. p. 6.
12. Watkins MP, Portney LG. Foundations of clinical research: applications to practice 1993. 1st ed. East Norwalk, Conn: Appleton and Lange. p. 53–67.
13. Meyer CR. Measurement in physical education 1979. 1st ed. New York: Ronald Press Co.
14. Beckung E, Carlsson G, Carlsdotter S, Uvebrant P. The natural history of gross motor development in children with cerebral palsy aged 1 to 15 years. Dev Med Child Neurol 2007;49:751–756. 17880644.
15. Sorsdahl AB, Moe-Nilssen R, Strand LI. Observer reliability of the Gross Motor Performance Measure and Quality of Upper Extremity Skills Test, based on video recordings. Dev Med Child Neurol 2008;50:146–151. 18201304.
16. Palisano RJ, Hanna SE, Rosenbaum PL, Russell DJ, Walter SD, Wood EP, Raina PS, Galuppi BE. Validation of a model of gross motor function for children with cerebral palsy. Phys Ther 2000;80:974–985. 11002433.

Table 1

Summary Statistics for Subjects (N=39)

Values are mean±SD or n (%)

GMFCS: Gross motor function classification system

Table 2

Inter-rater Reliability for the K-GMFM-88 and the GMPM

ICC: Intraclass correlation coefficient, CI: Confidence interval, K-GMFM-88: Korean version Gross motor function measure-88, GMPM: Gross motor performance measure

*p<0.01

Table 3

Wilcoxon Signed-rank Test for the K-GMFM-88 and the GMPM

Values are mean±SD

K-GMFM-88: Korean version Gross motor function measure-88, GMPM: Gross motor performance measure

*p<0.05

Table 4

Inter-rater Reliability for the K-GMFM-88 and the GMPM Classified by Functional Level

Values are mean±SD

K-GMFM-88: Korean version Gross motor function measure-88, GMPM: Gross motor performance measure, GMFCS: Gross motor function classification system

*p<0.01

Table 5

Relationship between the K-GMFM-88 and the GMPM

Values are Spearman's rho

K-GMFM-88: Korean version Gross motor function measure-88, GMPM: Gross motor performance measure

*p<0.01