Psychometric properties of tests that assess the angular vestibulo-ocular reflex (aVOR) and vestibulospinal reflex function are currently unknown. This study investigated the test-retest reliability and response stability of gaze stabilization, postural sway, and dynamic balance measures in persons with multiple sclerosis (MS) and controls.
Nineteen adults with MS and 14 controls performed passive horizontal head impulses, quiet standing, and dynamic balance tests on two separate occasions. Gaze stabilization measures included aVOR gain, number of compensatory saccades (CSs) per head rotation, CS latency, and gaze position error. Postural sway included sway amplitude and total sway path. Dynamic balance measure included the Functional Gait Assessment. Intraclass correlation coefficient, standard error of measurement (SEM, SEM%), and minimal detectable difference at 95% confidence level were calculated.
Test-retest reliability for aVOR gain, CSs per head rotation, and gaze position error was moderate and for each postural sway and dynamic balance measure was good. Low error (SEM, SEM%) for aVOR gain, CS latency, postural sway, and dynamic balance variables and low minimal detectable difference values for aVOR gain and Functional Gait Assessment scores were seen, suggestive of acceptable response stability.
These results support the utility of some of the gaze and postural measures for examination and treatment efficacy purposes in people with MS.
Balance disorders are frequent in persons with multiple sclerosis (MS),1 which may result in falls, comorbidities, decreased physical activity, and impaired quality of life.2,3 Imbalance and a high incidence of falls (>50%)4,5 form an important health and societal participation concern in persons with MS due to the associated restrictions in activities of daily living, the injury-related financial costs, and the increased risk of mortality.2,3,6 Gaze and postural instabilities have been associated with balance impairments and increased fall risk in older individuals and in people with various neurologic disorders.2,7,8 However, these factors are not frequently examined in persons with MS during routine clinical practice, demonstrating the need for comprehensive balance assessments when investigating underlying visual, vestibular, or proprioceptive deficiencies. Moreover, the measurement properties of various gaze and postural assessments have not been well-studied in persons with MS, which may further hinder their use by clinical providers in practice.
Gaze stabilization during head movements is a function of the vestibulo-ocular reflex (VOR) and other oculomotor strategies that generate compensatory eye movements to maintain visual stability of a target.9,10 Accurate perception and integration of the vestibulo-ocular and vestibulospinal sensations is required to maintain gaze and postural stability during daily movements.9 Recently, the video head impulse test (vHIT) has been used to assess gaze stabilization,11 and although the measurement accuracy of vHIT has been established in people with peripheral vestibular loss,11–13 there are limited data available on its reliability in individuals with central neurologic disorders such as MS or in healthy individuals.
Postural instability is frequently examined by standing postural sway and clinical balance measures. Standing postural sway measures use center of pressure (COP)–based findings on a force platform.14 Impairments in standing postural sway have been associated with balance and gait impairments15,16 and are identified as important factors in the relearning of independent standing and walking abilities.17 Still, the reliability of static posturographic tests and related spatial domain parameters (amplitude, sway path) are less examined in people with MS.18,19 Clinical balance measures, on the other hand, include a variety of performance-based tests, such as reaching, timed walking, and stair climbing tests. One such measure is the Dynamic Gait Index, which has previously been studied in people with MS20–22 ; however, the reliability of a modified version, the Functional Gait Assessment (FGA), is currently unknown in persons with MS. Because low scores on the FGA have been associated with fall risk and balance deficits in people with vestibular disorders,23 there is a need to examine its reliability in people with MS who frequently present with balance dysfunction.
Identification of reliability and response stability are critical to detect true test score changes between two sessions beyond that attributable to measurement error. Because the psychometric properties of instrumented gaze stabilization (vHIT), postural sway (COP), and clinical dynamic balance (FGA) measures are not distinctly established in persons with MS, there is a need to assess their reliability and response stability to confidently document gaze and postural performance in this population. Therefore, to support the use of these outcome measures for examination and treatment purposes, we aimed to investigate the test-retest reliability and the stability of repeated responses for gaze stabilization, postural sway, and dynamic balance measures in people with MS and in age-matched controls. We hypothesized that these measures would demonstrate at least moderate reliability and acceptable response stability in people with MS and controls.
Thirty-nine persons with MS recruited from local neurologist practices and support groups were screened (Figure S1, published in the online version of this article at ijmsc.org). Age-matched neurologically healthy controls were recruited from the community. The data presented for this study are part of a larger trial8 in which the sample size was determined from previously observed effect sizes for between-group differences in angular VOR (aVOR) gain using vHIT.13 The aVOR is an involuntary eye movement reflex that stabilizes retinal images during angular head perturbations to maintain clear vision, and the aVOR gain is typically determined by the ratio of peak eye velocity to peak head velocity during head perturbations. Selection criteria for people with MS were as follows: 18 years or older; confirmed MS diagnosis; able to stand for at least 30 seconds without support; Expanded Disability Status Scale score of 6.5 or less; fall risk established by meeting one of four criteria (≥2 retrospective falls per year, Activities-Specific Balance Confidence Scale score <70, Berg Balance Scale score <44, or Dizziness Handicap Inventory score >59)20 ; no history of exacerbations in the previous 3 months; no history of other peripheral or central nervous system injury or neuro-otologic condition besides MS; absence of internuclear ophthalmoparesis or ophthalmoplegia; and absence of other orthopedic, cognitive, medical, or surgical limitations that may limit their study participation.
Participants who met the inclusion criteria signed an institutional review board–approved informed consent form and provided the medical, MS-related, and fall history via personal interviews. Thereafter, the fall risk assessments to identify persons with MS who met any one of the four aforementioned fall risk criteria were conducted. Regarding persons with MS with fall risk, we excluded those with peripheral vestibular pathologic abnormalities identified by a clinical vestibular examination. In persons with MS identified as a fall-risk, we excluded those with peripheral vestibular pathologies by a clinical vestibular examination comprising oculomotor range of motion, bedside HIT, VOR cancellation test, saccades, and test of skew.9 Peripheral vestibular pathologic abnormalities were screened for using an infrared video goggles system, and nystagmus at rest, nystagmus during benign positional vertigo tests, and head-shake nystagmus examinations. The consented participants subsequently underwent a physical therapist (H.G.)–administered neurologic examination (Expanded Disability Status Scale).24 The gaze stabilization, standing postural sway, and dynamic balance measurements were then conducted on a single day, followed by a repeated assessment on a different day. The assessment days were separated by at least 2 days (Figure S1) and were conducted by the same assessor (H.G.). The fall risk screening; clinical vestibular, neurologic, and instrumented gaze; and balance evaluations were performed by a qualified trained assessor with a physical therapy license and more than 4 years of experience working with individuals with MS (H.G.).
Gaze stabilization was determined using vHIT goggles (GN Otometrics) (Figure S2), which have been previously validated in people with aVOR deficits11,12 and have been used to demonstrate gaze deficits in persons with MS.8 The experimenter administered horizontal (yaw) head impulses to each side to collect at least eight trials suitable for analysis (Appendix S1). The study procedure has been described in detail by Garg et al.8 The gaze stabilization variables (Figure 1) were aVOR gain, number of compensatory saccades (CSs) per head rotation, CS latency, and gaze position error. The aVOR gain was determined by dividing the desaccaded eye velocity area under the curve (AUC) by the head velocity AUC, beginning with the onset of the head impulse to the instant when head velocity reverted to zero.9 The onset of head movement was preferentially determined by the methods described by MacDougall et al13 as the time point 60 milliseconds before maximal yaw head acceleration and confirmed manually by visual inspection. In a few instances in which the MacDougall et al13 technique inaccurately identified the onset, the start of head movement was identified as the latest point in time when the head velocity rose above zero and continued to move in the intended direction. Although head rotations to both (left and right) directions were performed, the aVOR gain of the worst-functioning side was used as the dependent variable. The CSs per head rotation were manually counted for each head rotation and then averaged over the number of rotations. For investigative purposes, a CS was defined as a horizontal eye rotation occurring during the head rotation that assisted a deficient aVOR and occurred in the direction of the vestibular slow component.25 The CS latency was calculated as the time between onset of head acceleration and onset of the first identifiable CS. The gaze position error was measured by subtracting the eye position from the target position when the head velocity returned to zero.
Standing postural sway was assessed using an AMTI OR6-7 series force platform (Advanced Mechanical Technologies Inc, Watertown, MA). The kinetic (COP) data were sampled at 200 Hz. Data were captured using Vicon Nexus (Vicon Motion Systems, Centennial, CO). The following postural sway variables were calculated during a 25-second interval of quiet stance: mediolateral sway amplitude, determined as the maximal excursion of COP in the mediolateral direction; anteroposterior sway amplitude, determined as the maximal excursion of COP in the anteroposterior direction; and total sway path, determined as the cumulative COP excursion during the 25 seconds (Appendix S1). Postural sway has previously been studied in people with MS to detect aberrant sensory contributions using different instrumented techniques; however, the measurement properties of specific variables were not discussed.26,27
Data Reduction and Analysis
The collected raw data from the vHIT were exported to custom software written in Matlab (The MathWorks, Natick, MA) for screening and analysis. Standardized screening measures were used to discard trials with blinks and movement artifacts.30 Briefly, trials were screened using the following criteria: visual inspection for appropriateness of eye and head movement; blink identification by detecting either a biphasic deflection of the eye in the lateral plot or an acute deflection in the pitch plot.30 Kinetic data from Vicon Nexus was imported into Visual 3D software (C-Motion Inc, Germantown, MD) for further analysis. The data were filtered using a lowpass, zero-phase shift, Butterworth filter at 20 Hz based on visual inspection of the data and the results of a residual analysis.31 IBM SPSS Statistics for Windows, version 23.0 (IBM Corp, Armonk, NY) was used for all analyses.
Data for all dependent variables were examined separately for persons with MS and controls. Test-retest reliability, which reflects the agreement between scores on each testing day, was assessed using the intraclass correlation coefficient (ICC) with a two-way mixed-effects model (ICC[3,1]).32 The ICC point estimators were interpreted using the following criteria: ICC less than 0.5 indicates poor; 0.5 to 0.75, moderate; and greater than 0.75, good reliability.28 Response stability was established by calculations of standard error of measurement (SEM) and minimal detectable difference with a confidence level of 95% (MDD95). The SEM was computed as SEM = SD √(1–estimated reliability coefficient), where SD is the pooled standard deviation of test-retest measures, and the MDD95 (ie, measure of true change) was calculated as 1.96 × √2 × SEM.32 The SEM and MDD95 values, expressed in the unit of measurement, describe the limits for change that can be considered above the threshold of measurement error and indicate a true change after treatment. The SEM% and MDD95% were also determined as a percentage of the mean to produce unitless indicators and allow for comparisons. Values for SEM% and MDD95% less than 20% were considered conservative estimates of low measurement error and acceptable response stability.33
Nineteen persons with MS met the inclusion criteria and participated. The MS group comprised three men and 16 women with a mean ± SD age of 53.4 ± 11.7 years, falls per year of 4.2 ± 3.3, and disease duration (time since diagnosis) of 16.0 ± 11.4 years, and the control group included five men and nine women with a mean ± SD age of 54.6 ± 11.9 years and falls per year of 0.07 ± 0.27 (Table 1).
The aVOR gain and gaze position error demonstrated moderate reliability in people with MS, and the aVOR gain and CSs per head rotation demonstrated moderate reliability in controls. The CS latency demonstrated poor reliability in both groups (Table 2). The postural sway and dynamic balance measures demonstrated good reliability in people with MS (ICC range, 0.87–0.98) and controls (ICC range, 0.76–0.90) (Table 2).
Irrespective of the group, aVOR gain, CS latency, and all the postural sway and dynamic balance variables exhibited low measurement error (SEM% <20) and, therefore, acceptable response stability. Relative to controls, larger MDD95 estimates for CSs per head rotation and the postural sway measures were seen in people with MS (Table 2). Angular VOR gain and FGA scores demonstrated low (<20%) MDD95% in both groups, suggesting acceptable response stability.
This study aimed to examine the psychometric properties of gaze stabilization, postural sway, and dynamic balance measures by assessing the test-retest reliability and response stability between two separate assessments in people with MS and controls. We determined that most gaze stabilization measures demonstrated moderate test-retest reliability and that the postural sway and dynamic balance measures showed good test-retest reliability. The aVOR gain, CS latency, and all the postural sway and dynamic balance variables demonstrated SEM% less than 20, and aVOR gain and FGA scores showed low MDD95%, suggestive of acceptable response stability. The SEM and MDD values obtained from this study should enhance the interpretation of gaze and postural stability changes seen after treatment in persons with MS and healthy adults during intervention research and during clinical decision making.
Test-Retest Reliability: ICC
The aVOR gain was found to be a moderately reliable measure for persons with MS and controls in this study. Its moderate ICC value is likely due to within-individual variability in aVOR gain. Because hand placement technique34 and head thrust velocities35 have been identified as sources of variability, we used a consistent hand placement technique for all the trials. In addition, the mean ± SD head velocities administered on the two assessment days were similar (186.80° ± 18.21° per second on day 1 vs 187.60° ± 17.65° per second on day 2). We also demonstrated moderate reliability for other gaze stabilization measures (CSs per head rotation and gaze position error). To our knowledge, this is the first study to report the test-retest reliability of the horizontal vHIT in persons with MS and controls using the GN Otometrics system. These results concur with those of a previous study of healthy young adults using a different device.36 Future studies should investigate the sources of variability in gaze stabilization assessments and determine ways to minimize it.
Good reliability for postural sway outcomes was found in all the study participants. These findings are consistent with previous work examining test-retest reliability in healthy older adults and individuals with stroke.37–39 In addition, the dynamic balance measure exhibited good test-retest reliability in all study participants. Similar results have been demonstrated by previous studies in individuals with stroke29 and Parkinson disease40 (ICC > 0.90), emphasizing the usefulness of the FGA in assessing dynamic balance in neurologic populations. Because the FGA includes progressively challenging motor tasks, which addresses the visual, vestibular, and proprioceptive contributions to balance during gait, this test may be better suited for people with MS reporting imbalance and a history of frequent falls. This study demonstrated better test-retest reliability of the FGA in persons with MS (ICC, 0.95) compared with controls (ICC, 0.82). Taken together, these results support the use of postural sway and dynamic balance assessments in clinical practice and as relevant outcome measures in research trials using these populations. However, such results should be combined with the information on measurement error of these instruments.
Response Stability: SEM, MDD95 and SEM%, MDD95%
In this study, information from response stability metrics allows the clinician or researcher to interpret an observed change in an outcome as a potential product of measurement error or as a meaningful change in vestibular physiology or postural behavior. This study found low measurement error (SEM, SEM%) for aVOR gain, CS latency, and all the postural sway and dynamic balance variables. Low MDD95 and MDD95% values were seen for aVOR gain and FGA scores only.
Regardless of group membership, aVOR gain demonstrated acceptable response stability, suggesting that a relatively small magnitude of change is required to demonstrate a real change in performance as opposed to the other gaze stabilization measures (Table 2). These results support the utility of aVOR gain as a sensitive gaze stability outcome in persons with MS as well as in healthy individuals. The contributors to increased measurement error in other gaze stabilization measures may include the location and severity of demyelination, the stability and degree of vestibular dysfunction in the person being tested, age, the testing instrumentation, and the testing technique. Further research is warranted to characterize common sources of measurement error and recommend steps to minimize this error for diagnostic, prognostic, and rehabilitative purposes.
Low error (SEM% <20%) in the postural sway and dynamic balance measures suggests acceptable response stability for research and clinical practice. Among the COP-based variables, total sway path consistently demonstrated the lowest MDD95 estimates and percentage values, suggesting less measurement error than other outcomes. Previous studies have found similar results in individuals with stroke39 and healthy individuals.37 Because this study demonstrated a range of MDD95% estimates (<50%) for other postural sway variables, further research with larger sample sizes is needed to appropriately define population-specific outcomes of postural sway. This study found better response stability for the FGA in both groups as opposed to previous reports,29,40 however, higher estimates of SEM and MDD95 were seen in people with MS compared with healthy older adults. Clinically, this implies that to identify a real performance modification, a greater change is required in people with MS than in neurologically healthy individuals. Although there is no consensus on the acceptable amount of measurement error,33 these findings support the use of some of these postural sway and balance assessments in clinical and research settings for persons with MS as well as healthy adults.
Limitations and Conclusion
Small sample size and heterogeneity may have influenced the results, and, therefore, a larger study that includes homogenous groups of older adults and people with MS is warranted to support the utility of these gaze assessments in clinical practice. In addition, the current reliability data may not be generalized to other populations with neurologic and vestibular conditions. By design, this study chose to assess the test-retest reliability with a single rater and across different days; therefore, these results cannot be applied toward within-day assessments and multiple raters.
In conclusion, the present study found moderate reliability of gaze stabilization measures and good reliability of postural sway and dynamic balance measures in persons with MS as well as age-matched controls. Although the aVOR gain and FGA measures may be stable across repeated testing sessions, a relatively larger change in performance may be needed to detect real change for the remainder of the gaze stabilization and COP-based measures.
Accurate vestibular sensations allow for stable gaze and postural responses during static and dynamic positions requiring balance, thus highlighting the need for such assessments in persons with MS at fall risk.
This study addresses the insufficient information on clinimetric properties of gaze and postural assessments in MS, which might have been a hindrance to its application in clinical settings. Acceptable reliability and low values of minimum error and minimal detectable difference for gaze stability (angular vestibulo-ocular reflex gain), postural sway (total sway path), and dynamic balance (Functional Gait Assessment) measures were found.
Therefore, clinicians can use both instrumented and functional tests to document gaze and postural responses in persons with MS.
The authors declare no conflicts of interest.
This investigation was supported by the National Multiple Sclerosis Society (grant PP1841).
These data have been presented in part as a poster at the Fourth International Symposium on Gait and Balance in Multiple Sclerosis; October 10–11, 2014; Cleveland, OH. In addition, separate from this research question and article, part of the gaze stability assessment procedure and data has been previously published in Garg et al.8
From the Rocky Mountain University of Health Professions, Provo, UT, USA (HG); Department of Otolaryngology–Head and Neck Surgery, Johns Hopkins University School of Medicine, Baltimore, MD, USA (MCS); and Department of Physical Therapy (EG, KBF, LED) and Department of Health, Kinesiology, and Recreation (JS), The University of Utah, Salt Lake City, UT, USA.
Note: Supplementary material for this article is available at ijmsc.org.