Context

The King-Devick (K-D) test is used to identify oculomotor impairment after concussion. However, the diagnostic accuracy of the K-D test over time has not been evaluated.

Objectives

To (1) examine the sensitivity and specificity of the K-D test at 0 to 6 hours postinjury, 24 to 48 hours postinjury, the beginning of a return-to-play (RTP) protocol (asymptomatic), unrestricted RTP, and 6 months postconcussion and (2) compare outcomes between athletes with and those without concussion across confounding factors (sex, age, sport contact level, academic year, learning disorder, attention-deficit/hyperactivity disorder, migraine history, concussion history, and test administration mode).

Design

Retrospective, cross-sectional design.

Setting

Multiple institutions in the Concussion Assessment, Research and Education Consortium.

Patients or Other Participants

A total of 320 athletes with a concussion (162 men, 158 women; age = 19.80 ± 1.41 years) were compared with 1239 total collegiate athletes without a concussion (646 men, 593 women; age = 20.31 ± 1.18 years).

Main Outcome Measure(s)

We calculated the K-D test time difference (in seconds) by subtracting the baseline from the most recent time. Receiver operator characteristic (ROC) curve and area under the curve (AUC) analyses were used to determine the diagnostic accuracy across time points. We identified cutoff scores and corresponding specificity at both the 80% and 70% sensitivity levels. We repeated ROC with AUC analyses using confounding factors.

Results

The K-D test predicted positive results at the 0- to 6-hour (AUC = 0.724, P < .001), 24- to 48-hour (AUC = 0.701, P < .001), RTP (AUC = 0.640, P < .001), and 6-month postconcussion (AUC = 0.615, P < .001) tim points but not at the asymptomatic time point (AUC = 0.513, P = .497). The 0- to 6-hour and 24- to 48-hour time points yielded 80% sensitivity cutoff scores of −2.6 and −3.2 seconds (ie, faster), respectively, but 46% and 41% specificity, respectively. The K-D test had a better AUC when administered using an iPad (AUC = 0.800, 95% CI = 0.747, 0.854) compared with the spiral-bound card system (AUC = 0.646, 95% CI = 0.600, 0.692; P < .001).

Conclusions

The diagnostic accuracy of the K-D test was greatest at 0 to 6 hours and 24 to 48 hours postconcussion but declined across subsequent postconcussion time points. The AUCs did not differentiate between groups across confounding factors. Our negative cutoff scores indicated that practice effects contributed to improved performance, requiring athletes to outperform their baseline scores.

Key Points
  • The King-Devick test contributed acceptable diagnostic value at the 0- to 6-hour and 24- to 48-hour postconcussion time points, but it was not adequate at the asymptomatic time point and was only fair at the return-to-play and 6-month postconcussion time points.

  • The King-Devick test should also be used with caution over repeated administrations, as a practice effect exists.

Concussion assessment and diagnosis comprises a multifaceted approach using several tests to understand the entire clinical presentation of the injury.13  Several assessments are used at baseline and postconcussion to evaluate possible impairments. Concussion evaluation often includes assessing symptoms, neurocognition, balance, and vestibular and ocular performance.3,4  Sideline assessments are essential for informing the clinician who is making the diagnosis, but they must be conducted quickly and must be sensitive to acute impairment. These assessments are often used to facilitate return-to-play (RTP) decisions by sports medicine and allied health care providers.58  The Sports Concussion Assessment Tool, 5th edition, which was developed as a sideline screening tool, includes a patient-reported symptom scale, cognitive screening, brief neurologic screening, and balance testing; however, this test does not evaluate vision or eye movements.1,9  Use of eye movement and tracking assessments, such as the King-Devick (K-D) test, to identify oculomotor abnormalities after brain injury has been increasing.1,1013 

When administering the K-D test, health care providers ask athletes to read aloud a demonstration card and 3 test cards with rows of random, single-digit numbers as quickly as possible with no errors. Saccadic movements may evoke postconcussion impairment in visual and eye-movement pathways that are widely distributed throughout the brain.9  Increased latency and decreased accuracy of eye movements are common after a concussion.6 

Despite its widespread use, little is known about the diagnostic accuracy of the K-D test. In a meta-analysis,9  researchers concluded that the K-D test had high sensitivity (86%) and specificity (90%) when using a cutoff in which postconcussion completion time was longer than baseline completion time. Little information and scarce literature are available on the diagnostic accuracy of the K-D test from autonomous, external investigators who do not have financial interests in the test. We also lack evidence exploring the sensitivity and specificity of the K-D test across multiple postconcussion time points in collegiate athletes, as many researchers11,14  have analyzed repeated baseline performance only. Further investigation is warranted to explore collegiate athletes with concussion across multiple postconcussion time points.

No consensus exists regarding a clinically informative cutoff time difference relative to baseline time for the K-D test when screening individuals for a concussion. Researchers2,6,15  have suggested that concussion should be suspected when postconcussion completion time is longer than baseline completion time. Using this proposed cutoff requires caution because healthy adolescent and collegiate athletes present with pronounced practice effects, experiencing improvement or faster times with repeated administrations.1519  Breedlove et al18  also found that 27% of healthy participants had slowed or increased times when completing 2 baseline K-D tests approximately 1 year apart, which warrants assessments of specificity.

Age, sex, and reading-skill level affect K-D scores at baseline6,11,13,14 ; however, no authors have determined whether K-D test sensitivity and specificity differ across confounding factors. Limited data have addressed how athletes' migraine and concussion history, neurologic disorders (learning disorder [eg, dyslexia], attention-deficit/ hyperactivity disorder [ADHD]), vestibular disorders (vertigo), or the test administration mode influence K-D test sensitivity and specificity postconcussion. Understanding how confounding factors of characteristics and medical history influence postconcussion K-D test performance is necessary to inform clinical decision-making. Therefore, the first aim of our study was to examine the sensitivity and specificity of the K-D test at 0 to 6 hours postconcussion, 24 to 48 hours postconcussion, the beginning of an RTP protocol (asymptomatic), unrestricted RTP, and 6 months postconcussion. The 24- to 48-hour postconcussion time point was established as our earliest time point for comparison with the control group and because of the small number of athletes with a concussion evaluated between 0 and 6 hours postconcussion (n = 51; 15.9%). Our second aim was to explore the effect of confounding characteristics (sex, age, sport contact level, academic year, learning disorder, ADHD, migraine history, concussion history, and administration mode) on the sensitivity and specificity of the K-D test for concussion.

This study was part of the National Collegiate Athletic Association–Department of Defense Concussion Assessment, Research and Education (CARE) Consortium,2  an ongoing comprehensive study examining the effects of concussion in collegiate athletes and US military service academy members.20  All 30 CARE Consortium sites use a common definition and criteria for concussion.20  The K-D test is used at 6 of these sites as a Level B measure, which is considered an emerging evaluation and is added at the discretion of the clinicians at the performance site.

Athletes were only considered healthy and included in our control group if they had >1 baseline assessment and no concussion documented in the CARE Consortium database. The K-D baseline assessments were administered 1 year apart. Athletes were included in our concussed group if they sustained a concussion and completed K-D longitudinal postconcussion assessments. Athletes with concussions, which were diagnosed by each institution's sports medicine team with guidance from the current consensus guidelines,21  were excluded from the control group. Athletes with missing confounding factor variables were excluded from the corresponding analyses. All athletes provided written informed consent before participation, and the US Army Medical Research and Materiel Command Human Research Protection Office and each CARE Consortium site's institutional review board approved the study.

The K-D Test

The K-D test consists of a rapid number-naming task that takes <2 minutes to complete and is available on 2 platforms (iPad [various models, although the K-D test and platform were the same for all; Apple Inc] and paper). Athletes complete 3 test cards of rapid number naming, reading from left to right. For baseline assessments, athletes complete 2 error-free trials to supply a baseline score. The athlete's baseline is the faster of the 2 trials. The test is readministered after concussion, but trials are not repeated in cases of error. Total time is defined as the time in seconds to complete the entire test.5 

Sensitivity and Specificity

For participants with concussion, we subtracted their baseline time from their postconcussion time (postconcussion time point − baseline). This was done at each postconcussion time point: 0 to 6 hours postconcussion; 24 to 48 hours postconcussion; asymptomatic, defined as participants beginning the RTP protocol; unrestricted RTP, defined as participants having completed all stages in the gradual RTP protocol; and 6 months postconcussion. For participants without concussion, we subtracted their first baseline K-D test time from their second baseline K-D test time (baseline 2 – baseline 1). Given that repeat assessments at time points corresponding to those of the concussion group were not available in the control group, the difference in baseline times was used for the control group relative to the concussed group's performance across the 4 time points, beginning with the 24- to 48-hour postinjury time point. The mean time between the first and second baseline for the nonconcussed, control group was 398 ± 1.8 days. For the concussed group, the mean time between baseline and 0- to 6-hour postconcussion time points was 286 ± 68.2 days, 0- to 6-hour and 24- to 48-hour postconcussion time points was 1 ± 1.8 days, 24- to 48-hour postconcussion and asymptomatic time points was 12 ± 4.3 days, and asymptomatic and unrestricted RTP time points was 5 ± 4.4 days.

Influence of Confounding Factors

For the second aim, we extracted the following targeted self-reported descriptive and medical history factors from baseline clinical report forms: sex (female, male), sport contact level (contact, limited contact, noncontact),22  academic year (freshman, sophomore, junior, senior), learning disorder (no, yes), ADHD (no, yes), migraine history (no, yes), concussion history (0, 1, 2, 3+), and administration mode (iPad, spiral-bound cards).

Statistical Analyses

To address our first aim, we used receiver operator characteristic (ROC) curve and area under the curve (AUC) analyses to determine the diagnostic accuracy of the K-D test across the 4 postconcussion time points with the Wilson-Brown method using GraphPad Prism (version 8.1.2). For AUC analyses that were different, we identified cutoff scores and corresponding specificities at both the 80% and 70% sensitivity levels but also included continuous levels of various sensitivity and specificity values in Supplemental Tables 1–5 (available online at http://dx.doi.org/10.4085/1062-6050-0063.21.S1).

To address our second aim, we repeated ROC curve with AUC analyses from the 24- to 48-hour time point segmented by sex (female, male), sport contact level (contact, limited contact, noncontact), academic year (freshman, sophomore, junior, senior), learning disorder (no, yes), ADHD (no, yes), migraine history (no, yes), concussion history (0, 1, 2, 3+), and administration mode (iPad, spiral-bound cards). We used the 24- to 48-hour postconcussion time point because a gain of 38 individuals with concussion compared with 0 to 6 hours postconcussion occurred. To determine whether the characteristics differed across confounding factor segments, we compared the 95% CIs around each AUC outcome. The AUC outcomes with scores of <0.50 were considered poor; 0.51 to 0.69, fair; 0.70 to 0.80, acceptable; 0.80 to 0.90, excellent; and >0.90, outstanding.23  Values >0.55 were considered different. We evaluated the AUC 95% CI overlap and calculated Z and P values using the following formula: Z = abs(AUC1 − AUC2)/SQRT(⁠\(\def\upalpha{\unicode[Times]{x3B1}}\)\(\def\upbeta{\unicode[Times]{x3B2}}\)\(\def\upgamma{\unicode[Times]{x3B3}}\)\(\def\updelta{\unicode[Times]{x3B4}}\)\(\def\upvarepsilon{\unicode[Times]{x3B5}}\)\(\def\upzeta{\unicode[Times]{x3B6}}\)\(\def\upeta{\unicode[Times]{x3B7}}\)\(\def\uptheta{\unicode[Times]{x3B8}}\)\(\def\upiota{\unicode[Times]{x3B9}}\)\(\def\upkappa{\unicode[Times]{x3BA}}\)\(\def\uplambda{\unicode[Times]{x3BB}}\)\(\def\upmu{\unicode[Times]{x3BC}}\)\(\def\upnu{\unicode[Times]{x3BD}}\)\(\def\upxi{\unicode[Times]{x3BE}}\)\(\def\upomicron{\unicode[Times]{x3BF}}\)\(\def\uppi{\unicode[Times]{x3C0}}\)\(\def\uprho{\unicode[Times]{x3C1}}\)\(\def\upsigma{\unicode[Times]{x3C3}}\)\(\def\uptau{\unicode[Times]{x3C4}}\)\(\def\upupsilon{\unicode[Times]{x3C5}}\)\(\def\upphi{\unicode[Times]{x3C6}}\)\(\def\upchi{\unicode[Times]{x3C7}}\)\(\def\uppsy{\unicode[Times]{x3C8}}\)\(\def\upomega{\unicode[Times]{x3C9}}\)\(\def\bialpha{\boldsymbol{\alpha}}\)\(\def\bibeta{\boldsymbol{\beta}}\)\(\def\bigamma{\boldsymbol{\gamma}}\)\(\def\bidelta{\boldsymbol{\delta}}\)\(\def\bivarepsilon{\boldsymbol{\varepsilon}}\)\(\def\bizeta{\boldsymbol{\zeta}}\)\(\def\bieta{\boldsymbol{\eta}}\)\(\def\bitheta{\boldsymbol{\theta}}\)\(\def\biiota{\boldsymbol{\iota}}\)\(\def\bikappa{\boldsymbol{\kappa}}\)\(\def\bilambda{\boldsymbol{\lambda}}\)\(\def\bimu{\boldsymbol{\mu}}\)\(\def\binu{\boldsymbol{\nu}}\)\(\def\bixi{\boldsymbol{\xi}}\)\(\def\biomicron{\boldsymbol{\micron}}\)\(\def\bipi{\boldsymbol{\pi}}\)\(\def\birho{\boldsymbol{\rho}}\)\(\def\bisigma{\boldsymbol{\sigma}}\)\(\def\bitau{\boldsymbol{\tau}}\)\(\def\biupsilon{\boldsymbol{\upsilon}}\)\(\def\biphi{\boldsymbol{\phi}}\)\(\def\bichi{\boldsymbol{\chi}}\)\(\def\bipsy{\boldsymbol{\psy}}\)\(\def\biomega{\boldsymbol{\omega}}\)\(\def\bupalpha{\bf{\alpha}}\)\(\def\bupbeta{\bf{\beta}}\)\(\def\bupgamma{\bf{\gamma}}\)\(\def\bupdelta{\bf{\delta}}\)\(\def\bupvarepsilon{\bf{\varepsilon}}\)\(\def\bupzeta{\bf{\zeta}}\)\(\def\bupeta{\bf{\eta}}\)\(\def\buptheta{\bf{\theta}}\)\(\def\bupiota{\bf{\iota}}\)\(\def\bupkappa{\bf{\kappa}}\)\(\def\buplambda{\bf{\lambda}}\)\(\def\bupmu{\bf{\mu}}\)\(\def\bupnu{\bf{\nu}}\)\(\def\bupxi{\bf{\xi}}\)\(\def\bupomicron{\bf{\micron}}\)\(\def\buppi{\bf{\pi}}\)\(\def\buprho{\bf{\rho}}\)\(\def\bupsigma{\bf{\sigma}}\)\(\def\buptau{\bf{\tau}}\)\(\def\bupupsilon{\bf{\upsilon}}\)\(\def\bupphi{\bf{\phi}}\)\(\def\bupchi{\bf{\chi}}\)\(\def\buppsy{\bf{\psy}}\)\(\def\bupomega{\bf{\omega}}\)\(\def\bGamma{\bf{\Gamma}}\)\(\def\bDelta{\bf{\Delta}}\)\(\def\bTheta{\bf{\Theta}}\)\(\def\bLambda{\bf{\Lambda}}\)\(\def\bXi{\bf{\Xi}}\)\(\def\bPi{\bf{\Pi}}\)\(\def\bSigma{\bf{\Sigma}}\)\(\def\bPhi{\bf{\Phi}}\)\(\def\bPsi{\bf{\Psi}}\)\(\def\bOmega{\bf{\Omega}}\)\({\rm{SE}}_{{\rm{AUC1}}}^{\rm{2}}\) + \({\rm{SE}}_{{\rm{AUC2}}}^{\rm{2}}\)⁠), where abs is absolute and SQRT is square root. We calculated F and χ2 values for group comparisons using SPSS (version 24; IBM Corp) and α = .05 (Table). We also conducted a multivariate logistic regression using these same confounding factors to predict false-negatives in the concussed group and a separate multivariate logistic regression to predict false-positives in the control group. To classify false-negatives and false-positives, we used the 80% sensitivity cutoff from the 24- to 48-hour postconcussion time point.

Table

Descriptive Data of Concussion and Control Groups

Descriptive Data of Concussion and Control Groups
Descriptive Data of Concussion and Control Groups

An initial sample of 1719 athletes was examined for eligibility, with 1559 athletes meeting the inclusion criteria. Of these 1559 athletes, 320 were assigned to the concussion group, and 1239 were assigned to the healthy control group (Table).

King-Devick difference scores predicted concussion diagnosis status at the 0- to 6-hour postconcussion (AUC = 0.724, P < .001), 24- to 48-hour postconcussion (AUC = 0.701, P < .001), unrestricted RTP (AUC = 0.640, P < .001), and 6-month postconcussion (AUC = 0.615, P < .001) time point but not at the asymptomatic time point (AUC = 0.513, P = .50). The ROC curves and outcomes across the 5 postconcussion time points are shown in Figure 1. The ROC raw data including sensitivity, specificity, likelihood ratios, and 95% CIs with corresponding K-D cutoffs are provided in Supplemental Tables 1–5 . The 0- to 6-hour and 24- to 48-hour postconcussion time points had acceptable discrimination for sensitivity and specificity, but other time points displayed fair discrimination. A cutoff score of −2.6 seconds (ie, performing 2.6 seconds faster than baseline) yielded 80% sensitivity to athletes with a concussion at the 0- to 6-hour postconcussion time point. However, this resulted in specificity of 46%. A cutoff score of −3.2 seconds (performing 3.2 seconds faster than baseline) yielded 80% sensitivity to athletes with a concussion at the 24- to 48-hour postconcussion time point. Nevertheless, this resulted in specificity of 41%. A cutoff score of −1.5 seconds (performing 1.5 seconds faster than baseline) at 24 to 48 hours postconcussion yielded 70% sensitivity, with a specificity of 57%. The ROC curves for the asymptomatic time point were not different; thus, cutoff scores are not proposed.

Figure 1

Receiver operating characteristic curve and area under the curve (AUC; ± SE) across postconcussion time points: A, 0–6 h postconcussion (concussion group = 1239, control group = 159), B, 24–48 h postconcussion (concussion group = 1239, control group = 229), C, beginning of a return-to-play protocol (asymptomatic; concussion group = 1239, control group = 283), D, unrestricted return to play (concussion group = 1239, control group = 279), and E, 6 months postconcussion (concussion group = 1239, control group = 171). The horizontal dotted line at y = 80 indicates 100% specificity at a sensitivity of 80%. a Inverted receiver operating characteristic curve.

Figure 1

Receiver operating characteristic curve and area under the curve (AUC; ± SE) across postconcussion time points: A, 0–6 h postconcussion (concussion group = 1239, control group = 159), B, 24–48 h postconcussion (concussion group = 1239, control group = 229), C, beginning of a return-to-play protocol (asymptomatic; concussion group = 1239, control group = 283), D, unrestricted return to play (concussion group = 1239, control group = 279), and E, 6 months postconcussion (concussion group = 1239, control group = 171). The horizontal dotted line at y = 80 indicates 100% specificity at a sensitivity of 80%. a Inverted receiver operating characteristic curve.

Close modal

A cutoff score of −0.8 seconds (performing 0.8 seconds faster than baseline) at the unrestricted RTP time point yielded 80% sensitivity but specificity of 37%. A cutoff score of −1.9 seconds (performing 1.9 seconds faster than baseline) yielded 70% sensitivity but specificity of 47%.

At the 6-month postconcussion time point, a cutoff score of −0.4 seconds (performing 0.4 seconds faster than baseline) yielded 80% sensitivity but specificity of 34%. A cutoff score of −1.6 seconds (performing 1.6 seconds faster than baseline) yielded 70% sensitivity with specificity of 44%. Individuals with concussion tended to perform better at the 6-month postconcussion time point than at baseline.

Influence of Confounding Factors

Most confounding factors did not demonstrate differences in the AUC (P range = .06–.94; Figure 2), except for administration mode, such that the iPad administration mode (AUC = 0.800; 95% CI = 0.747, 0.854) had a greater AUC than the spiral-bound card system (AUC = 0.646; 95% CI = 0.600, 0.692; Figure 2H). We also observed a trend for learning disorder outcomes (P = .07) such that the K-D test score differences tended to have a better AUC for those with a learning disorder (AUC = 0.819; 95% CI = 0.715, 0.924) relative to those without a learning disorder (AUC = 0.716; 95% CI = 0.680, 0.752; Figure 2D). Similarly, confounding factors did not influence the odds of false-negatives (P = .65) in the concussed group. However, the K-D administration mode did influence the odds of false-positives in the control group (P = .002): iPad administration reduced the odds of false-positives (odds ratio = 0.59; 95% CI = 0.45, 0.76).

Figure 2

Receiver operating characteristic curves at the within–48-h time point segmented by, A, sex, B, sport contact level, C, academic year, D, learning disorder, E, attention-deficit/hyperactivity disorder, F, migraine history, G, concussion history, and H, administration mode (P < .05). The horizontal dotted line at y = 80 indicates 100% specificity at a sensitivity of 80%. a Indicates difference in area under the receiver operating characteristic curve (P < .001).

Figure 2

Receiver operating characteristic curves at the within–48-h time point segmented by, A, sex, B, sport contact level, C, academic year, D, learning disorder, E, attention-deficit/hyperactivity disorder, F, migraine history, G, concussion history, and H, administration mode (P < .05). The horizontal dotted line at y = 80 indicates 100% specificity at a sensitivity of 80%. a Indicates difference in area under the receiver operating characteristic curve (P < .001).

Close modal

Overall Diagnostic Accuracy

Our ROC curves at the 0- to 6-hour and 24- to 48-hour postconcussion time points showed an interesting pattern, whereby sensitivity was lower with slower postconcussion K-D times and specificity was higher. This pattern was reversed (Figure 1D and E) at the unrestricted RTP and 6-month postconcussion time points and followed a pattern in which sensitivity was higher with slower postconcussion K-D times and specificity was lower, which better matched the clinical interpretation. Clinicians should be aware that this pattern does not follow conventional clinical patterns, despite an AUC that suggests overall moderate diagnostic accuracy.

Based on our results, the K-D test contributed acceptable diagnostic accuracy at 0 to 6 hours and 24 to 48 hours postconcussion. This is concurrent with the designer's recommendation that the test be used as a remove-from-play test.5  However, the test is not sufficient to stand alone, which supports the use of a multifactorial approach. Additionally, as expected, the K-D test had poor to no diagnostic accuracy at the asymptomatic time point. The diagnostic accuracy was fair at the unrestricted RTP and 6-month postconcussion time points, which was indicative of the K-D test correctly predicting concussion diagnosis and magnitude of practice effects. At these time points, athletes with a concussion have cleared previous time points to return to sport. The RTP timepoint has been associated with modest improvements in K-D test performance due to participating in physical activity and improved cognitive performance.24 

Practice effects greatly influence repeat K-D test performance and clinical interpretation across repeated administrations. Our results may show that a practice effect (leading to faster completion of the K-D test) has a stronger influence on postconcussion K-D test scores compared with a concussion effect (which in theory is associated with slower completion of the K-D test). Athletes with a concussion take the test 4 times in 6 months, whereas healthy athletes take the K-D test only 2 times in 1 year. Clinicians should expect considerable improvement in completion time (ie, faster) with subsequent K-D test administrations.15  A faster K-D time is a product of a practice effect, and caution should be used when interpreting the outcomes across time points for individuals with concussion. In addition, our data revealed recommended cutoff scores at each time point to improve the clinical interpretation of the K-D test during concussion recovery.

Recommended Cutoff Scores

To achieve 80% sensitivity, clinicians should expect a 1.0- to 2.6-second faster time than the baseline time within 0 to 6 hours postinjury, a 1.0- to 3.2-second faster time at 24 to 48 hours postconcussion, and a 1.0- to 1.9-second faster time at unrestricted RTP. Similar to our results, Dhawan et al25  found that a cutoff time of 2 seconds faster than baseline yielded high sensitivity (90%) and specificity (91%) in adolescent athletes, which is notably higher than our observed level of specificity. However, this study had a smaller sample size, with only 20 participants diagnosed with a concussion and 121 participants without a concussion.25  Sensitivity is calculated as the number of true positives divided by the total number of individuals with a concussion. Therefore, a smaller sample size of those with a concussion could artificially inflate the sensitivity value. Other researchers have shown 0.7- to 2.5-second improvements in K-D times in healthy athletes between 2 baseline measurements.5,15,26  Breedlove et al18  determined the K-D test to be reliable between trials and years. Retesting with a 1-year interval revealed a small improvement of 2 seconds among intercollegiate athletes.18  However, 27% of athletes showed a slower performance from year 1 to year 2.18  Overall, these previous studies and our results confirm that athletes should perform considerably faster relative to their baseline time with subsequent K-D testing. Repeated tests will cause practice effects and improvement of scores, regardless of injury.18 

Confounding Factors

Confounding factors largely did not influence K-D test diagnostic accuracy postconcussion. The K-D test AUC for those with a learning disorder was slightly higher (AUC = 0.819) relative to that of those without a learning disorder (AUC = 0.716), although the CIs overlapped (Figure 2D). The K-D test was originally developed in 1976 to study eye movement in conjunction with reading ability to screen children for learning disorders, such as dyslexia.27,28  Our findings suggested that a learning disorder does not influence the diagnostic accuracy of the K-D test much when referenced to a preinjury baseline. Future research is needed to examine whether learning disabilities influence diagnostic accuracy in the absence of a baseline assessment and in a larger sample, as ours included only 15 athletes with concussion who had a diagnosed learning disorder.

The K-D test time and errors decrease (improved performance) as age increases among high school, collegiate, and professional athletes.14  However, we found no difference in performance by academic year (freshmen to seniors). Investigators14  have also suggested that sex influences K-D test performance, whereby males tend to score worse or slower. However, our results indicated no differences across sexes. We also did not identify differences in AUC among those with ADHD. A history of concussion did not influence K-D test performance, which had not been studied earlier.

Administration mode did affect AUC outcomes. The iPad version outperformed the card system. A shift has occurred from the card system to the iPad version because the former is no longer commercially available.29  Researchers29,30  have also recommended using the iPad version over the cards because the former has higher test-retest reliability and improved testing standards to minimize errors from administrators. Institutions should use the iPad version to avoid inappropriately diagnosing individuals with concussion when they do not have one.

Lastly, physical activity such as aerobic fitness may have been another confounding factor contributing to our results at the unrestricted RTP and 6-month postconcussion time points but not at the asymptomatic time point activity, with authors31  demonstrating improved cognitive performance after a bout of exercise. The 24- to 48-hour postconcussion time point likely occurs before athletes are asymptomatic, and thus, before physical activity resumes. Our results showed the diagnostic accuracy decreased to poor levels at the asymptomatic time point and then improved to fair levels at the unrestricted RTP time point. Physical inactivity may hinder cognitive performance, which was reflected in our outcomes.

Limitations

Although our study included a large multisite sample of athletes with concussion and control athletes, it had limitations. These findings were limited to those who were participating only at these sites. In addition, we used the difference in baseline times for the control group across the 4 postconcussion time points of the concussed group because repeat testing was not available for the control group. Sample sizes for the subanalyses were much smaller, which was also a limitation. Future researchers should consider using the K-D test in healthy matched individuals across the same time points as their counterparts with concussion.

We examined a large sample of collegiate athletes from multiple sites to determine the diagnostic accuracy of a test commonly used to evaluate concussions. Based on our findings, the K-D test contributed acceptable diagnostic value at the 24- to 48-hour postconcussion time point, but it was not adequate at the asymptomatic time point and was only fair at the unrestricted RTP time point. Confounding characteristics and medical history factors were not different across the same time points. These results suggest that the K-D test should not be used as a standalone assessment at any time point. Instead, the K-D test should be included as 1 part of a comprehensive assessment battery.

Contributing CARE Consortium Investigators were April Marie (Reed) Hoy, MS, ATC (Azusa Pacific University); Darren Campbell, MD (Brigham Young University); Louise A. Kelly, PhD (California Lutheran University); John DiFiori, MD (Hospital for Special Surgery, National Basketball Association); Justus D. Ortega, PhD (Humboldt State University); Nicholas Port, PhD (Indiana University); Margot Putukian, MD (Major League Soccer); T. Dianne Langford, PhD, and Jane McDevitt, PhD, ATC, CSCS (Temple University); Christopher C. Giza, MD (University of California, Los Angeles); Holly J. Benjamin, MD (University of Chicago); Thomas W. Kaminski, PhD, ATC (University of Delaware); James R. Clugston, MD, MS (University of Florida); Joseph B. Hazzard Jr, ATC (University of Houston-Clear Lake); Patrick G. O'Donnell, MHA (University of Massachusetts Memorial Medical Center); Luis A. Feigenbaum, PT, DPT, ATC (University of Miami); James T. Eckner, MD, MS (University of Michigan); Jason P. Mihalik, PhD, CAT(C), ATC (University of North Carolina at Chapel Hill); Christina L. Master, MD (University of Pennsylvania); Anthony P. Kontos, PhD, and Michael Collins, PhD (University of Pittsburgh Medical Center); Sara P.O. Chrisman, MD, MPH (University of Washington); Alison Brooks, MD, MPH (University of Wisconsin-Madison); Jonathan Jackson, MD, and Gerald McGinty, PT, DPT (United States Air Force Academy); Carlos Estevez, DPT, OCS, ECS (United States Coast Guard Academy); Kenneth L. Cameron, PhD, MPH, ATC (United States Military Academy); Adam Susmarski, DO (United States Naval Academy); Christopher M. Miles, MD (Wake Forest University); and Laura Lintner, DO (Winston-Salem University). We thank all research team members and clinical athletic trainers who helped in the data collection for this study.

This study was made possible, in part, by support from the Grand Alliance CARE Consortium, funded in part by the National Collegiate Athletic Association and the Department of Defense (Dr Kontos and S.P.B.). The US Army Medical Research Acquisition Activity, 820 Chandler Street, Fort Detrick, MD 21702-5014, USA, is the awarding and administering acquisition office. This work was supported by the Office of the Assistant Secretary of Defense for Health Affairs through the Psychological Health and Traumatic Brain Injury Program under award W81XWH-14-2-0151. Opinions, interpretations, conclusions, and recommendations are those of the authors and are not necessarily endorsed by the Department of Defense (Defense Health Program funds).

1.
Dessy
AM,
Yuk
FJ,
Maniya
AY,
et al
Review of assessment scales for diagnosing and monitoring sports-related concussion
.
Cureus
.
2017
;
9
(
12
):
e1922
.
2.
Broglio
SP,
Katz
BP,
Zhao
S,
McCrea
M,
McAllister
T;
CARE Consortium Investigators.
Test-retest reliability and interpretation of common concussion assessment tools: findings from the NCAA-DoD Care Consortium
.
Sports Med
.
2018
;
48
(
5
):
1255
1268
.
3.
Putukian
M.
Clinical evaluation of the concussed athlete: a view from the sideline
.
J Athl Train
.
2017
;
52
(
3
):
236
244
.
4.
Borich
MR,
Cheung
KL,
Jones
P,
et al
Concussion: current concept in diagnosis and management
.
J Neurol Phys Ther
.
2013
;
37
(
3
):
133
139
.
5.
Galetta
KM,
Brandes
LE,
Maki
K,
et al
The King-Devick test and sports-related concussion: study of a rapid visual screening tool in a collegiate cohort
.
J Neurol Sci
.
2011
;
309
(
1–2
):
34
39
.
6.
King
D,
Gissane
C,
Hume
PA,
Flaws
M.
The King-Devick test was useful in management of concussion in amateur rugby union and rugby league in New Zealand
.
J Neurol Sci
.
2015
;
351
(
1–2
):
58
64
.
7.
Seidman
DH,
Burlingame
J,
Yousif
LR,
et al
Evaluation of the King-Devick test as a concussion screening tool in high school football players
.
J Neurol Sci
.
2015
;
356
(
1–2
):
97
101
.
8.
Galetta
KM,
Morganroth
J,
Moehringer
N,
et al
Adding vision to concussion testing: a prospective study of sideline testing in youth and collegiate athletes
.
J Neuroophthalmol
.
2015
;
35
(
3
):
235
241
.
9.
Galetta
KM,
Mengling
L,
Leong
DF,
Ventura
RE,
Galetta
SL,
Balcer
LJ.
The King-Devick test of rapid number naming for concussion detection: meta-analysis and systematic review of the literature
.
Concussion
.
2015
;
1
(
2
):
CNC8.
10.
Russell-Giller
S,
Toto
D,
Heitzman
M,
Naematullah
M,
Shumko
J.
Correlating the King-Devick test with vestibular/ocular motor screening in adolescent patients with concussion: a pilot study
.
Sports Health
.
2018
;
10
(
4
):
334
339
.
11.
Chrisman
SPD,
Harmon
KG,
Schmidt
JD,
et al
Impact of factors that affect reading skill level on King–Devick baseline performance time
.
Ann Biomed Eng
.
2019
;
47
(
10
):
2122
2127
.
12.
Marinides
Z,
Galetta
KN,
Andrews
CM,
et al
Vision testing is additive to the sideline assessment of sports-related concussion
.
Neurol Clinic Pract
.
2015
;
5
(
1
):
25
34
.
13.
Lempke
LB,
Schmidt
JD,
Lynall
RC.
Athletic trainers' concussion-assessment and management practices: an update
.
J Athl Train
.
2020
;
55
(
1
):
17
26
.
14.
Moran
R,
Covassin
T.
Risk factors associated with baseline King-Devick performance
.
J Neurol Sci
.
2017
;
383
(
1
):
101
104
.
15.
Leong
DF,
Balcer
LJ,
Galetta
SL,
Evans
G,
Gimre
M,
Watt
D.
The King-Devick test for sideline concussion screening in collegiate football
.
J Optom
.
2015
;
8
(
2
):
131
139
.
16.
Oberlander
TJ,
Olson
BL,
Weidauer
L.
Test-retest reliability of the King-Devick test in an adolescent population
.
J Athl Train
.
2017
;
52
(
5
):
439
445
.
17.
Galetta
KM,
Barrett
J,
Allen
M,
et al
The King-Devick test as a determinant of head trauma and concussion in boxers and MMA fighters
.
Neurology
.
2011
;
76
(
17
):
1456
1462
.
18.
Breedlove
KM,
Ortega
JD,
Kaminski
TM,
et al
King-Devick test reliability in National Collegiate Athletic Association athletes: a National Collegiate Athletic Association-Department of Defense Concussion Assessment, Research and Education Report
.
J Athl Train
.
2019
;
4
(
12
):
1241
1246
.
19.
Buckley
T,
Breedlove
K,
Oldham
J,
et al
Year to year reliability of the King Devick test in collegiate student-athletes: an NCAA/DoD Grand Alliance report
.
Neurology
.
2017
;
88
(
16
suppl):P5.220.
20.
Broglio
SP,
McCrea
M,
McAllister
T,
et al
CARE Consortium Investigators.
A national study on the effects of concussion in collegiate and US military service academy members: the NCAA-DoD Concussion Assessment, Research and Education (CARE) Consortium structure and methods
.
Sports Med
.
2017
;
47
(
7
):
1437
1451
.
21.
McCrory
P,
Meeuwisse
W,
Dvořák
J,
et al
Consensus statement on concussion in sport – the 5th International Conference on Concussion in Sport held in Berlin, October 2016
.
Br J Sports Med
.
2017
;
51
(
11
):
838
847
.
22.
Rice
SG
;
American Academy of Pediatrics Council on Sports Medicine and Fitness.
Medical conditions affecting sports participation
.
Pediatrics
.
2008
;
121
(
4
):
841
848
.
23.
Hosmer
DW,
Lemeshow
S.
Applied Logistic Regression. 2nd ed
.
John Wiley & Sons
;
2000
:
160
164
.
24.
Herring
MP,
O'Connor
PJ,
Dishman
RK.
The effect of exercise training on anxiety symptoms among patients
.
Arch Intern Med
.
2010
;
170
(
4
):
321
331
.
25.
Dhawan
PS,
Leong
D,
Tapsell
L,
et al
King-Devick test identifies real-time concussion and symptomatic concussion in youth athletes
.
Neurol Clin Pract
.
2017
;
7
(
60
):
464
473
.
26.
Leong
DF,
Balcer
LJ,
Galetta
SL,
Liu
Z,
Master
CL.
The King-Devick test as a concussion screening tool administered by sports parents
.
J Sports Med Phys Fitness
.
2014
;
54
(
1
):
70
77
.
27.
King
A,
Devick
S.
The Proposed King-Devick Test and Its Relation to the Pierce Saccade Test and Reading Levels
.
Illinois College of Optometry
;
1976
.
28.
Weise
KK,
Swanson
MW,
Penix
K,
Hale
MH,
Ferguson
D.
King-Devick and pre-season visual function in adolescent athletes
.
Optom Vis Sci
.
2017
;
94
(
1
):
89
95
.
29.
Frequently asked questions.
King-Devick Technologies, Inc
.
30.
Raynowska
J,
Hasanaj
L,
Zhang
I,
et al
Agreement of the spiral-bound and computerized tablet versions of the King-Devick test of rapid number naming for sports related concussion
.
Ann J Sports Med Res
.
2015
;
2
(
9
):
1051
1057
.
31.
Chang
YK,
Etnier
JL.
Exploring the dose-response relationship between resistance exercise intensity and cognitive function
.
J Sport Exerc Psychol
.
2009
;
31
(
5
):
640
656
.

Supplementary data