Accurately diagnosing the level of disorders of consciousness (DOC) is clinically important because it can be used to determine whether the patient should be treated with pharmacologic therapeutics, noninvasive brain stimulation, or other therapeutic options [1]. It is also important for determining the prognosis of persons with DOC, which may reduce the burden on caregivers [2–4]. Furthermore, distinguishing between DOC status is important from a medico-ethical perspective, as minimally conscious state (MCS) and vegetative state/unresponsive wakefulness syndrome (VS/UWS) pose distinct ethical issues [5].
A variety of clinical tools have been developed to assess consciousness, such as the Glasgow Outcome Scale, Glasgow Outcome Scale-Extended, and Coma Recovery Scale-Revised (CRS-R) [6]. In the assessment using these scales, persons with DOC are provided with a variety of stimuli to identify meaningful behavioral responses to them. Among the various clinical assessment tools, the CRS-R is considered a standardized assessment tool for diagnosing DOC and has been widely used in research as well as in clinical practice [1, 3, 7]. However, it is prone to measurement error [8–10] as the level of response to a stimulus can vary depending on the circadian rhythm, medication use, comorbidities, and presence of coexisting sensory or motor impairments [11]. Moreover, unexpected distracting stimuli in the environment in which the assessment is being conducted and the skill level of the raters can affect the accuracy of the assessment results [5, 12]. To overcome these limitations, repeatedly assessing persons with DOC using the CRS-R has been proposed [10, 13]. However, owing to the complexity of the CRS-R assessment, it takes a substantial amount of time to conduct the assessment [14], making repeated use of the CRS-R infeasible in a clinical setting.
Including visual tracking as part of the diagnostic criteria for DOC was proposed due to the higher prevalence of visual response among persons with MCS than among persons with VS and its association with more favorable prognostic outcomes [15–17]. Furthermore, visual responses, such as visual pursuit or visual fixation, have been recognized as the earliest and most frequently observed signs of consciousness [18–21]. Another study showed multimodal evidence that, among persons with DOC, the group with cortically mediated visual responses had qualitatively different visual processing compared to the group with reflexive visual responses [22]. Therefore, there have been attempts to assess the level of consciousness using eye-tracking technology [23–26], which have demonstrated that utilizing eye-tracking technology to determine levels of consciousness achieves accuracy comparable to clinical assessments agreed upon by domain experts [24], suggesting its utility in predicting the emergence from DOC [23, 25, 26]. For these reasons, expert consensus obtained through the Delphi method has also presented a unanimous agreement on the necessity of employing eye-tracking (or pupil-tracking) technology in the assessment of DOC [1]. Pupil tracking technologies generally necessitate an initial calibration procedure to ensure accuracy in data collection [24]. For persons with DOC, the calibration process is challenging or even impossible. Consequently, previous studies either did not perform the calibration process or did not consider it [23, 26, 27]. Wannez et al. sought to overcome the inaccuracy of uncalibrated pupil trajectories by extracting meaningful values from the pupil trajectories of persons with DOC [24]. However, their method using the Pearson correlation value between the coordinates of the pupil trajectory and the mirror trajectory is inherently vulnerable to measurement errors due to the high granularity of the pupil trajectory coordinates, leading to errors during signal acquisition, filtering, and statistical processing.
Recently, virtual reality (VR) technology has been increasingly employed in clinical settings for assessment and therapy [28]. The integrated information theory, which explains consciousness in terms of measurable amounts of information, is a framework that quantifies the level of consciousness by calculating the amount of information generated by the brain [29]. The composition of stimuli perceived by the patient, influenced by exogenous stimuli, affects the amount of integrated information (φ-value), a phenomenon previously referred to as the contextuality of consciousness [30, 31]. Therefore, to determine the level of consciousness through the calculated amount of integrated information, it is essential to control the influence of exogenous stimuli. Exogenous stimuli can be divided into intended stimuli and noise. It is impossible to completely control the influence of exogenous stimuli in the real world due to the uncontrollable nature of noise. However, use of VR apparatus makes it possible to eliminate the noise and effectively control the intended stimuli by configuring the immersive environment. Consequently, in a VR environment, the level of consciousness can be estimated through the amount of integrated information calculated from responses to the same intended stimuli with minimal bias (represented in Fig. 1). This technology has the potential to function as a debiasing tool at a level unattainable by previous studies using computer screens or mirrors [23, 24, 26, 27]. Therefore, we expect that evaluation of the level of DOC using a VR-based eye-tracking system could complement the CRS-R to improve diagnostic accuracy. Accordingly, in this study, we aimed to devise a VR-based eye-tracking system using visuoauditory stimuli delivered through a head-mounted display (HMD) and to demonstrate the validity and clinical utility of the system in assessing consciousness while addressing the limitations of uncalibrated pupil trajectories.