Participants
The study involved 56 participants aged over 60 years, consisting of 20 individuals with HI and 36 typical hearing (TH). All participants, who were native Korean speakers, met the following criteria: they obtained scores within normal ranges on the Korean version of the mini mental state examination (K-MMSE)36, the Seoul verbal learning test (SVLT), the digit span tests from the standardized Seoul neuropsychological screening battery-II37, and a short version of the geriatric depression scale (< 8 out of 15)38. None of the participants reported any visual impairments, learning difficulties, or a history of brain injuries.
All experiments were conducted in accordance with relevant guidelines and regulations. This research was approved by the Institutional Review Board on Human Subjects of Ewha Womans University (Approval No. 2022-0084), and all procedures were performed following ethical standards and institutional policies. Prior to participation, all individuals provided written informed consent.
An independent sample t-test was conducted, revealing no statistically significant differences in age, education years, or screening results of neuropsychological tests between the TH and HI groups (all p-values > 0.05). Detailed demographic information and descriptive statistics are provided in Table 1.
Table 1
Participants’ descriptive information and screening results of neuropsychological tests; TH, Typical hearing; HI, Hearing impairment; M(SD), mean(standard deviation); K-MMSE, Korean version of the mini-mental state examination; SVLT-imme., Seoul verbal learning test - immediate recall; SVLT-delayed, Seoul verbal learning test - delayed recall; Digit span-F, Digit span test forward; Digit span-B, Digit span test backward; dB HL, decibels hearing level.
| | TH Group n = 36 | HI Group n = 20 | Test statistics | p-value |
Gender | Male | 23 (63.8%) | 14 (70%) | | |
Female | 13 (36.1%) | 6 (30%) | | |
Age (years) | M(SD) | 64.03 (3.72) | 65.55 (3.66) | 1.481 | 0.146 |
Range | 60–72 | 60–72 |
Education (years) | M(SD) | 14.56 (2.97) | 13.95 (3.18) | -0.698 | 0.490 |
Range | 9–23 | 9–23 |
K-MMSE (max: 30) | M(SD) | 28.97 (1.36) | 28.75 (0.96) | -0.709 | 0.482 |
Range | 26–30 | 26–30 |
SVLT-imme. (max: 36) | M(SD) | 21.28 (3.74) | 20.45 (3.84) | -0.780 | 0.440 |
Range | 15–31 | 15–30 |
SVLT-delayed (max: 12) | M(SD) | 7.35 (2.03) | 6.86 (2.09) | 0.854 | 0.398 |
Range | 4–11 | 4–12 |
Digit span-F (max: 14) | M(SD) | 10.44 (2.22) | 8.95 (2.85) | -2.024 | 0.051 |
Range | 6–14 | 5–14 |
Digit span-B (max: 14) | M(SD) | 7.64 (2.76) | 6.20 (2.74) | -1.876 | 0.068 |
Range | 4–14 | 4–14 |
Hearing threshold -left ear (dB HL) | M(SD) | 19 (5.97) | 33 (9.05) | 6.47 | < 0.001 |
Range | 10–41 | 24–58 |
Hearing threshold -right ear (dB HL) | M(SD) | 19 (4.34) | 37 (10.59) | 7.19 | < 0.001 |
Range | 11–28 | 25–62 |
Hearing threshold -better ear (dB HL) | M(SD) | 17 (3.83) | 33 (7.90) | 8.71 | < 0.001 |
Range | 10–24 | 25–41 |
M(SD) | 3.47 (0.73) | 3.25 (0.82) |
Audiology measurements
The data for this study were specifically limited to participants diagnosed with sensorineural hearing loss, which was assessed using with a standard pure-tone audiometry device (AS608 Basic; Interacoustics A/S, Middelfart, Denmark) at audiometric frequencies of 0.5, 1.0, 2.0, and 4.0 kHz. The averaged thresholds were calculated by assigning greater weight to mid-frequencies crucial for speech perception (calculated as: average threshold = (a + 2b + 2c + d) / 6, where a, b, c, and d represent the thresholds at 0.5, 1.0, 2.0, and 4.0 kHz, respectively)39. Participants were classified as having HI if their average hearing threshold in the better ear exceeded 25 decibels hearing level (dB HL) 40, and only those individuals whose hearing threshold differences between the two ears were within 15 dB HL were included. Audiological reports for each group are detailed in Table 2, with the corresponding audiograms presented in Fig. 1.
Table 2
Audiological details for each group; TH, typical hearing; HI, hearing impairment; SD, standard deviation.
| Frequency in Hertz |
Audiometry | Group | Measure | 500 | 1000 | 2000 | 4000 |
Left | Right | Left | Right | Left | Right | Left | Right |
AS608 | TH group (n = 36) | Mean | 20 | 22 | 17 | 16 | 19 | 18 | 24 | 22 |
Median | 20 | 25 | 15 | 15 | 20 | 18 | 22 | 20 |
SD | 6.87 | 7.22 | 6.81 | 5.41 | 7.85 | 7.50 | 11.7 | 9.59 |
Minimum | 10 | 5 | 10 | 5 | 5 | 5 | 5 | 10 |
Maximum | 35 | 40 | 40 | 25 | 45 | 45 | 65 | 50 |
HI group (n = 20) | Mean | 26 | 28 | 28 | 30 | 40 | 39 | 50 | 53 |
Median | 25 | 25 | 28 | 27 | 40 | 35 | 48 | 50 |
SD | 9.45 | 9.53 | 9.63 | 11.5 | 14.2 | 17.8 | 17.5 | 21.2 |
Minimum | 10 | 10 | 15 | 15 | 20 | 10 | 20 | 15 |
Maximum | 55 | 50 | 55 | 55 | 70 | 80 | 85 | 100 |
Transcutaneous auricular vagus nerve stimulation (taVNS)
We administered stimulation using a commercial taVNS device (allearsTM TODOC, Seoul, South Korea) set at 25 Hz frequency, 200 µs pulse width, and a cycle of 30 sec ON and 30 sec OFF was repeated. The intensity of the stimulation was adjusted to 0.5 mA below each participant's pain threshold. The stimulation was calibrated to induce a tingling sensation, indicative of activating the afferent fibers of the auricular vagus nerve41, 42, and was targeted at the cymba concha of the ear, a site known for its strong activation of afferent vagal pathways43. The stimulation was applied to either the left or right ear, with counterbalancing across participants.
N-back task
We conducted n-back tasks targeting two distinct WM domains: verbal vs. visuo-spatial. The verbal n-back task was based on the SemBack version44, while the visuo-spatial n-back task was adapted from the study by Christensen and Wright45. To manipulate task complexity, we varied the 'n' values (1-back vs. 2-back). The number of task items was matched, with 22 target items out of 73 items in the 1-back condition and 22 targets out of 84 items in the 2-back condition. Each participant engaged in a preparatory block consisting of 10 practice items, including 2 targets, before beginning each n-back task.
In the verbal n-back task, each word in written form was presented on the screen, and participants were asked to press the spacebar if the current item belonged to the same semantic category as the item presented 1 or 2 back. The semantic categories included three distinct groups: fruits, animals, and clothes (Supplementary 1). The syllable length and structure in Korean were balanced across the three semantic categories.
The visuo-spatial task featured three-dimensional block cubes from Shepard and Metzler46, arranged in various configurations across four blocks. Participants were asked to press the spacebar when the current item matched the shape of the block presented 1 or 2 back.
All the series of stimuli used in this study were presented visually using PsychoPy47 on a 39.6 cm laptop screen (Samsung Galaxy Book Pro 39.6 cm Core™ i5), as illustrated in Fig. 2. Each trial began with the presentation of a fixation cross, which was displayed for 2750 ms, followed by a stimulus presented for 750 ms. Between each stimulus, the fixation cross was consistently displayed for 2750 ms, maintaining a regular interval throughout the experiment.
Experimental protocol
The study employed a two-session, within-subject, randomized crossover, single-blind, sham-controlled design (as detailed in Fig. 3). The two sessions were divided into an active session and a sham session, based on the presence or absence of taVNS stimulation, and the order of stimulation counterbalanced. Participants received both active taVNS and sham stimulation, with each session spaced at least 7 days apart. To ensure consistency, the sessions were preferably conducted at the same time of day.
Prior to each session, participants were instructed to refrain from tobacco (2 hours prior), caffeine (6 hours prior), medications, alcohol, and intense physical activity (24 hours prior). To mitigate potential expectation biases, no information was provided regarding the specific stimulation condition administered in each session or its expected effects.
The order of the verbal- and visuo-spatial n-back was balanced but consistent for individual participants across both sessions. Upon completing the tasks, stimulation ceased, and participants were given a questionnaire regarding any negative effects resulting from the stimulation.
Statistical analysis
Statistical analysis was conducted using Python package (v1.2.0). The dependent variables included accuracy (%) and response time (ms). Accuracy (%) was determined by considering trials where participants correctly responded in the n-back tasks. The number of correct responses were divided by the total number of responses (44) per condition, multiplied by 100 to calculate the percentage of correctly responded trials. Response time (RT)—the duration from the presentation of stimuli on the screen to the moment the button was pressed—was measured in milliseconds (ms). Only trials where participants responded correctly to the n-back tasks were included into the RT analyses.
For each dependent variable, two-separate four-way mixed-design analyses of variance (ANOVAs) were conducted with group (TH vs. HI), stimulation condition (Sham vs. Active), WM domain (Verbal vs. Visuo-spatial), and task complexity (1-back vs. 2-back) as independent variables.