2.1. Kinect-based system setup and multifactorial test battery for fall risk assessment
Considering the practical tracking range of Kinect (max. range of 4.5 m for Kinect v2) and necessary space for different tests, the Kinect-based system setup (Fig. 1) was first optimized for easy and robust data acquisition through our internal tests. A Kinect v2 device was placed on a table, 3.2 m away from a chair and 0.7 m above the ground. A cone and a balance pad (Airex) were placed in between the Kinect device and the chair for some subtests in the test battery.
A list of representative fall risk factors covering physiological (subsystems for sensory input, central processing, and motor response), psychological (fear of falling, depression etc.), and integrated functions (gait and mobility, postural adjustments etc.) were generated from the literature review of previous studies. A corresponding Kinect-based multifactorial test battery to assess those risk factors was designed afterwards. In order to make this test battery to be scientific and practical, each subtest under the test battery should be not only valid and reliable for assessing corresponding risk factors, but also simple and quick for older people to undertake. Our developed test battery (Fig. 2) was composed of seven subtests to assess intrinsic fall risk factors for physiological, psychological, and integrated functions [6]: 1) Sensory Organization Test-SOT for sensory inputs and static balance, 2) Limit of Stability-LOS for postural stability, 3) Sit to Stand 5 times-STS5 for postural adjustment and lower-limb strength, 4) Timed Up and Go-TUG for mobility and dynamic balance, 5) Range of Motion test-ROM for joint movement and flexibility, 6) Choice Stepping Reaction Test-CSRT for sensorimotor and cognitive function, and 7) Short Fall Efficacy Scale-FES for psychological risk factor, especially fear of falling. All subtests were developed using Kinect SDK 2.0 (Microsoft Corporation, Redmond, Washington, United States) and Unity3D (Unity Technologies, San Francisco, California, United States), and they were adopted from previous research with modifications if necessary.
The walking distance of standard TUG was modified from 3 m to 2 m due to the limited tracking range of Kinect. The maximum tracking range of Kinect is about 0.8–4 m and the actual range of stable full-body tracking is even smaller. We followed the earlier studies and chose 2 m walking distance for Kinect-based TUG [16, 20]. CSRT has two versions: CSRT-M using a rubber pad and CSRT-E using an electronic pad [21]. Ejupi et al. [17] developed Kinect-based CSRT with two stepping panels on the left and right, however, the present study implemented Kinect-based CSRT with four stepping panels (left, front-left, right, and front-right) to keep the same design as the original CSRT. Each stepping panel was randomly illuminated in one trial for a total of five times, so the total trials for four panels were 20. Five-second break was given in between trials. Figure 2 shows the implemented subtests to assess fall risk, and Table 1 describes the detailed protocol of each subtest and its representative outcome measures.
Table 1
A Kinect-based multifactorial test battery to assess fall risk: subtests and representative outcome measures
Subtest name
|
Subtest protocol
|
Representative outcome measures
|
Sensory Organization Test (SOT)
|
Stand under the different conditions (C1–C4) while maintaining balance for 20 seconds
C1: Eyes open on firm surface/ground
C2: Eyes closed on firm surface/ground
C3: Eyes open on soft surface (balance pad)
C4: Eyes closed on soft surface (balance pad)
(Each condition was repeated twice)
|
Equilibrium Score (ES): The average center of gravity (COG) sway for each condition in AP/ML directions;
Composite Equilibrium Score (CES): A weighted average of equilibrium scores under the 4 conditions. It is derived from the individual equilibrium scores;
Sensory analysis ratios: Ratios of the average ESs of C2 (somatosensory), C3 (visual), and C4 (vestibular) to ES of C1 (baseline);
Sway area: Sway area of COG under each of the condition;
Sway area ratios: Ratios of sway areas of C2, C3, C4 to sway area of C1
|
Limit of Stability (LOS)
|
Shift the body weight and reach the arm as far as possible to different directions while maintaining balance
C1: Reach forward
C2: Reach leftward
C3: Reach rightward
(Each condition was repeated twice)
|
Actual reach distance: The average distance of an outstretched arm in a maximal reach from the beginning;
Normalized reach distance: Actual reach distance normalized by the stature;
COG reach distance: The average length of COG position in a maximal reach from the beginning;
Lateral imbalance: Ratio between lateral reach distances (The larger reach distance is always a numerator)
|
Sit to Stand 5 times (STS5)
|
Stand and sit on a chair 5 times as fast as possible
(The test was repeated twice)
|
Total completion time: The average elapsed time from the beginning to last standing up;
Average time of sit-stand (stand-sit): The average elapsed time from sitting to standing or standing to sitting;
COG moving distances: The average COG moving distance in AP and ML directions while sit-stand and stand-sit
|
Timed Up and Go (TUG)
|
Stand from a chair, walk 2 m, turn, come back, and sit on the chair with normal walking speed
(The test was repeated twice)
|
Total completion time: The average elapsed time to complete the TUG test;
Elapsed time of each phase: The average elapsed times for sit-to-stand, walking, turning, and stand-to-sit phases;
Duration of each phase (%): The percentage of the elapsed time of each phase to the total completion time;
Step width: The average step width in the walking phase;
Step duration: The average duration of each step in walking phase;
Number of steps: The total number of steps in walking phase
|
Range of Motion test (ROM)
|
Sit on the chair, fully extend a knee and hold for 2 sec, then bend the knee and hold for 2 sec
(Repeat 3 times for each knee)
|
Knee flexion angle: The average of minimal knee angles while bending the knee;
Knee extension angle: The average of maximal knee angles while extending the knee;
Knee range of motion: The average of difference between knee extension and flexion angles
|
Choice Stepping Reaction Test (CSRT)
|
Step on an illuminated panel as fast as possible (four random panels: left, left-front, right and right-front; each panel was illuminated five times)
|
Total completion time: The total elapsed time to complete 20 trials of the test;
Reaction time: The average time between when a panel is illuminated and a foot starts to move;
Movement time: The average time between when a foot starts to move and when the illuminated panel is stepped
|
Short Fall Efficacy Scale (FES)
|
Choose the right level of concern about falling during seven daily activities
|
FES score: The total score of seven evaluation questions (Range: 7–28)
|
C1–C4: condition 1–condition 4, AP: anterior-posterior, ML: medio-lateral, COG: center of gravity |
2.2 Experimental Participants
All participants were community-dwelling volunteers from three cities (Cheongju, Sejong, and Incheon) in South Korea. The eligibility criteria were as follows: age ≥ 65 years, female, and able to walk independently without the use of assistive devices. This study focused on older women because they were reported to have higher fall risks than older men [22, 23]. In total, 106 community-dwelling older Korean women participated in this study. This sample was based on convenience sampling instead of random sampling due to challenges from COVID-19 pandemic. Each participant gave the written informed consent prior to participation. All participants performed the seven subtests sequentially (in the order of SOT, LOS, STS5, TUG, ROM, CSRT, and FES) and completed the test battery within 25-min. Their self-reported history of falls in the past 1-year was also collected. The study was ethically approved by KAIST Institutional Review Board (IRB No: KH2020-015).
2.3 Investigation of prospective falls
The events of prospective falls were investigated over six months after the fall risk assessment [24]. The investigation was conducted biweekly by text message or by telephone if there was no text reply. Four participants lost fall monitoring as three failed to follow up and one decided to withdraw from the study. Therefore, a total of 102 (106-4) participants remained in this study and their data were further analyzed.
Participants who experienced prospective falls during the 6-month follow-up period were classified as ‘high fall risk group’; otherwise, they were classified as ‘low fall risk group’. According to the above criteria, 22 (21.6%) older participants belonged to the high fall risk group and the rest of 80 (78.4%) participants belonged to the low fall risk group. Table 2 shows the sample characteristics of the high and low fall risk groups. Compared with the low fall risk group, the high fall risk group was significantly older (p = 0.048) and experienced more falls in the past 1-year (p = 0.046), but there were no significant differences in height, weight and BMI (p > 0.05).
Table 2
Sample characteristics of high fall risk group and low fall risk group
Characteristics
|
High fall risk group
(N1 = 22)
|
Low fall risk group
(N2 = 80)
|
#Two-sample comparison,
p-value
|
Age (years)
|
76.6 ± 5.0
|
74.2 ± 5.1
|
0.048
|
Height (cm)
|
155.1 ± 4.1
|
154.8 ± 4.6
|
0.790
|
Weight (kg)
|
58.3 ± 6.8
|
57.2 ± 6.7
|
0.523
|
BMI (kg/m2)
|
24.0 ± 3.1
|
23.9 ± 2.8
|
0.883
|
Fall history in the past 1-year
|
|
|
0.046
|
Yes
|
11 (33.3%)
|
22 (66.7%)
|
|
No (reference)
|
11 (15.9%)
|
58 (84.1%)
|
|
#Continuous variables were analyzed with two-sample t-tests and categorical variables were analyzed with chi-squared test. |
2.4 Kinect data processing and feature extraction
Figure 3 shows 25 skeletal joints tracked by Kinect v2 and their coordinate system. The joint position data collected by Kinect were first filtered by a first-order Butterworth low-pass filter with a cutoff frequency of 10 Hz [25] to remove noise in the time-series data. Then various algorithms were developed for extracting meaningful fall risk measures from the skeletal data. During this process, different skeletal joints were used for different subtests due to their high relevance with the corresponding subtests and the specific movement patterns recorded during each subtest [6]. For example, hand joints were used to compute reach distances in LOS; for ROM, hip, knee, and ankle joints were used; while for TUG, spine base, spine shoulder, elbow, and foot joints were used.
In order to simplify the explanation of algorithm development for extracting important features from the Kinect data to compute outcome measures in subtests, one of the most complicated subtests for feature extraction-TUG was illustrated as a representative example (Fig. 4). The entire TUG task was divided into four phases: sit-to-stand, walking, turning, and stand-to-sit as Kargar et al. [19]. The turning phase was determined by the absolute difference between x-coordinates of the left and right elbows as shown in Fig. 4B (top, left). When turning, the x-coordinates of left and right elbows will theoretically overlap. Therefore, the position difference between two elbows along the x-axis will first reach to a local minimum and then return to the original difference. This pattern would happen again in the transition between the walking phase and the stand-to-sit phase because the subject should turn around and sit on a chair. The entire TUG phase (from start to end) was extracted by using the z-coordinates of the spine base joint as shown in Fig. 4B (top, right). Since the spine base joint was located close to the center of mass, its position data tracked by Kinect was very stable. The initial moment when the z-coordinate of this joint starts to decrease is the start moment of the test, and the moment when the z-coordinate becomes the smallest is the turning moment. After turning, the z-coordinate will continue to increase until it stabilizes at the initial position at the end of the test. After the starting point, the end of the sit-to-stand phase (i.e. just before the walking phase) and start of the stand-to-sit phase (i.e. right after the walking phase) were determined by comparing the y-coordinate of the spine shoulder joint with its height when full standing, as shown in Fig. 4B (bottom, left). Gait-related outcome measures were derived from two foot joints. As shown in Fig. 4B (bottom, right), the local peaks of the difference between the z-coordinates of the left and right feet were related to the gait cycle and characteristics, and they were used to further calculate the number of steps, step width, and step duration. Due to the validity issue [20, 26], all gait outcome measures were calculated using only data before the turning phase.
2.5 Statistical analysis and fall risk modeling
Figure 5 summarizes the whole process of statistical data analysis and fall risk modeling. Two-sample t-tests were conducted first on all outcome measures from seven subtests to identify significant ones between high and low fall risk groups. Then, a Receiver Operating Characteristic (ROC) analysis was performed with each significant outcome measure to examine the discriminative power on classifying the fall risk groups. Fall risk classification models were constructed afterwards by using only significant outcome measures in both t-test and ROC analysis as predictors. The same analysis process was performed for the sample characteristic variables in Table 2, which were collected through surveys and easy to be included in the fall risk classification model. For significance tests of categorical variables, such as the history of falls, Chi-squared test and univariate logistic regression were used (Phase 1). Because the performance of a classification model can be different how to split training and test datasets, three exclusive dataset splits were made to investigate the overall performance of classification models. Phases 2 and 3 in Fig. 5 represent an example to develop a fall risk classification model using a dataset split. Each dataset split consisted of 70% of the training set and 30% of the test set. Due to the concern of potentially biased classification results caused by the high imbalance between high and low fall risk groups (22 vs 80), the Synthetic Minority Over-sampling Technique (SMOTE) was applied for the training set before constructing the classification model, as suggested by previous studies [27] (Phase 2). Afterwards, the classification model was constructed by using the random forest algorithm for the oversampled training set (Phase 3). The hyperparameters of the random forest algorithm were tuned by using the random search method with 3-fold cross-validation, and the final model was evaluated by the test set. The balanced accuracy, sensitivity and specificity were used to evaluate the model classification performance, as the high and low fall risk groups were imbalanced [28].
Statistical analysis of significance tests was conducted using IBM SPSS Statistics 20 (IBM Corporation, New York, United States) with a significance level of 0.05. Kinect data were processed on Visual Studio 2019 (Microsoft Corporation, Redmond, Washington, United States). Data augmentation and classification model construction were performed using the imbalanced-learn and scikit-learn packages in python [29].