Inferring Grasp Intentions from Arm Trajectories via Deep Learning to Enable Functional Movement in Quadriplegia

doi:10.21203/rs.3.rs-18757/v1

Download PDF

Short report

Inferring Grasp Intentions from Arm Trajectories via Deep Learning to Enable Functional Movement in Quadriplegia

https://doi.org/10.21203/rs.3.rs-18757/v1

This work is licensed under a CC BY 4.0 License

Journal Publication

published 25 Aug, 2020

Read the published version in Bioelectronic Medicine →

You are reading this older preprint version

Read the latest preprint version →

Background

Cervical spinal cord injury severely affects grasping ability of its survivors. Fortunately, many individuals with quadriplegia retain residual arm movements that allow them to reach for objects. We propose a wearable technology that utilizes pattern recognition and deep learning methods to automatically classify arm trajectories and infer grasping intentions. Further, this technology can enable individuals with SCI to grasp objects without assistance via neuromuscular stimulation.

Methods

Two cervical SCI participants performed various reaching movements and smooth trajectories in space, which were recorded using an inertial sensor worn on their wrist. Time series classifiers were trained to recognize the trajectories using either a Dynamic Time Warping (DTW) algorithm or a Long Short-Term Memory (LSTM) recurrent neural network. Successful trajectory prediction in real-time was demonstrated using DTW, which when used in combination with a high density neuromuscular stimulation sleeve with textile electrodes, enabled participants to perform functional grasps.

Results

In offline comparisons, LSTM (mean accuracies 98% and 99%) performed significantly better than DTW (mean accuracies 94% and 83%) for both 2D and 3D reaching movements, respectively. Type I and II errors occurred more frequently for DTW (up to 38% and 15%, respectively), whereas it stayed under 3% for LSTM. Also, DTW achieved online accuracy of 79 ± 5%.

Conclusions

We demonstrate the feasibility of inferring grasping intention from reaching trajectories using wearable sensors. Importantly, this technology can be successfully used to control neuromuscular stimulators and restore functional independence to individuals living with paralysis.

Trial registration: NCT, NCT03385005. Registered September 26, 2017, https://clinicaltrials.gov/ct2/show/NCT03385005

Biotechnology and Bioengineering

Physical Medicine & Rehab

Spinal cord injury

neuromuscular stimulation

inertial measurement unit

IMU

machine learning

neural networks

wearable

In United States alone, every year there are more than 17,700 new cases of spinal cord injury (1). A majority of these injuries results in incomplete (48%) and complete (12%) quadriplegia, which severely affects arm and hand movements of the survivors and undermines their quality of life. Neuromuscular stimulation offers a viable solution to assist with arm and hand movements to increase independence, but often users find it challenging to efficiently control such stimulation devices for everyday use. Therefore, several different modalities have been developed to extract user intent for controlling neuromuscular stimulation devices in order to restore grasping. These modalities range from conventional push button or shoulder position control (2,3), to implanted muscle sensors (4), and most recently brain implants (5,6).

Grasping an object is often preceded by reaching for the object. In fact, previous studies have shown that grasping intentions of amputees and able-bodied participants could be inferred from their muscle activity (electromyogram signals) during reaching (7). Quadriplegia is most often caused by damage to the C5 vertebra and importantly, individuals with C5 and below level injury retain sufficient control over their deltoid and biceps muscles, which allows them to reach for objects (8,9). Therefore, we proposed to develop a non-invasive approach that can infer grasping intentions of quadriplegics from their reaching and other novel arm trajectories. Unlike previous studies that used multi-channel surface electromyography for deciphering reaching movements (7), here we used a single low-cost, wearable and easy-to-setup inertial sensor. Further, we combined our non-invasive grasp inference technique with a custom built neuromuscular stimulator and sleeve, to facilitate hand opening and closing in quadriplegic SCI participants and enabled them to perform functional movements (e.g. eat a granola bar).

In recent years, inertial measurement units (IMU) are extensively being used for human computer interactions, particularly for gesture recognition and wearable sensing (10). With advancement in portable computing devices, sophisticated machine learning algorithms such as recurrent neural networks, can be readily deployed for deciphering IMU data (11). In this study, we compared a well-known pattern recognition algorithm called Dynamic Time Warping (DTW) with a recurrent neural network for time series classification called Long Short-Term Memory (LSTM) and classified reaching trajectories in 2-dimensional (2D) and 3-dimensional (3D) space. We hypothesized that while DTW-based techniques are easily deployable and computationally inexpensive, LSTM networks with inherent long-term dependencies will perform more consistently across multiple days.

In Sect. 2, methods for the paper describing experimental setup, study protocol, and training of machine learning algorithms are presented. Section 3 presents results from offline and online validation of the algorithms, based on data from two SCI participants and discusses its significance.

A Participants

Two participants with quadriplegia were recruited for the study after providing informed consent. The study protocols were approved by the Institutional Review Board of Northwell Health (Great Neck, NY). Participant 1 was a 32 year old male, injured 6 years prior, with a C4/C5 ASIA (American Spinal Injury Association) B injury. He participated in 10 sessions, out of which 7 sessions were used to record 2D and 3D arm movement trajectories. During the remaining 3 sessions, grasping intentions were decoded online (in real-time) and used to drive a custom neuromuscular stimulator with textile-based electrodes housed in a sleeve (12). This in turn allowed the participant to perform functional movements (e.g. eat a granola bar). Participant 2 was a 28 year old male, injured 10 years prior, with a C4/C5 ASIA A injury. He participated in 3 sessions, which involved 2 training and 1 online testing session.

B Experiment Setup and Data Collection

Participants were seated with their hands initially resting on a table. A wireless sensor module was attached to the wrist of their arm using a Velcro strap. While both participants were bilaterally impaired, each still possessed residual movement that allowed reaching with at least one of their arms and was eventually used for the study. The sensor module consisted of a 32-bit ARM microcontroller unit (MCU) from Adafruit (Feather Huzzah32) and a Bosch SensorTec BNO055 9-axis IMU. The IMU has a built-in processor and algorithms to estimate its orientation and perform gravity compensation in real-time to produce linear acceleration in three orthogonal directions. Linear acceleration along the X, Y, and Z axes was available externally via an I2C interface. A flexible printed circuit board was designed to interconnect the IMU with the MCU as shown in Fig. 1B. Data was continuously streamed from the MCU at 50 Hz via Bluetooth to MATLAB 2019a running on a desktop PC and stored for offline processing.

During the experiments, verbal cues associated with different 2D and 3D movement trajectories were randomly called out to the participant. The participants were instructed to perform the reaching trajectories starting from the edge or corner of the table and move towards the center, using smooth movements that were up to a second long. Three different 3D reaching trajectories: a sideways arc (e.g. reaching for a cup or bottle, Fig. 1A), a vertical arc (e.g. reaching for a pen or marker lying on a table), and a corkscrew motion were trained. Additionally four 2D trajectories (performed in the horizontal plane) corresponding to well-known English and Greek letters: S, ε (epsilon or E), γ (gamma), and M were trained. Experiments were conducted in blocks of 18–20 trials and sufficient breaks were given between blocks to minimize participant fatigue. Initially, the participants were asked to perform only S and ε trajectories because these were simple to learn and didn’t cause fatigue. Later, once the participants became comfortable with moving their arm, we included additional 2D and 3D trajectories. Thus, in our final datasets there was a higher percentage of 2D trajectories (especially, S and ε) than the remaining trajectories.

During online (real-time) testing of grasp inference, participants also wore a custom-built fabric sleeve with 128 textile-based electrodes over their forearm to receive neuromuscular stimulation, which activated the appropriate extensor and flexor muscles to open their hand and evoke different grasps (e.g. pinch, cylindrical, etc.). Neuromuscular stimulation was provided by a custom proprietary, battery-operated, 8-channel, voltage-controlled stimulator, with a stimulation pulse frequency of 20 Hz. The stimulation channels were mapped to individual or multiple electrodes on the fabric sleeve, in order to evoke various finger flexion and extension type movements. By grouping multiple stimulation channels and sequencing their activation profile, we could program different grasp types such as cylindrical and pinch grasps. Figure 1C shows still images of an SCI participant using a simple 2D trajectory (e.g. M) to grasp and eat a granola bar with his paralyzed hand.

C Data Processing and Machine Learning

The 3-axis linear acceleration obtained from the IMU was band-pass filtered (Butterworth, 8th order, 0.2–6 Hz) and processed offline for identifying training samples. The magnitude of the 3-axis acceleration vector was used to identify onset of movement by setting a threshold of 0.95 g. The movement onsets were then used to segment the acceleration data over time along the X, Y, and Z axes into windows ranging − 0.1 s to 0.9 s with respect to onset. Each trial was visually confirmed to be free from any noise artifacts or if it exceeded the 1 s window and such trials were excluded from further analysis. Next, two time series classifiers based on either a Dynamic Time Warping (DTW) distance measure or Long Short Term Memory (LSTM) network algorithms were trained separately for 2D and 3D trajectories.

The DTW algorithm optimally aligns a sample trajectory with respect to a previously determined template trajectory such that the Euclidean distance between the two trajectories is minimized. This is achieved by iteratively expanding or shrinking the time axis until an optimal match is obtained. For multivariate data such as acceleration, the algorithm simultaneously minimizes the distance along the different dimensions using dependent time warping (13). In our DTW-based classifier, this algorithm was used to compute the optimal distance between a test sample and pre-defined templates associated with the 2D and 3D trajectories. Ultimately, the template with the smallest optimal distance to the test sample, was selected as the classifier’s output. Since the classifier’s output is dependent on the quality of its templates, we used an internal optimization loop to select the best template trajectory from a set of training trajectories. Within this loop, the DTW scores of each training sample with every other training sample was computed. Then the training sample with the least aggregate DTW score, was chosen as the template for that trajectory.

To implement the LSTM network we used MATLAB R2019b Deep Learning Toolbox with default values for most parameters. Specifically, an LSTM network comprising of a single bidirectional layer with 10 hidden units was used. This transformed the 2D or 3D linear acceleration data into inputs for a fully connected layer whose outcome was binary, i.e. 0 or 1. Next, a softmax layer was used to determine the probability of multiple output classes. Finally, the network output mode was set at ‘last’, so as to generate a decision only after the final time step has passed. This allowed the LSTM classifier to behave similarly to DTW and classify trajectory windows. During training of the LSTM network weights, an adaptive moment estimation (ADAM) solver was used with a gradient threshold of 1 and maximum number of epochs of 200. Since all the training and validation data were 1 second long, zero padding was not used.

During real-time classification of arm trajectories, the linear acceleration signals were filtered and processed in real-time using a MATLAB script that looped at 50 Hz. Within the loop, the acceleration data was divided into 1 second long segments with 98% overlap. To demonstrate proof-of-concept, only the DTW-based classifier was implemented and was designed to compare the incoming acceleration windows with 2D trajectories. If the optimal distance between trajectories were below 10 units (empirically determined), then positive classification was issued, which then triggered our custom neuromuscular stimulator to perform a complete movement sequence of opening and closing of the hand.

Over 250 training samples across 7 movement trajectories were recorded for participant 1 and 96 samples from 5 movement trajectories were recorded for participant 2. Trials with noisy sensor data or incorrect labels were visually identified and removed from the training set. Table 1 shows the distribution of samples across different 2D and 3D trajectories for both the participants. The top row also shows the relative position estimation for the participant’s hand in space, which was obtained by double integration of IMU data.

Given the unequal distribution of samples in our dataset, a 5-fold stratified cross-validation scheme was selected for evaluating DTW and LSTM based classifiers. Figure 2 shows the mean ± standard deviation (SD) classification accuracy for the 2 participants. In the offline scenario both DTW and LSTM based classifiers performed well for 2D trajectories, achieving 94 ± 5% and 98 ± 3% accuracy, respectively. For offline 3D trajectories however, LSTM outperformed DTW and obtained 99 ± 3% accuracy over 83 ± 16%. Using two-sided Wilcoxon rank sum test, LSTM based classification accuracy was significantly better than DTW (p < 0.05) in both cases. Also shown in Fig. 2, is the online performance of DTW based classifier for 2D trajectories. During online classification, we either compared between 2 trajectories (e.g. S v/s ε) or between a single trajectory and rest (e.g. M v/s rest) and achieved 79 ± 5% accuracy.

To further evaluate each classifier’s performance for type I and II errors, we calculated their cumulative confusion matrices by adding the confusion matrices from each fold for each participant. The resulting confusion matrices for both classifiers and for both types of trajectories are shown in Fig. 3.

For DTW-based classifier, type I error occurred more frequently for 3D than 2D trajectories. The highest percentage of type I error occurred for the corkscrew trajectory (37.8%), followed by vertical arc (14%), ε (10.2%) and M (10%) trajectories. In terms of type II errors, DTW-based classifier misclassified vertical arc (14.5%), side arc (13.8%) and S (8.33%) trajectories as compared to rest of the classes. For LSTM-based classifier the type I and II errors were very low and ranged from 0–3% for almost all trajectories, with the exception M trajectory that had a type I error rate of 40%. This is probably because we had only 10 trials of M trajectory for training, which weren’t enough for the LSTM classifier to distinguish it from other classes that had larger number of samples.

A potential limitation of this study is that the LSTM-based classifier has not been validated during online testing. This is still under development and will be reported in a future publication of this study. Nonetheless, LSTM’s highly robust offline performance, suggests that its online performance will be at least as good as or better than DTW’s online performance. Another limitation is that a reasonable degree of residual arm movements should be preserved in order for the deep learning algorithms to reliably infer grasp intentions. However, given that most quadriplegics include individuals with C5 and lower level injury that retain sufficient arm movements, a majority of SCI survivors will be able to operate this technology.

This study demonstrates the feasibility of inferring grasp intentions, merely from reaching and other novel arm motions of individuals with cervical SCI and enables them to benefit from neuromuscular stimulation-based assistance. This approach has clinical viability and could be deployed in rehabilitation centers in the future for use in not only SCI patients, but also individuals living with paralysis from stroke, multiple sclerosis, traumatic brain injury, or other injuries or diseases. Importantly, the rewarding experience of being able to control your own movements, may lead to increased patient engagement during therapy and ultimately, lead to better motor recovery.

SCI – Spinal Cord Injury; ASIA – American Spinal Injury Association; 2D – Two Dimensional; 3D – Three Dimensional; IMU – Inertial Measurement Unit; MCU – Microcontroller Unit; DTW – Dynamic Time Warping; LSTM – Long Short-Term Memory; SD – Standard Deviation;

Ethics approval and consent to participate

This research study was approved by the Northwell Health Institutional Review Board (FWA# 00002505), and is being conducted in adherence to IRB study # 17–0070. All study participants provided written informed consent prior to the initiation of any research procedures.

Consent for publication

All study participants provided written informed consent for publication as part of their consent for participation in the research study. Additionally, separate audio and visual authorization was provided for the utilization of images and video in publication

Availability of data and materials

Data from the publication is available from the corresponding author upon request.

Competing interests

CEB holds patents in related fields and is the founder of Neuvotion, LLC, a company focused on movement restoration.

Funding

This study was supported by donors and the Feinstein Institutes for Medical Research at Northwell Health.

Authors' contributions

CB conceived the study and edited the manuscript; KK and NB designed the software with inputs from CB; NB, KK, and RR performed/assisted with the study; NB performed data analysis and wrote the manuscript. All authors read and approved the final manuscript.

Acknowledgements

We thank our study participants for their dedication and time for the study. We would also like Santosh Chandrasekaran for his assistance during the experiments. Finally, we thank Northwell Health and the Feinstein Institutes for Medical Research for providing support for the study.

NSCISC. National Spinal Cord Injury Statistical Center, Facts and Figures at a Glance. Birmingham, AL: University of Alabama at Birmingham. 2019.
Ragnarsson KT. Functional electrical stimulation after spinal cord injury: current use, therapeutic effects and future directions. Spinal Cord. 2008;46(4):255–74.
Cornwall R, Hausman MR. Implanted neuroprostheses for restoration of hand function in tetraplegic patients. The Journal of the American Academy of Orthopaedic Surgeons. 2004.
Kilgore KL, Hoyen HA, Bryden AM, Hart RL, Keith W, Peckham PH. An Implanted Upper-Extremity Neuroprosthesis Using Myoelectric Control. J Hand Surg Am. 2008;33(4):539–50.
Bouton CE, Shaikhouni A, Annetta N V., Bockbrader MA, Friedenberg DA, Nielson DM, et al. Restoring cortical control of functional movement in a human with quadriplegia. Nature. 2016;000(7602):1–13.
Ajiboye AB, Willett FR, Young DR, Memberg WD, Murphy BA, Miller JP, et al. Restoration of reaching and grasping movements through brain-controlled muscle stimulation in a person with tetraplegia: a proof-of-concept demonstration. Lancet. 2017;389(10081):1821–30.
Batzianoulis I, Krausz NE, Simon AM, Hargrove L, Billard A. Decoding the grasping intention from electromyography during reaching motions. J Neuroeng Rehabil. 2018;15(1):1–13.
Nas K, Yazmalar L, Sah V, Aydin A, Ones K. Rehabilitation of spinal cord injuries. World J Orthop. 2015 Jan;6(1):8–16.
Prasad VSSV, Schwartz A, Bhutani R, Sharkey PW, Schwartz ML. Characteristics of injuries to the cervical spine and spinal cord in polytrauma patient population: Experience from a regional trauma unit. Spinal Cord. 1999;37(8):560–8.
Siddiqui N, Chan RHM. Multimodal hand gesture recognition using single IMU and acoustic measurements at wrist. PLoS One. 2020;15(1):1–12.
Kim M, Cho J, Lee S, Jung Y. IMU Sensor-Based Hand Gesture Recognition for Human-Machine Interfaces. Sensors. 2019;19(18):1–13.
Ciancibello J, King K, Meghrazi MA, Padmanaban S, Levy T, Ramdeo R, et al. Closed-loop neuromuscular electrical stimulation using feedforward-feedback control and textile electrodes to regulate grasp force in quadriplegia. Bioelectron Med. 2019;5(1):1–8.
Shokoohi-Yekta M, Hu B, Jin H, Wang J, Keogh E. Generalizing DTW to the multi-dimensional case requires an adaptive approach. Data Min Knowl Discov. 2016/02/15. 2017 Jan;31(1):1–31.

Due to technical limitations, Table 1 is provided in the Supplementary Files section.

table1.png

Download PDF

Journal Publication

published 25 Aug, 2020

Read the published version in Bioelectronic Medicine →

Editorial decision: Major revision
03 May, 2020
Review #1 received at journal
01 May, 2020
Review #2 received at journal
23 Apr, 2020
Reviewer #2 agreed at journal
14 Apr, 2020
Reviewer #1 agreed at journal
08 Apr, 2020
Reviewers invited by journal
06 Apr, 2020
Editor assigned by journal
31 Mar, 2020
Editor invited by journal
30 Mar, 2020
Submission checks completed at journal
24 Mar, 2020
First submitted to journal
21 Mar, 2020

You are reading this older preprint version

Read the latest preprint version →

Inferring Grasp Intentions from Arm Trajectories via Deep Learning to Enable Functional Movement in Quadriplegia

Status:

Journal Publication

Version 1

Abstract

Figures

1. Introduction

2. Methods

3. Results And Discussion

4. Conclusions

Abbreviations

Declarations

References

Table

Supplementary Files

Status:

Journal Publication

Version 1