5.1 Participants
B-mode ultrasound images and SWE elasticity of 52 NSLBP patients were collected from August 2020 to April 2021 from the University of Hong Kong-Shenzhen Hospital. Before data collection, VAS was used to evaluate the pain intensity of the patients. Each patient was informed of the purpose and process of the experiments and an informed consent form was collected from all the subjects. Ethical approval was authorized by the University of Hong Kong-Shenzhen Hospital ([2020]178).
Table 7 Descriptive statistics from the data.
|
Mild
(VAS <= 3)
|
Moderate-severe
(VAS > 3)
|
Male
|
12
|
15
|
Female
|
12
|
13
|
Total
|
24
|
28
|
|
mean±std
|
|
Weight (Kg)
|
65.52±11.52
|
64.38±10.82
|
Height (m)
|
1.69±0.10
|
1.69±0.07
|
BMI
|
22.91±2.72
|
22.54±3.09
|
Age
|
35.96±7.62
|
41.11±10.24
|
Notes: There were no significant differences among the groups in terms of the weight, height, age or body mass index (BMI).
Abbreviations: BMI, body mass index; std, standard deviation; Kg, kilogram; m, meter.
The patients who had LBP with a significantly intense pain during rest and/or daily activities (according to VAS ≥ 1) that lasted for more than 3 months were included in the study. The exclusion criteria included history of spinal or lower limb fractures, spinal surgery or spinal deformities. According to the pain intensity, the patients were divided into two groups: (1) NSLBP patients with a mild pain (VAS <= 3) (2) NSLBP patients with a moderate-severe pain (VAS > 3). Data statistics are shown in Table 7.
5.2 Experimental Design
5.2.1 Data acquisition
Mindray DC-80 (Mindray, China) with an 8.5 MHz linear array ultrasonic transducer was used for data acquisition, which can collect B-mode ultrasound images and the SWE elasticity values (mean and standard deviation) of the muscles. In addition, the quality of the obtained SWE elasticity values was checked using DC-80 self-checking module for the elasticity imaging quality, to make the SWE elasticity measurement more stable.
5.2.2 Data acquisition process
The data acquisition process included the following steps:
1) Taking the prone position, a thin pillow was put under the abdomen of the patient to make the low back flat. Then, the arms were placed flat on both sides (left side and right side) of the body, as shown in Fig. 2(a).
i. B-mode ultrasound images of the muscle (MF, ES, TrA and TLF) on both sides of the patient’s L2-L3 lumbar spine as well as the SWE elasticity values (mean and standard deviation) of the muscle region of interest (ROI) were acquired.
ii. B-mode ultrasound images of the MF on both sides of the patient's L4-L5 lumbar spine as well as the SWE elasticity values (mean and standard deviation) of the muscle ROI were acquired.
2) Taking the tabletop position[32], the patient's low back was kept flat and relaxed, as shown in Fig. 2(b).
i. B-mode ultrasound images of the MF on both sides of the patient’s L2-L3 and L4-L5 lumbar spine and the SWE elasticity values (mean and standard deviation) of the muscle ROI were acquired.
Finally, we obtained the B-mode ultrasound images as well as the SWE elasticity values of a total of 14 images from the patient's 4 muscles (MF, ES, TLF and TrA), as shown in Table 8. The representative B-mode ultrasound images are shown in Fig. 3.
Table 8 Data collection sites and the patients’ positions.
|
L2-L3
|
L4-L5
|
L&R
|
Prone position
|
Tabletop position
|
SWE elasticity
|
Muscle thickness
|
MF
|
√
|
√
|
√
|
√
|
√
|
√
|
|
ES
|
√
|
|
√
|
√
|
|
√
|
|
TLF
|
√
|
|
√
|
√
|
|
√
|
√
|
TrA
|
√
|
|
√
|
√
|
|
√
|
√
|
Notes: The√ represents the imaging of the muscle in the corresponding position.
Abbreviations: L2-L3, L2-L3 lumbar spine muscle; L4-L5, L4-L5 lumbar spine muscle; L&R, left and right sides of the lumbar spine muscle; SWE, shear wave elastography; MF, multifidus muscle; ES, erector spinae; TLF, thoracolumbar fascia; TrA, transversus abdominis.
5.3 LBP Framework
This experiment was based on the MIFS framework, as shown in Fig. 4, which was mainly composed of feature extraction and feature selection. First, 55 B-mode ultrasound image features were extracted from the ROI of muscles in multiple sites (details are shown below). Then, they were combined with the SWE elasticity feature from the ROI to form the total feature set. After feature standardization and selection, important features were selected from the total feature set to construct the optimal feature set. Finally, the optimal feature set was used to train the SVM model and classify NSLBP patients.
5.4 Data Processing
5.4.1 Feature Extraction
Based on the MIFS framework, this experiment extracted the features from the B-mode images as well as SWE elasticity values, including the following features:
- Muscle morphological feature: composed of average thickness feature of the muscle (TLF and TrA). Thickness is usually defined as the distance between the midpoints of the upper and lower muscle and calculated by the mean of the left and right sides of the muscle (TLF and TrA) thickness. In order to ensure the accuracy and validity of the value, we double checked the thickness measurement.
- Mean image frequency analysis features (MFAF) of the ROI: calculated separately as the maximum entropy method and multi-window method[33, 34].
- Image texture features: composed of two features. One is the first-order statistical (FOS) feature derived from the gray-level histogram, including the integrated optical density, mean, standard deviation, variance, skewness, kurtosis and energy. The other is the high-order texture feature, including the Haralick feature calculated from the Gray-Level Co-occurrence Matrix (GLCM)[35], Galloway feature calculated from the Gray-Level Run-Length Matrix (GLRLM)[36] as well as local binary patterns feature (energy and entropy)[21].
- SWE elasticity feature: composed of the mean and standard deviation of SWE elasticity from the muscle ROI.
Finally, we obtained 800 features (57 features per ROI of an image and 14 images per subject) and 2 average thickness features of the TLF and TrA of each subject. The details of the features are listed in Table 9.
Table 9 Details of the features.
Feature type
|
Feature names
|
Notes
|
Muscle morphological feature
|
muscle (TLF and TrA) average thickness feature
|
N = 2
|
Image frequency analysis feature
|
MFAF
|
Calculated by the maximum entropy method and multi-window method. N = 2
|
FOS feature
|
Integrated optical density, mean, standard deviation, variance, skewness, kurtosis and energy
|
Derived from the gray-level histogram. N = 7
|
Haralick feature
|
Contrast, correlation, energy, entropy, homogeneity and symmetry
|
Calculated from the Gray-Level Co-occurrence Matrix (GLCM) with 4 directions: 0°, 45°, 90° and 135°. N = 24
|
Galloway feature
|
Short Run Emphasis (SRE), Long Run Emphasis (LRE), Gray-Level Non-Uniformity (GLNU),
Run Length Non-Uniformity (RLNU) and Run Percentage (RP)
|
Calculated from the Gray-Level Run-Length Matrix (GLRLM)) with 4 directions: 0°, 45°, 90°, and 135°. N = 20
|
Local binary patterns feature
|
Energy and entropy
|
N = 2
|
SWE elasticity feature
|
Mean and std
|
N = 2
|
Abbreviations: N, the number of features; TLF, thoracolumbar fascia; TrA, transversus abdominis; FOS, first-order statistical; MFAF, mean image frequency analysis feature; SWE, shear wave elastography; std, standard deviation.
5.4.2 Feature Selection
1) The variance between the features of the total feature set obtained in this experiment was relatively large. To prevent the variance of some features from being much larger than the variance of other features, which could result in slow convergence or non-convergence of the model, this experiment standardized the total feature set to make the mean and variance of the feature equal to 0 and 1, respectively, as shown in Eq. 1.
2) In this experiment, the number of features obtained by each patient was 800. To prevent the model from overfitting or having difficulty converging, it was necessary to perform feature selection on the total feature set and to reduce the feature dimension. We used SelectPercentile of sklearn.feature_selection in the machine learning package Scikit-learn (v. 0.22.1) for feature selection. The score_func parameter contains several feature selection methods. In this experiment score_func chose the default f_classif as the feature selection method, and the kernel method of f_classif was the analysis of variance (ANOVA)[37]. Due to the ability to combine all the feature information, ANOVA does not only improve the efficiency, but also increases the reliability of the feature selection. After feature selection with ANOVA, the optimal feature set was finally extracted from the total feature set.
5.5 Classification
1) The Scikit-learn (v. 0.22.1) machine learning library was used in Python (v. 3.7.6) to build a machine learning-based pipeline to analyze the feature data.
2) SVM was used to classify NSLBP patients. SVM is a two-classification model, and its basic model is a linear classifier with the largest interval defined in the feature space. The basic idea behind SVM is to solve the separation hyperplane that can correctly divide the dataset and have the largest geometric interval. SVM has several complex kernel methods, which can make the data that are inseparable in the linear space separable in other dimensions. At the same time, the addition of regularization enhances the robustness of the SVM model and reduces the possibility of overfitting. The SVM model of this experiment used a linear kernel and L2 regularization.
3) When the amount of data is sufficient, datasets are usually divided into training set, validation set and test set, and the final performance results come from the test set. However, since the amount of data in this experiment is limited, the above-mentioned single test set could not reliably report the performance indicators of the classifier[21]. To overcome this disadvantage, five-fold cross-validation combined with grid search was used to optimize and evaluate the classification model in this experiment.
4) In the classification task of NSLBP of this experiment, NSLBP patients with a moderate-severe pain were treated as positive cases and those with a mild pain as negative ones. The metrics of accuracy, specificity, sensitivity, AUC (area under the receiver operator characteristic curve), precision and negative predictive value (NPV) were used to quantify the classification results. At the same time, the classification results of using the SWE elasticity feature, B-mode ultrasound image feature and SWE elasticity feature combined with B-mode ultrasound image feature were compared.