Figure 1 shows the study’s workflow followed to perform these experiments: samples collection in the hospital, sample preparation, video recording under the microscope and video analysis.
Samples collection
Human blood samples from healthy donors, SCD, HS, and THAL patients were collected from University Hospital Vall d’Hebron for our study. (Figure 1A).
Experimental process
RBCs separation and experimental solution
RBCs were isolated from the blood samples and diluted before perfusing them through the microfluidic device. (Figure 1B)
Considering that shear stress and changes in physiological solution can cause osmotic stress on RBC, rendering echinocytes or blocking visibility when recording videos;3,24 we prepared different RBCs solutions and dilutions. We found that the optimal working solution for our experiments was physiological serum containing 1% BSA, 0, 25% of 0,25M EDTA and 15% of glycerol. (see supplemental Figure S1A, Supplementary material).
Figure 2 (A. Spleen filtering unit on a chip designed in the study. It consists of a main channel branched until forming eight parallel microchannels. Each microchannels contain a row of filtering funnel-shaped micro-constriction to mimic the IES section of the spleen. B. zoom out of filtering funnel-shaped constriction. C. Representation of a healthy RBCs and a RHHA RBC passing through the microconstricion. A healthy RBC deforms its shape and recovers it soon after passed the slits. On the contrary, in a RHHA patient the RBC capacity of returning to the original shape is compromise)
Spleen filtering unit on a chip and RBCs perfusion
Optical inspection for experiments were performed using an optical Zeiss microscope and videos were recorded using a mono camera coupled to microscope. A Precision Pressure Control System was used to regulate the flow pressure in the microfluidic device. Videos were recorded for analysis while RBCs were perfused through the chip (Figure 1C).
A microfluidic device was designed and fabricated with the aim of mimicking the filtering function of the red pulp’s spleen. It consisted of a main channel branched until forming eight parallel microchannels. Each microchannels contained a row of filtering funnel-shaped micro-constriction to mimic the IES section of the spleen. Due to their funnel shape, the distance between two slits varied from 1.5 μm to 6.8 μm. Of particular importance, the 1,5 μm narrowest distance defined to ensured RBC deformation when crossing a slit. Also, along each canal, there was a matrix of pillars (Figure 2A-B). This matrix was designed to mimic the reticular mesh of the spleen. Total length for channel was 9.5 mm and height was 4.5 μm. Note that the height of the device was selected to limit RBC movement and maintain it in a planar orientation for a better visualization of its shape under the microscope when deforming itself while flowing through the micro-constrictions.
Video analysis
Video analysis consisted of three steps: cell localization and automatic Region Of Interest (ROI) extraction, feature selection and data refining, and finally classification step. (Figure 1D)
Dataset collection and analysis of images from human RBCs
79 different videos were recorded and analyzed from the 32 subjects included in this study. Cells were automatically localized in each video frame, and a ROI around each cell was extracted and stored as a single data sample. The cell deforms its shape and recovers it soon after passed the slits in the case of a healthy RBC. On the contrary, when the RBC was from a RHHA patient, the RBC capacity of returning to the original shape is compromised (Figure 2C). According to this assumption, we focused the analysis on the ROIs collected after the barrier.
Table 1 shows the number of ROIs for each anemia condition (Control vs RHHA) after the slit barrier.
Sample
|
Number of VIDEO
|
Number of ROIs after the barrier
|
Total
|
79
|
3442
|
Control
|
30
|
1259
|
RHHA
|
49
|
2183
|
SCD
|
11
|
876
|
THAL
|
10
|
406
|
HS
|
28
|
901
|
ROI (Region Of Interest); RHHA (rare hereditary hemolytic anemia); SCD (sickle cell disease);
THAL (thalassemia); HS (hereditary spherocytosis)
Table 1. (Number of videos and ROIs analyzed in the study. The numbers in the table represent the number of videos recorded during the experiments and the number of ROIs extracted from the corresponding videos. Numbers are listed considering the total of the individuals included in the study, then divided in categories.)
Each extracted ROI was processed through a Deep Learning architecture (Figure 1D) and coded into a list of numerical descriptors with the aim of assessing the lack of deformability. Due to the fact that we used a pretrained DL network, and no further fine tuning procedure is performed over the dataset acquired in this study, this procedure is called “transfer learning”.
We implemented Leave-One-Experiment-Out (LOEO) cross-validation procedure to assess the performance of the proposed methodology. Such a strategy was preferred to demonstrate the robustness of the approach, avoiding depending on different acquisition conditions (i.e. image illumination, channel pressure, RBC positioning of the chip). In this strategy, the ROIs extracted in a given video are left out for testing and the remaining ones acquired in the other videos are used for training the model. The procedure is then repeated exhaustively over all the 79 videos considered in the analysis.
We performed two different studies for the validation procedure: first, we calculated the accuracy of recognition of healthy (label 0) vs unhealthy (label 1) videos. Then, we also considered a more challenging scenario in which we tried to recognize individual kinds of anemia, by discriminating healthy (label 0) vs SCD (label 1), THAL (label 2), and HS (label 3) categories.
Figure 3 (Confusion matrices A. Confusion matrices reporting the results of the majority voting and the unhealthy percentage limit criteria for the two-class problem. 30% and 40% were the two percentage limit values considered. B. Confusion matrices reporting the results of the majority voting and of the maximum trustiness criteria for the four-class problem)
Two-class problem: healthy vs unhealthy
In this first study, results were initially collected in terms of classification accuracy at the single cell level and then in terms of the final label assigned to the entire experiment. Two distinct cooperative strategies are used for the task: majority voting and unhealthy percentagelimit criteria.
The rationale was that even in the case of RBCs in RHHA not all RBCs show the same loss of deformability. Anyway, by applying a cooperative strategy procedure over the labels assigned to the cells in a given experiment, the approach allows understanding a global RBCs phenotype rather than an individual RBC behavior.
In addition, we also argue that RBC deformation persistence is exposed to temporal variation thus leading to the fact that in some frames the same cell appears as a “normal” cell while in some other frames it appears with a persistent loss of elasticity. Not less relevant is the fact that each frame represents the 2D view of a quasi-3D scene in which cells move in a 3D space and are visualized over a 2D domain with focus z plan automatically set by microscope. The projection errors may also contribute to the visual lack of persistence of the cell deformation.
The majority voting assigns to the video the most voted class among those assigned to the ROIs extracted. In the unhealthy percentage limit, the system assigns the unhealthy label (i.e., label=1) if the percentage of ROIs assigned to an unhealthy label exceeds a predetermined limit value (e.g., 30%, 40%, etc.) over the entire video.
Figure 3A reports the accuracy results for the majority voting and the unhealthy percentage limit criteria for the two-class problem, considering two percentage limit values of 30% and 40%. Note that, using the 30% limit value we totally recognize RHHA subjects with no false-negative values.
Four-class problem: healthy vs SCD, THAL, and HS
In the four-class problem, in addition to the majority voting cooperative strategy, we applied a maximum trustiness criterion. Given that the classification model provides a label and a score associated, in this second approach, the video is assigned to a certain class if the sum of the scores assigned to the ROIs of the same video to that class (out of the four considered) is the highest one. Figure 3B illustrates the confusion matrices of the majority voting and of the maximum trustiness criteria.
To fully understand the role of the scores and evidence the potential of the method, we also show a sketch of the scores assigned to each of the 79 videos and related ground truth label in Figure 4.
The height of the vertical bars indicates the score values assigned to each video (video index on the x-axis). Colors indicate the category assigned according to the legend to the top-right corner. The black solid stair line indicates the expected category for each video as indicated by the right y-axis labels.
Note that, for controls, not only the scores were generally higher (the range of the score is [0,1]) but also there are not any healthy videos incorrectly assigned to a different category. Regarding the three anemia conditions, the values of the scores are smaller indicating the critical task to solve, but also in this case, there are a very few errors of classification, mostly due to the misclassification between THAL and HS samples (Figure 4).