A wearable stethoscope for accurate real-time lung sound monitoring and automatic wheezing detection based on an AI algorithm

doi:10.21203/rs.3.rs-2844027/v1

Download PDF

Article

A wearable stethoscope for accurate real-time lung sound monitoring and automatic wheezing detection based on an AI algorithm

https://doi.org/10.21203/rs.3.rs-2844027/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

The various bioacoustics signals obtained with auscultation contain complex clinical information used as traditional biomarkers, however it is not widely used in clinical for long-term studies due to spatiotemporal limitations. Here, we developed a wearable stethoscope for skin-attachable, continuous and real-time auscultation using a lung sound monitoring patch (LSMP). The LSMP can monitor respiratory function through mobile app and classify normal and adventitious breathing by comparing the unique acoustic characteristics they produced. Heart and breathing sounds from humans can be distinguished from complex sound consisting of a mixture of the bioacoustic signal and external noise. The performance was further demonstrated with pediatric asthma and elderly chronic obstructive pulmonary disease (COPD) patients. We implemented a counting algorithm to identify wheezing events in real-time regardless of the respiratory cycle. As a result, the AI-based adventitious breathing event counter distinguished over 80% of events, especially wheezing events, in long-term clinical application.

Biological sciences/Biotechnology/Nanobiotechnology/Biosensors

Physical sciences/Engineering/Electrical and electronic engineering

Wearable stethoscope

AI algorithm

Lung sound

Real time monitoring

Automatic wheeze detection

After the invention of the stethoscope by René Raeneck in 1816, the modern analog binaural stethoscope was developed in 1854 and continues to be widely used while maintaining the same shape ¹. The purpose of the stethoscope is to perform auscultation, the method of listening to internal sounds of the body for precise diagnoses ^{2, 3}. It is the most important, noninvasive, and inexpensive traditional method for identifying the basic states of internal organs. Pulmonary auscultation can help diagnose respiratory symptoms; crackles, wheezing, pleural rub, stridor, and cough ^{4, 5}. Continuous auscultation is said to be more sensitive than intermittent auscultation in determining respiratory symptoms with exacerbations of diseases such as asthma⁶ or chronic obstructive pulmonary disease (COPD) ^{7, 8}. In particular, most respiratory diseases often get worse at late night and dawn than in daytime, and such events occur intermittently, so continuous auscultation could provide the solution.

The most common symptoms of asthma and COPD include wheezing sounds ⁹, which can be diagnosed by auscultation with a stethoscope in a clinical setting. However, lung auscultation can sometimes have questionable accuracy in clearly detecting abnormal breathing sounds, it has been reported 60.3% for medical students and 80.1% for fellows. Several points affect auscultation accuracies due to the clinician’s lack of experiences, the weak strength of the signal, or the mixing of signals from other organs ^{10, 11}.

With the development of digital stethoscopes, more accurate diagnosis has become possible through recording, noise reduction, amplification, and data processing^{12, 13}, so that it can be effectively used for telehealth ^{14, 15} and clinical education ^{16, 17}. Despite these advantages, such digital stethoscopes are not widely used due to their similar shape with an analog stethoscope and inconvenience. Recently, wearable devices have been developed for health care ^{18, 19, 20, 21}, especially wearable stethoscopes have been developed using an accelerometer ^{22, 23, 24} or a microphone ^{25, 26, 27} for signal acquisition. Regarding the former cases, researchers have investigated the possibility of measuring biosignals such as speech, cough, swallowing, and snoring by attaching an accelerometer based bio-adhesive device that measures a vibration-based signal ^{28, 29, 30}. However, it is difficult for such devices to detect acoustic signals due to the characteristics of vibration measurement-based sensors³¹. Alternatively, research on wearable sensors using MEMS microphones is actively in progress because of their small size³², high SNR, low power consumption³³, and flat frequency response^{34, 35}. Bioacoustic data through microphone can be managed in the same way that pulmonologist auscultate an internal sound with stethoscope, and in particular, it is possible to analyze frequencies according to small changes in sound objectively through visualization.

Although there are many advantages as a wearable device for auscultation, it requires extensive time and labor to analyze huge data continuously collected by clinicians. Recently, powerful models that predict abnormal pulmonary sounds, such as wheezing, cracking, and stridor, have been developed based on deep neural networks, such as convolutional neural networks (CNNs)⁸ and recurrent neural networks (RNNs)³⁶; these models have been shown to predict abnormal sounds directly with high accuracy (approximately 99.5%). However, these previous studies focused on classifying the type of diseases once and did not monitor changes in symptoms or its severity; if a patient's symptoms do not appear clearly or are critically severe, they need to be monitored for an extended period. In addition, various respiratory diseases are treated for 3 to 4 days by evaluating the change in lung sound, and the treatment may vary depending on the improvement ³⁷.

This paper describes the entire process of developing a wireless, skin-attached, and low-powered wearable stethoscope for continuous lung sound monitoring to diagnose respiratory function. The composition is presented here through the design of lung sound monitoring patch (LSMP) systems optimized for continuous auscultation and the characterization of the device(1); the clinical assessment of healthy subject(2) and several respiratory patients (asthma(3) and COPD(4)); the propose a new machine learning-based data analysis algorithm that counts the number of adventitious sound events (specifically, forced breathing and various wheezing sound) over long periods of time(5).

2.1. Wearable skin attached real-time LSMP sensor

We developed a thin, flexible device that can be attached to the human body and is controlled by wireless communication for continuous monitoring and long-term analysis of lung sounds. The key factors in its development were that the data be highly reliable and sensitive to allow lung sounds to be clearly distinguished and that the skin not be irritated even when attached for several days. Figure 1(a) shows schematics of the LSMP device, and (b) includes a photo of the flexible printed circuit board (fPCB) (left) and the biocompatible silicone used as a cover. The cover is open to operate the slide-type power switch and charger port for charging the battery. The blue LED indicates the operation status; through blink (waiting) or light on (pairing). All the other components are sealed through a cover for circuit protection and noise reduction purposes. The component list of LSMP is described in Table S1.

Figure 1(c) shows a block diagram of the major parts driving the LSMP device, which was programmed in C using embedded software. The power source is a 3.7 V battery that supplies a 3.3 V voltage via the power management unit to the circuit: the red hatch part. A MEMS microphone, purple hatch part, receives clock signals and power through the MCU and transmits the bioacoustic signals in Pulse Density Modulation (PDM) format to a mobile device through the Bluetooth Low Energy (BLE), blue hatch part. And a blue LED indicates the operation status.

We conducted a test to evaluate the basic characteristics of the developed LSMP. Figure 1(d) shows the schematic of the experimental setup to evaluate the acoustic response using LSMP and a mobile device. LSMP is attached to Child Sim and collects bioacoustic signals through the acoustic path formed at the fPCB. The bioacoustic signal is wirelessly transmitted to the mobile device through the BLE. Figure S1 shows the photograph of the experimental setup.

Streaming data are received through a customized iOS app on a mobile device, whose screen is shown in Fig. 1(e); the bioacoustic signal can be played, visualized, and analyzed in real-time through the app. Data analysis using AI is not included in the app function due to the memory resource limitation of the mobile device at this time. The customized iOS app function is explained in detail in Experimental section and Movie S1.

Due to the high-performance characteristics of the MEMS microphone, unexpected external noise can be mixed with the bioacoustics signal. To evaluate the effect of external noise according to the directionality of the microphone, the sound generated by the 1/3 octave-band tuning fork was recorded and reproduced with the same intensity as a reference signal, respectively. First, we collected bioacoustics signals from the Child Sim using the LSMP with one of two different MEMS microphones—one unidirectional, the other omnidirectional—in the presence of 60 dBA, reference noise signals centered at different frequencies and generated by the noise generator. Figure 1(f) shows the opposite directional acoustic response ratio of measured noise by the two MEMS microphones for the different frequency bands (along with the intensity of the noise at those bands). The results shown in the figure demonstrate a difference of up to 15 percent between the unidirectional and omnidirectional microphones, with lower noise levels detected by the unidirectional microphone. Thus, we chose a unidirectional microphone for implementation in LSMP sensor to help reduce the influence of external noise.

We next sought to assess the effect of different types of Bluetooth (BT) module antennas on the RSSI for the system. We equipped the LSMP with either an embedded or an external antenna, then attached it to the Child Sim as shown in Figure S1. Figure 1(g) shows the values of the RSSI for the different distances between the antenna and the receiver. It shows an average signal strength of -70 dBm up to 5 m when an external antenna is used; this kind of antenna allows greater circuit flexibility than an embedded antenna, but the reception strength is lower. It has been reported that smooth communication can be achieved without delay and interruption when the RSSI value is above − 80 dBm. ³⁸. This shows the value of conventional BT-based communication, which is the most commercially available and embedded antenna type, and indicates that data communication is smooth under the proper conditions. Photograph and schematic illustration of the LSMP with the different antenna types is explained in detail in Figure S2.

For continuous monitoring with the LSMP, we used medical-grade adhesives to attach our sensor to the skin and a simple in-vivo test to investigate their sustainability and compatibility with the skin. We attached four samples to the posterior (auscultation position between the scapula and vertebral line) of the human subject and took one off after 1, 3, 5, and 7 days of daily living; Fig. 1(h) shows the surface of the skin after the test. After seven days of having the sensor attached, the skin became slightly reddened, but the sensor did not fall off, and skin irritation did not occur. Therefore, we have verified that LSMP can be attached to human skin for more than five days for use as a long-term continuous pulmonary monitoring device.

2.2. Characterization of the LSMP sensor with a normal subject

We demonstrated the performance of the LSMP with an artificial simulator that generates sound signals in Figure S3, S4 and described in supplementary SI section 1. The feasibility of classifying respiratory symptoms was also confirmed by assessing differences in the acoustic properties of wheezing and crackling sounds, both of which are types of adventitious breathing. However, the auscultation from an actual human body contains the sounds from other organs in addition to the lungs, and we need a way to separate those sounds for further analysis. In this experiment, we evaluate the performance of LSMP with a human subject. Figure 2 shows the complete data analysis process and development of the algorithm for classifying the original bioacoustic signals recorded by the LSMP for heart rate (HR) and respiratory rate (RR). These are processed simultaneously for interpretation, as shown in Figs. 2(a) and 2(b), respectively. These processing steps are conducted via the app after the entire auscultation signal is acquired. Figure 2(c) shows the original, 12-second bioacoustic signal acquired using the LSMP from the posterior left lung field of a healthy volunteer (36 years old, male). Prior to processing, the original bioacoustic signal consists of indiscriminate mixing of the information from the heart (blue line) and lungs (red line); the boundary between inhalation and exhalation is unclear, and it is challenging to classify systolic (S1)/diastolic (S2) of the heart sound. Figure 2(d) and 2(e) show the results of the data processing of the original sound, producing heart and respiration sounds, respectively, that can be used for further classification. Next, the estimated heart rate and respiration rate are calculated by counting the cardiac and respiration cycles, respectively, for 10 seconds and are presented as the beats per minute and breaths per minute. Figure 2(f) shows an expansion of the red dot box shown in Fig. 2(d), highlighting the S1 and S2 signals within a cardiac cycle. HR and HRV can be calculated through the analysis of cardiac S1 and S2 information. This can provide important information (arrhythmia and heart failure) to cardiologists. Figure 2(g) shows a spectrogram of the entire auscultation signal, corresponding to normal breathing and the heartbeat. Each frequency component of the signal is included in its time-intensity plot, but by nature, they cannot be displayed directly; thus, the spectrogram is used to simultaneously analyze the frequency components of the wave file and their intensity over time. The heartbeat can be strongly visualized in the 20–100 Hz band, and normal respiration can be seen as a soft, broad signal in the 100–1000 Hz band. Since the main purpose of LSMP is to monitor lung sounds for a continuous long-term using wireless communication, communication must be maintained without interruption even when the sensor and cell phone are separated by a certain distance or while wearing clothes. Figure 2(h) shows the RSSI values obtained when the subject was and was not wearing clothes as a function of distance from the mobile device. Since the antenna embedded within the LSMP and the mobile device communicate directly, the distance at which the data can be received varies greatly depending on the experimental conditions. The presence of obstacles (clothes) results in an average difference of 10 dBm within 5 m and an average signal strength of -78 dBm without any other communication problems. These findings indicate that when attached to the skin of a healthy subject, the LSMP can maintain a steady communication strength within 5 m of the receiver without interruption. In summary, the data processing algorithm of our system is capable of distinguishing the HR and RR of a healthy subject from the bioacoustic data acquired through the LSMP, which extracted a soft, breezy, broadband breath sound. We also confirmed that the LSMP can classify heart and lung sounds even when they are mixed. Furthermore, it is possible to classify the S1 and S2 periods of the heartbeat, indicating that the use of the LSMP can be extended to the monitoring of cardiovascular diseases.

2.3. Clinical study of pediatric patients with asthma

Diagnosis of respiratory diseases including asthma and COPD through pulmonary function tests (PFTs) analysis is known as the gold standard. Given the findings of our assessment of the proposed device, we expect long-term monitoring using the LSMP to help evaluate the degree of worsening in pediatric asthma patients under 6 years old for whom PFTs cannot be performed ³⁹. To confirm the acoustic characteristics of pediatric asthma patients, we conducted additional measurements and analyses with the LSMP in a patient from this population. Figure 3(a) shows a photograph of the pediatric asthma patient recruited for this experiment. And red dash box represents a magnified view of the attached LSMP. The LSMP was attached to the patient's back according to the clinician’s instructions, and bioacoustic data were recorded for 15 minutes. Figure 3(b) shows 12 seconds of representative time-series data, in which normal and abnormal breathing were recorded. The plot shows both the inhalation and exhalation cycle, and clear differences in intensity between normal and abnormal breathing. In particular, the 1st, 2nd, 3rd, and 6th of the abnormal breaths show relatively strong intensity, which can be used as an indication of an abnormal breathing sound and has a similar trend to the presentation of typical wheezing as observed in the time-intensity plot from the Child Sim, shown in Figure S3(b). Figure 3(c) shows spectrograms of normal breathing period (blue dot box) and Fig. 3(d) shows spectrograms of abnormal breathing period (red dot box), the quantitative details of individual physiological events during normal and abnormal breathing. An analysis of the spectrograms reveals that during normal breathing, no specific frequency peak develops, and exhalations are slightly stronger than inhalations. Meanwhile, during the abnormal breathing period, wheezing signatures (duration > 200 ms) were confirmed. A clear wheezing signature show 4 times during the exhalation phase as marked black dot box. Figures 3(e) and 3(f) show the FFT of the signal during exhalation in the normal and abnormal breathing periods, respectively. During normal breathing, no signal characteristics other than the background sound component can be observed, while the FFT of the abnormal breathing period reveals a characteristic peak for wheezing. In summary, we used LSMP with a pediatric asthma patient to analyze the inhalation/exhalation phase of normal breathing as well as identify the wheezing during abnormal respiration. In addition, it can be evaluated that the RR per minute of this pediatric asthma patient is 57 breaths on average, which is significantly higher than the average RR of 28 to 46 breaths in normal same pediatric ages.

2.4. Clinical study of elderly patients with COPD

We also conducted measurements and analyses of the acoustic characteristics of COPD patients with LSMP. Figure 4(a) shows a photograph of an elderly patient with COPD. The LSMP was attached to the patient's back and bioacoustic data were recorded for 15 minutes. Figure 4(b) shows 12 seconds of representative time-series data, in which normal and abnormal breathing were successively recorded among the continuously measured data.

Although the inhalation/exhalation cycle can be observed during normal breathing, any differences in intensity with the abnormal breathing are challenging to assess in the time-intensity plot due to the noisy situation. However, the three breaths captured from 6 to 12 seconds have a relatively long duration and strong intensity, and although there is extensive noise, it can be interpreted as a sign of abnormal respiration. Figure 4(c) shows the spectrogram for the normal breathing period, demonstrating the presence of external heating, ventilation & air conditioning (HVAC) system, very short-duration, and relatively broadband noise between respirations. Figure 4(d) shows the spectrogram for the abnormal respiration period, in which different wheezing signals of varying duration and frequency band can be observed during the three exhalations. This shows a wheezing signature consisting of a strong intensity at a certain frequency in the spectrogram for a certain duration, which can be easily distinguished as the inflection line (red box) distinguished from the background sound for each adventitious exhalation. Figures 4(e) and 4(f) show the FFT during one exhalation for normal and abnormal breathing, respectively. During the 400 ms expiratory phase of normal breathing (red box in Fig. 4(c)), no characteristic signal other than the background sound can be observed. In Fig. 4(f), FFT analysis was performed to quantify the three types of wheezing signals observed during the expiratory phases of the abnormal breathing period. The analysis reveals a 400 ms monophonic wheezing sound, presenting with a strong, single 600 Hz peak, and two polyphonic wheezing sounds, demonstrating a strong 400 Hz peak and a 580 Hz peak during 1 second. Polyphonic wheezes are a known symptom of patients with extensive airflow obstruction (asthma, COPD, chronic bronchitis, etc.) and manifest as a high-pitched wheeze during breathing when the airway is narrow or stiff ⁴⁰. Similar to conclusion for the pediatric asthma patient, we showed that LSMP can be used to distinguish between normal breathing and abnormal breathing through short, continuous monitoring in a noisy environment. Given our findings, we expect that long-term monitoring using the LSMP can be useful for classifying the characteristics of abnormal breathing in elderly patients with respiratory diseases.

2.5. AI-based wheezing counting algorithm for long-term lung sound analysis

Machine learning is an effective tool for the classification of sounds, and we just need a well-structured model and raw data. We conducted a simple demonstration to show how machine learning can be used with LSMP to classify breathing sounds as shown in Fig. 5.

Open access database are used for these learning data, an example of which is shown in Fig. 5(a). The constructed database consists of 18 types of normal and 11 types of wheezing breathing sounds that were extracted. We modified the length of the extracted sound, representing changes from 1x speed to 0.5x speed in 0.1x decrements, as shown in Fig. 5(b). Then the data were extracted and converted into Log-Mel spectrogram images, as shown in Fig. 5(c). Figure 5(d) shows the flow of the data processing in the deep learning architecture. We used the max-pooling for computational efficiency and memory saving, and a dropout layer for preventing overfitting. And as the output layer, we used the SoftMax function because the sum value of output (= 1) can be effectively utilized in counting algorithm. The model uses binary cross-entropy as the loss function and Adaptive Moment Estimation (Adam) as the optimizer. Training and validation data were divided at a ratio of 8:2 from learning data; additional details of the model architecture and the data splitting are shown in Figs. 5(a) and S6(b), respectively. The receiver operating characteristic (ROC) curve of the trained model, depicted in Fig. 5(e) and Figure S7, indicates that the model had excellent training efficiency. More detailed information about AI algorithm are described in supplementary SI section 2.

The use of the LSMP to continuously monitor respiratory patients and quantify the extracted breathing sounds represents a novel clinical diagnostic application that can overcome the limitations of the existing intermittent use of existing stethoscopes. The LSMP was attached to a patient's anterior right lung field by a clinician and used to record the patient’s lung sounds, as shown in Fig. 6(a). Bioacoustic data were continuously recorded using the LSMP for 79 minutes while the patient lay on an air mattress and received oxygen therapy. Figure 6(b) shows a time-series plot of the entire continuously measured waveform; three 12-second portions of the data are highlighted and blown up in the next three panels as examples of the characteristics of the patient’s breathing. During forced respiration, as seen in Fig. 6(c), a strong signal can be seen than in the normal respiration part when compared to pediatric data as shown in Fig. 3(b) and COPD patient data as shown in Fig. 4(b). The strong regular intensity and breathing duration are observed during normal breathing caused by the oxygen therapy device as shown in Figure S8(a, d). A simple time-series analysis could cause forced respiration to be confused with abnormal respiration; however, this signal is the result of the artificial ventilation provided to the patient, producing relatively rough and high-intensity respiration but no abnormal respiration. The abnormal respiration shown in Fig. 6(d) is rare among the symptoms of respiratory diseases, depicting a low-pitch wheeze in both the inhalations and exhalations. In addition to the strong intensity signal due to forced breathing, the wheezing signature due to the deformation of the airway can be observed, and although a strong background sound component is present, it can be distinguished by a distinct inflection line as shown in Figure S8(b). The abnormal respiration depicted in Fig. 6(e) shows the wheezing signature in the exhalation, a symptom in typical asthma and COPD, as well as the presence of polyphonic wheezing rather than single component wheezing.

To reduce the clinician’s labor and misdiagnosis, we developed an AI-based event counting algorithm to monitor the time-varying symptoms from COPD patient clinical data with the LSMP. Figure 6(f) shows a 30-second segment of clinical data for the preliminary test, covering 12 cycles of inhalation and exhalation. The blue line is bioacoustic data used for test data, and the trained model’s predictions are marked in yellow dots. The trained model predicts the value between 0 to 1 for each label by the result of the SOFTMAX activation function. In this case, we use prediction values for the ‘wheeze’ label. The clinical data were sliced with a fixed 0.6 second window taken every 0.06 seconds; these segments were then input into the model to calculate the predicted values for each continuous, sliced segment of the lung sound. The prediction resolution was sufficiently high to detect normal and wheezing sounds in one breath cycle. Because the trained model predicts the incoming signals every 0.06 seconds, we included an algorithm to count the number of wheezing events. The predicted values range between 0.0 and 1.0, indicating how close the input data are to a wheezing sound; a value of 1.0 indicates a wheeze, and a value of 0.0 indicates normal breathing. We set the thresholds for wheezing and normal breathing to 0.9 and 0.1 respectively for the values are the appropriate hyperparameters for the model’s high prediction accuracy; in this way, when the predicted value dropped from 0.9 (wheezing) to 0.1 (normal breathing), the AI counted a single wheezing event; this is shown schematically using an example wheezing sound in the dashed red square box in Fig. 6(f), which is blown up in Fig. 6(g). The plotted yellow rectangles are the range over which we predict a wheezing sound. The counted number of events and the timing precisely matched the 12 wheezing events.

Figure 6(h) shows a comparison of the number of events counted over time by the AI algorithm and a human observer (pulmonologist). Over a total of 1630 breaths, the AI algorithm and clinician counted the number of wheezing events every 5 minutes with the 79-minute COPD lung sound described above, respectively. The total count was 1450 for the clinician and 1430 for AI; Figure S9(a) shows that the average match rate over the entire set of counts was 80.5%. The results show that the AI algorithm can classify normal and wheezing sounds with high accuracy, especially in asthma or COPD patients, indicating that the LSMP can monitor lung sounds to determine the severity and change of symptoms over time. As a supplement, we compare the count trajectory every one minute (Figure S9(b)) and plot the prediction of three extracted regions whose length is approximately 18 seconds (Figure S10).

Conventional auscultation devices, especially analogue stethoscopes, and E-stethoscopes are not suitable for continuous auscultation due to structural limitations, although it is important to evaluate changes in lung function through continuous auscultation in respiratory patients. A patient with acute or intermittent respiratory disease should accurately be evaluated the change in symptoms and take adequate treatment (oxygen, rescue inhaler) through continuous auscultation ⁴¹. If the information such as lung sounds and the number of wheezing, crackle, and cough could be delivered to the clinician or the user, it can take preemptive measures to prevent dangerous situations.

Here, we developed a wearable stethoscope, which can overcome the spatiotemporal limitation of the conventional stethoscope, for continuous lung sound monitoring that uses wireless communication and is controlled by a mobile device. Table 1 summarizes pros and cons of wearable stethoscopes with two different sensing system; accelerometer or microphone. Compared with the reported wearable stethoscopes, the LSMP shows superior performances of lung auscultation and automate the classification as validated by the clinical study. Through various evaluations of the characteristics of the LSMP sensor, the optimal combination for wireless auscultation was identified, leading to the construction of an ideal sensor. Based on the results, the LSMP is wirelessly miniaturized and lightweight so it can be attached to the skin for a long time without skin irritation. Adventitious breathing sounds with different characteristics were evaluated through a clinical assessment of pediatric asthma and an elderly COPD patient. In particular, LSMP can be used as a novel continuous auscultation device that can evaluate lung function in cases requiring symptom control for pediatric asthma patients, for whom PFTs cannot be performed, under 6 years old.

Table 1

Summary of various wearable stethoscope devices using two different detection systems (accelerometer and microphone),
Sensors	Structure	Features	Application	Connectivity	Limitation	Ref
Single IMU	soft, flexible/thin patch	Conformability on skin	seismocardiography, ECG, speech recognition	wired	not applicable to detail breath sound	³⁰
Single IMU	soft, flexible patch	Placement (suprasternal notch)	stagnant subject RR/HR, swallow, talking, sleep study	wireless BLE	not applicable to detail breath sound	²⁴
Dual IMUs +Temp	soft, flexible patch	Motion noise cacelling, placement (SN + SM)	ambulatory subject, spatio-temporal mapping, daily activities, COVID-19 patient recovery monitoring	wireless BLE, cloud platform	not applicable to detail breath sound	⁴²
Electret mic	stethoscope head	Recording and playback of heart sounds	heart sound and korotkoff sound	wired	performance depends on environmental conditions not suitable long-term monitoring	⁴³
Piezoelectric mic	diaphragmless acousto-electric	Stacking a silicone rubber and a piezoelectric film	clear thoracic sound	wireless	signal attenuation with impedance unmatching	⁴⁴
Fiber optic mic	fiber optic cable shape	High sensitivity long range - high SNR	medical spectroscopy, imaging applications, for long-distance recognition in the military-public security	wired	need for an external ference light source not suitable long-term monitoring	²⁶
Accelerometer + electret mic	stethoscope head shape	Both of acoustic and vibration analysis attenuation with diaphragm thickness	viscoelastic materials for diaphragm off-the-shelf components	wired	signal attenuation with diaphragm composition not suitable long-term monitoring	⁴⁵
Piezo Mems Mic multi array	stethoscope head shape	Mimic the bandpass characteristics of traditional filter	reduce the digital processing requirements on the embedded processor	wireless	no clinical data (artificial data only)	²⁵
Mems Mic multi array	4stethoscope head	Placement (posterior 4point)	automate classification with auscultation	wired	not suitable long-term monitoring	²⁷
MEMS microphone	flexible, thin	Continuous auscultation specific in lung function	automate classification with HR, RR, adventitious breathing	wireless BLE	hard to get clear signal in harsh condition	this works

Furthermore, the rate of classifying abnormal breathing sounds achieved a high accuracy of pulmonologist fellow level (80.5%). The effort of clinicians for long-term lung function evaluation was minimized through AI analysis of the extracted signal.

Future studies will involve active noise cancellation to allow for auscultation during daily life and long-term monitoring over 24 hours; this should allow sufficient time for understanding the relationship between lung sounds and environmental changes or drug administration, providing clinical data for medical decision-making. In particular, continuous monitoring of chronic respiratory patients can provide useful information to evaluate the status of the disease because cough, average RR, dyspnea, and wheezing, which occurs in these diseases, is related to worsening symptoms.

4.1. Fabrication of LSMP sensor

Flexible PCB (fPCB) was designed using Orcad version 17.2(Cadence), and PADS version VX2.7 (SIEMENS) for LSMP. All production, assembly, and inspection were carried out by fPCB manufacturer (KS Electronics, Korea). Medical grade adhesive tape (Medical tape 1524, 3M) was bonded with the hydrocolloid dressing film (Easyderm, CGBio) to the bottom layer of fPCB. An acoustic path, cylinder shape and the hole size was determined as 2 mm, was designed and formed in the adhesive part that directly contacts the microphone bottom hole to enhance the sound collection. Main components consist of uni and omni direction MEMS microphone, MCU, lithium polymer battery, etc. The encapsulating enclosure was 3D modeling with Solidworks 2018(Dassault systems Corp.). It is printed using an SLA (stereo lithography apparatus) type 3D printer (form3, Formlabs) with biocompatible resin (elastic 50A, Fomlabs).

4.2. Experimental set up for Child Sim

For the basic performance test of the developed LSMP sensor, a child heart lung sound trainer simulator (SB48061U, Simulaids Inc. UK) was used. A Child Sim, which used for training auscultated, generates the reference voice, heart, lung sound of an actual 4 years old such as normal, wheezing and crackle. In order to avoid the influence of unintended environmental noise, the experiment was conducted inside a 10t-thick acrylic box (130*100*70cm), and a cut-off noise level 30dB was implemented through 3t-thick sound insulation material (rubber, KdongCnc Soundproofing, Korea) and 50t-thick sound absorption material (polyurethane form, KdongCnc Soundproofing, Korea). To test the effect of external noise, a Bluetooth speaker (SRS-XB22, Sony) capable of generating the same reference noise was constructed at the same height as the sensor location as shown in Figure S1.

4.3. Data acquisition

The measurement of the wireless connection between the LSMP sensor and the mobile device involves a comparison of received signal strength indicator (RSSI) values. Due to the characteristics of wireless communication whose signal strength varies depending on the distance, the presence of obstacles (including clothing), and the attachment site, all tests were conducted with Child Sim and human, and measured five times at a distance of 0 to 5 m (incremental of 50 cm). The standard deviation between measurements at all distances is defined as the error bar in the plot. The iPhone SE2 (Apple Inc., USA) and Customized iOS app were used as the mobile device for all measurement. The iOS App implements the following main functions. 1) Collection of identification information and management of stored files for each subject (patients). 2) Device equipment signal reception (battery information, BT connection) 3) Real-time streaming of bio-acoustic signals collected through the LSMP sensor 4) Based on the received signal, real-time visualization of three screens and full screen composition by chart 5) Reproduction and recording through real-time application of the equalization function of the received signal 6) Share and replay saved files.

4.4. Experiments for clinical study

All clinical studies were conducted by attaching LSMP to the skin directly on the anterior or posterior part where the subject's breathing sound could be auscultation according to the clinician's instructions. This study was approved by the institutional review board at Nowon Eulji Medical Center, Eulji University (IRB No. EMCS 2021-07-003) and volunteer subjects gave informed consent.

4.5. Analysis for bio acoustic signals

The iPhone SE2 (Apple Inc., USA) and Customized iOS app were used as the mobile device for all measurement All data was collected by the iPhone SE2 device through the iOS app for LSMP, and the data was analyzed using Matlab2019. A digital manipulation applied 4th order Butterworth filter which frequency range was 100–2000 Hz. For frequency analysis, FFT and spectrogram were confirmed using a hanning window and 50% overlap. 4th order band pass filter of 20–200 Hz is applied for active filtering in cardiac auscultation to filter out heart rate from the original data. In order to classify the breath sound from the original data in which heart and breath data are mixed, a 4th order band pass filter of 50–500 Hz is utilized to achieve ideal results in pulmonary auscultation.

4.6. Input data pre-processing and Machine learning

All the codes are dealt with Python interpreters. We used librosa packages for pre-processing acoustic raw data. We extracted the necessary region of data for training and transformed them into Log-Mel spectrogram by modules named feature.melspectrogram and amplitude_to_dB from librosa. Converted spectrograms are sliced into 10,000 length of fixed window, overlapping the 10% of total window length by module of librosa.uitl.frame. Then augmented spectrograms are saved as png image. And saved images are processed with OpenCV packages.

For constructing layers of deep learning, we used TensorFlow–keras packages. Layer is stacked with keras.layers method. Conv2d, MaxPooling2D, dropout, flatten, dense layers are used, and the summary of deep-learning architecture is described in Figure S5. Input data is loaded from the saved png images and trained by fit method in keras.Sequential. In addition, modules such as Scikit-Learn, IPython, pandas, noisereduce, scipy are used.

Credit authorship contribution statement

K.-R.L., T.W.K, S.H.I., contributed equally to the work. K.-R.L., T.W.K, and S.H.L. designed the experiments, tested, fabricated, and verifying the devices, analyzed the data. Y.J.L., S.E.J., H.H.S. designed the circuits and sensors, performed data processing. S.H.I, M.H.K., J.G.L, D.H. K., contributed data analysis. G.S.C. performed clinical study and analyzed the data. K.-R.L., T.W.K, S.H.I., and S.H.L. wrote the paper. D.S.K., S.C.S. and S.H.L. secured funds for this project.

Declaration of competing interest

The author declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgments

This work was supported by Korea Environment Industry & Technology Institute (KEITI) through Digital Infrastructure Building Project for Monitoring, Surveying and Evaluating the Environmental Health program, funded by Korea Ministry of Environment (MOE). (Grant number: 2021003330008). This work was supported by KIST Internal program (2E32163). This work was supported by the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI) and Korea Dementia Research Center (KDRC), funded by the Ministry of Health & Welfare and Ministry of Science and ICT, Republic of Korea (grant number:HU20C0164). K.-R.L. thanks the support by Basic Science Research Program through the National Research Foundation of Korea (NRF) funded by the Ministry of Education (grant number: 2022R1A6A3A01087298).

Data and materials availability

The data and code that support the finding of this study are available in https://github.com/sunghoon-most/LSMP-project.git. All other data are present in the paper or the Supplementary Materials.

Geddes LA. Birth of the stethoscope. IEEE Eng Med Biol Mag 24, 84-86 (2005).
Sarkar M, Madabhavi I, Niranjan N, Dogra M. Auscultation of the respiratory system. Ann Thorac Med 10, 158-168 (2015).
Gavriely N, Nissan M, Rubin AH, Cugell DW. Spectral characteristics of chest wall breath sounds in normal subjects. Thorax 50, 1292-1300 (1995).
Emmanouilidou D, Patil K, West J, Elhilali M. A multiresolution analysis for detection of abnormal lung sounds. Annu Int Conf IEEE Eng Med Biol Soc 2012, 3139-3142 (2012).
Haider NS, Joseph J, Periyasamy R. An investigation on the statistical significance of spectral signatures of lung sounds. Biomed Res-India 28, 2801-2810 (2017).
Sutherland ER. Nocturnal asthma: Underlying mechanisms and treatment. Curr Allergy Asthm R 5, 161-167 (2005).
Sengupta N, Sahidullah M, Saha G. Lung sound classification using cepstral-based statistical features. Comput Biol Med 75, 118-129 (2016).
Rietveld S, Oud M, Dooijes EH. Classification of asthmatic breath sounds: Preliminary results of the classifying capacity of human examiners versus artificial neural networks. Comput Biomed Res 32, 440-448 (1999).
Aviles-Solis JC, et al. Prevalence and clinical associations of wheezes and crackles in the general population: the Tromso study. BMC Pulm Med 19, 173 (2019).
Rao A, Huynh E, Royston TJ, Kornblith A, Roy S. Acoustic Methods for Pulmonary Diagnosis. IEEE Rev Biomed Eng 12, 221-239 (2019).
Kim Y, et al. Respiratory sound classification for crackles, wheezes, and rhonchi in the clinical field using deep learning. Sci Rep-Uk 11, (2021).
Shah MA, Shah IA, Lee DG, Hur S. Design Approaches of MEMS Microphones for Enhanced Performance. J Sensors 2019, (2019).
Algamili AS, et al. A Review of Actuation and Sensing Mechanisms in MEMS-Based Sensor Devices. Nanoscale Res Lett 16, 16 (2021).
Lakhe A, Sodhi I, Warrier J, Sinha V. Development of digital stethoscope for telemedicine. J Med Eng Technol 40, 20-24 (2016).
Vilendrer S, et al. Patient Perspectives of Inpatient Telemedicine During the COVID-19 Pandemic: Qualitative Assessment. JMIR Form Res 6, e32933 (2022).
Mesquita CT, et al. Digital Stethoscope as an Innovative Tool on the Teaching of Auscultatory Skills. Arq Bras Cardiol 100, 187-189 (2013).
Legget ME, Toh M, Meintjes A, Fitzsimons S, Gamble G, Doughty RN. Digital devices for teaching cardiac auscultation - a randomized pilot study. Med Educ Online 23, 1524688 (2018).
Ye S, Feng S, Huang L, Bian S. Recent Progress in Wearable Biosensors: From Healthcare Monitoring to Sports Analytics. Biosensors (Basel) 10, (2020).
Pasche S, Angeloni S, Ischer R, Liley M, Lupranoe J, Voirin G. Wearable Biosensors for Monitoring Wound Healing. Adv Sci Tech 57, 80-87 (2009).
Jalloul N. Wearable sensors for the monitoring of movement disorders. Biomed J 41, 249-253 (2018).
Kim J, Campbell AS, de Avila BE, Wang J. Wearable biosensors for healthcare monitoring. Nat Biotechnol 37, 389-406 (2019).
Hu YT, Xu Y. An Ultra-Sensitive Wearable Accelerometer for Continuous Heart and Lung Sound Monitoring. Ieee Eng Med Bio, 694-697 (2012).
Gupta P, Moghimi MJ, Jeong Y, Gupta D, Inan OT, Ayazi F. Precision wearable accelerometer contact microphones for longitudinal monitoring of mechano-acoustic cardiopulmonary signals. NPJ Digit Med 3, 19 (2020).
Lee K, et al. Mechano-acoustic sensing of physiological processes and body motions via a soft wireless device placed at the suprasternal notch. Nat Biomed Eng 4, 148-158 (2020).
Prasad M, Sahula V, Khanna VK. Design and Fabrication of Si-Diaphragm, ZnO Piezoelectric Film-Based MEMS Acoustic Sensor Using SOI Wafers. Ieee T Semiconduct M 26, 233-241 (2013).
Hayber SE, Tabaru TE, Keser S, Saracoglu OG. A Simple, High Sensitive Fiber Optic Microphone Based on Cellulose Triacetate Diaphragm. J Lightwave Technol 36, 5650-5655 (2018).
Islam MA, Bandyopadhyaya I, Bhattacharyya P, Saha G. Multichannel lung sound analysis for asthma detection. Comput Methods Programs Biomed 159, 111-123 (2018).
Chung HU, et al. Binodal, wireless epidermal electronic systems with in-sensor analytics for neonatal intensive care. Science 363, 947-+ (2019).
Chung HU, et al. Skin-interfaced biosensors for advanced wireless physiological monitoring in neonatal and pediatric intensive-care units. Nat Med 26, 418-+ (2020).
Liu YH, et al. Epidermal mechano-acoustic sensing electronics for cardiovascular diagnostics and human-machine interfaces. Sci Adv 2, (2016).
Kraman SS, Wodicka GR, Pressler GA, Pasterkamp H. Comparison of lung sound transducers using a bioacoustic transducer testing system. J Appl Physiol 101, 469-476 (2006).
Kraman SS, Pressler GA, Pasterkamp H, Wodicka GR. Design, construction, and evaluation of a bioacoustic transducer testing (BATT) system for respiratory sounds. IEEE Trans Biomed Eng 53, 1711-1715 (2006).
Shkel AA, Kim ES. Wearable Low-Power Wireless Lung Sound Detection Enhanced by Resonant Transducer Array for Pre-Filtered Signal Acquisition. 2017 19th International Conference on Solid-State Sensors, Actuators and Microsystems (Transducers), 842-845 (2017).
Shkel AA, Kim ES. Continuous Health Monitoring With Resonant-Microphone-Array-Based Wearable Stethoscope. Ieee Sens J 19, 4629-4638 (2019).
Lee SH, Kim YS, Yeo WH. Advances in Microsensors and Wearable Bioelectronics for Digital Stethoscopes in Health Monitoring and Disease Diagnosis. Adv Healthc Mater, e2101400 (2021).
Serato JHL, Reyes R. Automated Lung Auscultation Identification for Mobile Health Systems Using Machine Learning. Proceedings of 4th Ieee International Conference on Applied System Innovation 2018 ( Ieee Icasi 2018 ), 287-290 (2018).
Klein M. Fundamentals of Lung Auscultation. New Engl J Med 370, 2052-2052 (2014).
Vallejo M, Recas J, del Valle PG, Ayala JL. Accurate human tissue characterization for energy-efficient wireless on-body communications. Sensors (Basel) 13, 7546-7569 (2013).
Reddel HK, et al. Global Initiative for Asthma (GINA) Strategy 2021 - Executive summary and rationale for key changes. Eur Respir J, (2021).
Lopez-Campos JL, Tan W, Soriano JB. Global burden of COPD. Respirology 21, 14-23 (2016).
DiMango E, et al. Risk Factors for Asthma Exacerbation and Treatment Failure in Adults and Adolescents with Well-controlled Asthma during Continuation and Step-Down Therapy. Ann Am Thorac Soc 15, 955-961 (2018).
Jeong H, et al. Differential cardiopulmonary monitoring system for artifact-canceled physiological tracking of athletes, workers, and COVID-19 patients. Sci Adv 7, (2021).
Bhaskar A. A simple electronic stethoscope for recording and playback of heart sounds. Adv Physiol Educ 36, 360-362 (2012).
Yilmaz G, et al. A Wearable Stethoscope for Long-Term Ambulatory Respiratory Health Monitoring. Sensors-Basel 20, (2020).
Kraman SS, Pressler GA, Pasterkamp H, Wodicka GR. Design, construction, and evaluation of a BioAcoustic transducer testing (BATT) system for respiratory sounds. Ieee T Bio-Med Eng 53, 1711-1715 (2006).

(Not answered)

LSPuserguide.mp4
A wearable stethoscope for accurate real-time lung sound monitoring and automatic wheezing detection based on an AI algorithm
SupplementalMaterial230406.docx

Download PDF

Version 1

posted

You are reading this latest preprint version

A wearable stethoscope for accurate real-time lung sound monitoring and automatic wheezing detection based on an AI algorithm

Status:

Version 1

Abstract

Figures

1. Introduction

2. Results and Discussion

2.1. Wearable skin attached real-time LSMP sensor

2.2. Characterization of the LSMP sensor with a normal subject

2.3. Clinical study of pediatric patients with asthma

2.4. Clinical study of elderly patients with COPD

2.5. AI-based wheezing counting algorithm for long-term lung sound analysis

3. Conclusions

4. Experimental Section/Methods

4.1. Fabrication of LSMP sensor

4.2. Experimental set up for Child Sim

4.3. Data acquisition

4.4. Experiments for clinical study

4.5. Analysis for bio acoustic signals

4.6. Input data pre-processing and Machine learning

Declarations

References

Additional Declarations

Supplementary Files

Status:

Version 1