Fault diagnosis method for rolling bearings based on BICNN under complex operating conditions

doi:10.21203/rs.3.rs-4370002/v1

Research Article

https://doi.org/10.21203/rs.3.rs-4370002/v1

This work is licensed under a CC BY 4.0 License

Journal Publication

published 17 Aug, 2024

Read the published version in Journal of the Brazilian Society of Mechanical Sciences and Engineering.

Abstract

To address the poor noise resistance and insufficient generalization performance of traditional fault diagnosis methods, an end-to-end rolling bearing fault diagnosis method based on a Bidirectional Interactive Convolutional Neural Network (BICNN) is proposed. Firstly, the bearing vibration signal is input directly into a wide convolutional kernel for rapid feature extraction, which reduces the interference of high-frequency noise. Secondly, a modified Rectified Linear Unit (M-ReLU) activation function is designed to solve the "neuron death" problem of the ReLU activation function. Then, a bidirectional interactive feature extraction module is constructed, and the extracted features are fed into it to capture channel and spatial feature information simultaneously. Next, the extracted information is passed to the proposed feature enhancement module to achieve the transmission and accumulation of more valuable information. Finally, a small convolutional kernel is applied to further extract feature information, and a global average pooling layer replaces the fully connected layer, reducing the number of parameters and helping to avoid model overfitting. Softmax is used to classify the bearing fault types. Two different datasets are adopted to validate the fault diagnosis performance of the proposed model under a −4 dB signal-to-noise ratio and variable working conditions. Experimental results show that, compared with other fault diagnosis methods, the proposed model has higher fault diagnosis accuracy, stronger noise resistance, and better generalization ability.

Keywords: fault diagnosis; Bidirectional Interactive Convolutional Neural Network; Modified Rectified Linear Unit; feature enhancement module; rolling bearing

1 Introduction

Rotating machinery is widely used in aerospace, CNC machine tools, industrial robots and other fields. As the supporting and rotating component of the shaft, rolling bearings play a crucial role in the stable operation and expected function of rotating machinery^[1–2]. However, under harsh working conditions such as high temperature, high pressure and corrosion, rolling bearings are prone to failure, and they account for approximately 45–55% of rotating machinery failures^[3]. Studying the fault diagnosis of rolling bearings is therefore of great significance for the safe operation of rotating machinery^[4–5]. Given the large volume of equipment condition monitoring data in modern industry, it is unrealistic to rely on manual extraction of fault features. In recent years, identifying fault types from vibration data has become the mainstream approach in fault diagnosis, and intelligent fault diagnosis methods based on deep learning have been favored by a large number of scholars^[6]. Deep learning models have strong big-data learning ability and high generalization performance, and they can automatically extract fault features from bearing vibration data without manual intervention, greatly reducing the dependence on expert experience and domain knowledge.

As one of the important branches of deep learning, the convolutional neural network (CNN) is widely used in fault diagnosis and has achieved remarkable results^[7]. Yao et al.^[8] proposed an intelligent bearing fault diagnosis method based on a stacked inverse residual convolutional neural network, which improved the diagnosis speed of the model and its diagnostic performance in noisy environments. Cui et al.^[9] presented a bearing fault diagnosis method based on a multi-layer adaptive convolutional neural network, which enhanced the feature learning ability of the model and achieved high fault diagnosis accuracy under variable working conditions. Li et al.^[10] put forward an improved convolutional neural network fault diagnosis method, which effectively improved the feature extraction and generalization ability of bearing fault diagnosis. Chang et al.^[11] proposed an efficient and lightweight residual network fault diagnosis method based on an attention mechanism, which maintained high accuracy while reducing time complexity and model size. Zhang et al.^[12] presented a fault diagnosis method using an attention-based dual-scale feature fusion capsule network: an attention-based dual-branch network calculates the weights of features at different scales, and dual-scale feature fusion is then performed on this basis to achieve effective fault recognition. Liu et al.^[13] proposed a bearing fault diagnosis method based on a multi-scale fusion attention CNN, which learned the importance of fault features through improved attention and achieved high fault recognition accuracy. Xu et al.^[14] put forward a multi-scale convolutional neural network fault diagnosis method based on a channel-space attention mechanism, which achieved good recognition results.

Although the above methods have achieved encouraging results, the following problems remain: (1) Most models use the ReLU activation function. Since the output of ReLU is always 0 for negative inputs, “neuron death” can occur during model training, leaving some neurons unable to be activated. (2) Most fault diagnosis methods first use convolution to extract features and then use a spatial or channel attention mechanism to adjust the feature weights, which inevitably leads to the loss of channel or feature information. To address these problems, an end-to-end rolling bearing fault diagnosis method based on BICNN is proposed. For the first problem, an M-ReLU activation function is proposed, which enhances the model’s feature learning ability while avoiding “neuron death”. For the second problem, a bidirectional interactive feature extraction module and a feature enhancement module are designed. The bidirectional interactive feature extraction module extracts channel and spatial feature information simultaneously, giving the model stronger feature extraction and generalization capabilities. The feature enhancement module controls the degree of information retention through a gating unit, enabling the model to focus on more important feature information.

The rest of the paper is organized as follows. Section 2 introduces the relevant basic theory. Section 3 presents the fault diagnosis approach and gives the framework of the model together with its structural analysis. Section 4 describes the datasets and conducts comparative experiments and performance analysis. Finally, conclusions are summarized in Section 5.

2 Theoretical background

2.1 Proposal of M-ReLU activation function

The activation function is an important component of neural networks, which plays a crucial role in improving the learning and expression abilities of models^[15]. During the training process, the activation layer plays a key role in learning complex function mappings from data. Most scholars use the ReLU activation function in the activation layer, and its expression is shown in Eq. (1):

$$f(x)=\begin{cases} x, & x \geqslant 0 \\ 0, & x<0 \end{cases} \tag{1}$$

It can be seen from Eq. (1) that, since both the activation value and the gradient are 0 for $x<0$, some neurons will never be updated and some valid information is discarded, so the information in the network cannot be fully utilized. To overcome this shortcoming of the ReLU activation function, an M-ReLU activation function is put forward. Its curve is shown in Fig. 1, and its expression is given in Eq. (2):

$$f(x)=\begin{cases} x, & x \geqslant 0 \\ \dfrac{e^{x}-1}{1+e^{-x}}, & x<0 \end{cases} \tag{2}$$

The M-ReLU activation function retains the linear characteristic of the ReLU activation function for $x \geqslant 0$, while its left side has a soft saturation characteristic. When the input of a neuron is negative, the activation function still has a continuously varying non-zero gradient, so negative input signals can also update some network parameters, which improves the model's ability to learn useful features. When the negative input is large (close to zero), the gradient is clearly non-zero, which allows the model to keep iterating. When the negative input is small (strongly negative), the gradient is small and the output of some neurons is close to 0, which makes the network sparse and reduces the interdependence of the parameters, thus alleviating over-fitting.
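To make the behaviour of Eq. (2) concrete, the following is a minimal Python sketch of M-ReLU and its derivative. The function names `m_relu` and `m_relu_grad` are illustrative, the derivative is worked out here from Eq. (2) rather than quoted from the paper, and the negative branch is written in an algebraically equal form, $(e^{x}-1)e^{x}/(1+e^{x})$, only to avoid overflow for strongly negative inputs.

```python
import math


def m_relu(x: float) -> float:
    """M-ReLU of Eq. (2): x for x >= 0, (e^x - 1) / (1 + e^-x) for x < 0."""
    if x >= 0:
        return x
    ex = math.exp(x)  # in (0, 1) for x < 0, so no overflow
    # (e^x - 1) / (1 + e^-x) multiplied top and bottom by e^x
    return (ex - 1.0) * ex / (1.0 + ex)


def m_relu_grad(x: float) -> float:
    """Derivative of Eq. (2), derived for this sketch (not given in the paper)."""
    if x >= 0:
        return 1.0
    ex = math.exp(x)
    s = ex / (1.0 + ex)  # logistic sigmoid of x
    # d/dx [(e^x - 1) * s] = e^x * s + (e^x - 1) * s * (1 - s)
    return ex * s + (ex - 1.0) * s * (1.0 - s)


def relu(x: float) -> float:
    """Plain ReLU of Eq. (1), for comparison."""
    return max(0.0, x)


# Compare outputs and gradients on a few negative and positive inputs.
for x in (-6.0, -2.0, -0.5, -0.1, 0.0, 1.5):
    print(f"x={x:5.1f}  relu={relu(x):7.4f}  "
          f"m_relu={m_relu(x):8.5f}  grad={m_relu_grad(x):8.5f}")
```

Running the loop shows that ReLU outputs exactly 0 for every negative input, whereas M-ReLU returns small negative values with a gradient that shrinks towards 0 as the input becomes strongly negative.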

2.2 Design of Bidirectional Interaction Feature Extraction Module

The bidirectional interactive feature extraction module (BIFEM) is designed to address the feature loss caused by applying convolution first and the attention mechanism afterwards. Its structure is shown in Fig. 2. The output of the convolutional branch supplements channel-dimension information for the self-attention branch, while the output of the self-attention branch supplements spatial-dimension information for the convolutional branch, thereby enhancing the model’s ability to model the channel and spatial dimensions.

Firstly, the features are input into the convolution branch and the self-attention branch to extract feature information, and then the feature information of the two branches interacts. For the interaction from the convolution branch to the self-attention branch, an SE-like design is adopted: the output of Conv3×3 first passes through a global average pooling layer to compress the features, and channel information then interacts through a Conv1×1. After the BN layer and the M-ReLU activation function, the dimension is adjusted by another Conv1×1. Finally, the feature values are mapped to weights between 0 and 1 by the sigmoid function and applied to the V computation of the self-attention branch. The interaction from the self-attention branch to the convolution branch is similar, but there is no global average pooling layer and the number of channels becomes 1 after the second Conv1×1, so that after the sigmoid function the self-attention output becomes a weight map over the spatial dimension, which acts on the Conv3×3 output of the convolution branch. Finally, the features extracted from the two branches are fused.
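The two-way gating described above can be sketched in NumPy as follows. This is an illustrative reading of the text, not the authors' implementation: the weights are random, the BN layers are omitted, the convolution is one-dimensional with kernel size 3, the channel reduction ratio `r` is hypothetical, and the final fusion is assumed to be an element-wise sum because the paper does not specify it.

```python
import numpy as np


def m_relu(x):
    # Eq. (2), with the negative branch in an overflow-safe equivalent form.
    ex = np.exp(np.minimum(x, 0.0))
    return np.where(x >= 0, x, (ex - 1.0) * ex / (1.0 + ex))


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def softmax(x):
    e = np.exp(x - x.max(axis=-1, keepdims=True))
    return e / e.sum(axis=-1, keepdims=True)


def conv1d_same(x, w):
    """x: (L, C_in), w: (K, C_in, C_out); stride 1 with zero 'same' padding."""
    k = w.shape[0]
    xp = np.pad(x, ((k // 2, k // 2), (0, 0)))
    return np.stack([np.einsum("kc,kco->o", xp[i:i + k], w)
                     for i in range(x.shape[0])])


def bifem(x, p):
    conv = conv1d_same(x, p["w3"])                     # convolution branch, (L, C)
    q, k, v = x @ p["wq"], x @ p["wk"], x @ p["wv"]    # self-attention branch

    # Conv -> attention: GAP, 1x1 conv, M-ReLU, 1x1 conv, sigmoid (BN omitted).
    g = conv.mean(axis=0)                              # (C,)
    g = sigmoid(m_relu(g @ p["c1"]) @ p["c2"])         # channel weights in (0, 1)
    v = v * g                                          # gate V channel by channel

    attn = softmax(q @ k.T / np.sqrt(x.shape[1])) @ v  # (L, C)

    # Attention -> conv: same chain without GAP, reduced to a single channel.
    s = sigmoid(m_relu(attn @ p["s1"]) @ p["s2"])      # spatial weights, (L, 1)
    conv = conv * s                                    # gate each position

    return conv + attn, g, s                           # fusion by sum (assumption)


def init_params(c, r, rng):
    def w(*shape):
        return rng.normal(0.0, 0.1, size=shape)
    return {"w3": w(3, c, c), "wq": w(c, c), "wk": w(c, c), "wv": w(c, c),
            "c1": w(c, c // r), "c2": w(c // r, c),
            "s1": w(c, c // r), "s2": w(c // r, 1)}


rng = np.random.default_rng(0)
L, C, r = 16, 8, 2                      # toy sizes; the paper uses 1024 x 8
x = rng.normal(size=(L, C))
y, channel_gate, spatial_gate = bifem(x, init_params(C, r, rng))
print(y.shape, channel_gate.shape, spatial_gate.shape)
```

The channel gate has one weight per channel and the spatial gate one weight per position, which is the sense in which each branch supplies the dimension the other lacks.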

2.3 Construction of feature enhancement module

According to the importance of different positions in the feature map, a feature enhancement module (FEM) is constructed. The basic idea is to enhance the traditional convolution operation with a gating unit, i.e. to add a gating mechanism to the convolution so as to capture the dependency relationships in sequence data and assign greater weight to important feature information. The structure is shown in Fig. 3. The module consists of two key parts: a gating unit and a convolution kernel. The gating unit uses the Sigmoid function to generate a value between 0 and 1 that controls the degree of information retention, while the convolution kernel extracts features. The output of the convolution kernel is multiplied by the output of the gating unit, and a further convolution is then applied to obtain the final output, achieving the transmission and accumulation of more valuable information. The implementation is described by Eqs. (3) and (4):

$$x=\text{Multiply}([\text{DwConv}(\text{Conv}(\text{BN}(x_{in}))),\ \text{Sigmoid}(\text{DwConv}(\text{Conv}(\text{BN}(x_{in}))))]) \tag{3}$$

$$x_{out}=\text{Concat}([x_{in},\ \text{Conv}(x)]) \tag{4}$$
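Eqs. (3) and (4) can be traced with a small NumPy sketch. Several details are assumptions made only for illustration: the feature path and the gate path use separate random weights, Conv is taken as a 1×1 (pointwise) convolution, DwConv as a depthwise convolution of kernel size 3, and BN as a per-channel normalisation without learned scale or shift. With these choices the concatenation in Eq. (4) doubles the channel count; how this is reconciled with the 1024×8 output listed for the module in Table 1 is not stated in the paper and is not modelled here.

```python
import numpy as np


def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))


def batch_norm(x, eps=1e-5):
    # Stand-in for BN: normalise each channel over the length axis.
    return (x - x.mean(axis=0)) / np.sqrt(x.var(axis=0) + eps)


def pointwise_conv(x, w):
    """1x1 convolution: x (L, C_in) times w (C_in, C_out)."""
    return x @ w


def depthwise_conv(x, w):
    """Each channel is filtered by its own kernel; w has shape (K, C)."""
    k = w.shape[0]
    xp = np.pad(x, ((k // 2, k // 2), (0, 0)))
    return np.stack([(xp[i:i + k] * w).sum(axis=0) for i in range(x.shape[0])])


def fem(x_in, p):
    h = batch_norm(x_in)
    feat = depthwise_conv(pointwise_conv(h, p["conv_f"]), p["dw_f"])
    gate = sigmoid(depthwise_conv(pointwise_conv(h, p["conv_g"]), p["dw_g"]))
    x = feat * gate                                        # Eq. (3)
    x_out = np.concatenate(
        [x_in, pointwise_conv(x, p["conv_o"])], axis=-1)   # Eq. (4)
    return x_out, gate


rng = np.random.default_rng(0)
L, C = 16, 8                                # toy length; channels as in Table 1
params = {
    "conv_f": rng.normal(0.0, 0.1, (C, C)), "dw_f": rng.normal(0.0, 0.1, (3, C)),
    "conv_g": rng.normal(0.0, 0.1, (C, C)), "dw_g": rng.normal(0.0, 0.1, (3, C)),
    "conv_o": rng.normal(0.0, 0.1, (C, C)),
}
x_in = rng.normal(size=(L, C))
x_out, gate = fem(x_in, params)
print(x_out.shape, float(gate.min()), float(gate.max()))
```

A gate value near 0 suppresses the corresponding feature and a value near 1 passes it through almost unchanged, while the concatenation keeps the unmodified input available to later layers.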

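3 Fault diagnosis model of rolling bearing based on bidirectional interactive convolutional neural network

3.1 Construction of BICNN rolling bearing fault diagnosis model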

The structure of the BICNN model is shown in Fig. 4. Firstly, features are quickly extracted from the original bearing vibration signal by the wide convolution kernel. The wide kernel enlarges the receptive field and reduces the interference of high-frequency noise, so that more feature information is extracted and the performance of the network is improved. Secondly, the features extracted by the wide convolution kernel are pooled and input into the bidirectional interactive feature extraction module, where channel and spatial feature information is extracted at the same time. Then, the extracted features are input into the feature enhancement module and enhanced through the gating unit and the convolution kernel, realizing the transmission and accumulation of more important feature information. Next, a small convolution kernel is used to further extract features and improve the diagnostic accuracy. A global average pooling layer replaces the fully connected layer, and dropout with a rate of 0.5 is applied after the global average pooling layer: during training, the activation of each neuron is dropped with this probability, which strengthens the generalization ability of the model and helps prevent overfitting. Finally, the extracted feature information is used by the Softmax classifier to classify the rolling bearing faults.
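The classification head at the end of this pipeline can be illustrated with a short NumPy sketch. The 256×32 feature map and the 7 classes follow Table 1; the linear projection placed between the pooled features and Softmax, the random weights, and the use of inverted dropout are assumptions of this sketch. The last lines compare the number of classifier weights with a hypothetical head that flattens the feature map instead of pooling it.

```python
import numpy as np


def global_average_pool(feat):
    """Average over the length axis: (L, C) -> (C,)."""
    return feat.mean(axis=0)


def dropout(x, rate, training, rng):
    """Inverted dropout: zero units with probability `rate` during training only."""
    if not training:
        return x
    keep = rng.random(x.shape) >= rate
    return x * keep / (1.0 - rate)


def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()


rng = np.random.default_rng(0)
length, channels, n_classes = 256, 32, 7

feat = rng.normal(size=(length, channels))       # stand-in for the last feature map
w = rng.normal(0.0, 0.1, (channels, n_classes))  # assumed projection before Softmax
b = np.zeros(n_classes)

pooled = global_average_pool(feat)
probs = softmax(dropout(pooled, 0.5, training=False, rng=rng) @ w + b)
print(probs.round(3), probs.sum())

# Classifier weights with GAP versus a flatten-then-dense head (biases excluded).
gap_head_weights = channels * n_classes
flatten_head_weights = length * channels * n_classes
print(gap_head_weights, flatten_head_weights)
```

Under these assumptions the pooled head needs 32 × 7 weights, while a flattened head would need 256 × 32 × 7, which is the parameter saving the text attributes to global average pooling.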

3.2 Structural parameters of BICNN model

The structural parameters of BICNN are shown in Table 1. Each convolutional layer consists of a convolution, a BN layer and the M-ReLU activation function. The BN layer accelerates training and convergence and mitigates gradient explosion and vanishing. The M-ReLU activation function provides multi-layer nonlinear mapping in the convolutional layers and improves the nonlinear expression ability of the model.

Table 1

Structural parameters of BICNN model
Layer	Number of convolutional kernels	Size of convolution kernel	Output size
Input	/	/	2048×1
Conv1	8	64×1	2048×8
Maxpooling	/	/	1024×8
Bidirectional Interaction Feature Extraction Module	/	/	1024×8
Feature enhancement module	/	/	1024×8
Maxpooling	/	/	512×8
Conv2	32	3×1	512×32
Maxpooling	/	/	256×32
GAP	/	/	1×32
Softmax	/	/	7
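The output sizes in Table 1 can be reproduced with a few lines of Python. The sketch assumes stride-1 convolutions with "same" padding and max pooling with size and stride 2, which is what the listed sizes imply but is not stated explicitly; the helper names are illustrative, and the convolution parameter counts assume one bias per kernel and exclude BN parameters.

```python
def conv_same(shape, filters):
    """Stride-1 'same' convolution: length unchanged, channels = number of kernels."""
    return (shape[0], filters)


def max_pool(shape, pool=2):
    """Max pooling with size and stride `pool`: length divided, channels unchanged."""
    return (shape[0] // pool, shape[1])


def conv_params(kernel, c_in, c_out):
    """Weights plus one bias per kernel (bias use is an assumption)."""
    return kernel * c_in * c_out + c_out


layers = [
    ("Conv1", lambda s: conv_same(s, 8)),
    ("Maxpooling", max_pool),
    ("BIFEM", lambda s: s),      # Table 1 lists the same size in and out
    ("FEM", lambda s: s),        # Table 1 lists the same size in and out
    ("Maxpooling", max_pool),
    ("Conv2", lambda s: conv_same(s, 32)),
    ("Maxpooling", max_pool),
    ("GAP", lambda s: (1, s[1])),
]

shape = (2048, 1)                # one input sample of 2048 points
trace = [("Input", shape)]
for name, fn in layers:
    shape = fn(shape)
    trace.append((name, shape))

for name, s in trace:
    print(f"{name:<12}{s[0]}x{s[1]}")

print("Conv1 parameters:", conv_params(64, 1, 8))
print("Conv2 parameters:", conv_params(3, 8, 32))
```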

4 Experimental verification and analysis

The experiments are run in PyCharm 2020.1.2 with TensorFlow as the software framework. The hardware platform is an Intel(R) Core(TM) i9-13900KF 3.00 GHz processor and an NVIDIA GeForce RTX 4070 Ti graphics card. The learning rate of the BICNN model is set to 1×10^−3, and the Adam optimization algorithm is used to optimize the model parameters. The training set contains 80 samples per fault type, and the test set contains 100 samples per fault type. The experiments are carried out at a signal-to-noise ratio of −4 dB, and each reported result is the average of 10 runs.
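The paper does not describe how the −4 dB condition is generated. A common approach, assumed here, is to add white Gaussian noise whose power is set from the signal power through SNR(dB) = 10·log10(P_signal / P_noise). The sketch below applies this to a synthetic sine wave that merely stands in for a 2048-point vibration sample; the function names and the 160 Hz tone are hypothetical.

```python
import numpy as np


def add_awgn(signal, snr_db, rng):
    """Add white Gaussian noise so that the expected SNR equals snr_db."""
    signal = np.asarray(signal, dtype=float)
    p_signal = np.mean(signal ** 2)
    p_noise = p_signal / (10.0 ** (snr_db / 10.0))
    noise = rng.normal(0.0, np.sqrt(p_noise), size=signal.shape)
    return signal + noise, noise


def measured_snr_db(signal, noise):
    """Empirical SNR in dB from the clean signal and the added noise."""
    return 10.0 * np.log10(np.mean(signal ** 2) / np.mean(noise ** 2))


rng = np.random.default_rng(0)
t = np.arange(2048) / 12000.0              # 2048 points at a 12 kHz sampling rate
clean = np.sin(2.0 * np.pi * 160.0 * t)    # synthetic stand-in, not bearing data
noisy, noise = add_awgn(clean, -4.0, rng)

print(f"target SNR: -4.00 dB, measured: {measured_snr_db(clean, noise):.2f} dB")
```

At −4 dB the noise power is about 2.5 times the signal power, which is why the text refers to this setting as a strong noise environment.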

In actual production, variable bearing speed and load are very common. Good fault diagnosis ability under complex working conditions is a prerequisite for ensuring the safe operation of rolling bearings. In this paper, the rolling bearing dataset of Case Western Reserve University (CWRU) and the MFS deep groove ball bearing dataset from our laboratory are used for experimental verification.

4.1 Dataset and experimental verification of CWRU

4.1.1 Introduction of CWRU experimental dataset

The rolling bearing dataset used in this paper comes from the experimental data published online by the Bearing Data Center of Case Western Reserve University (CWRU). The CWRU data acquisition test rig^[16], shown in Fig. 5, consists of a three-phase asynchronous motor, a torque sensor and a load. The bearing model used in this paper is SKF6205, and the fault diameters are 0.18 mm and 0.36 mm. The vibration signals are collected by the test bench at a sampling frequency of 12 kHz under loads of 1 hp to 3 hp (1 hp = 0.746 kW). For convenience, the CWRU datasets under loads of 1 hp, 2 hp and 3 hp are denoted dataset A, dataset B and dataset C, respectively. The bearing fault form is pitting corrosion, and the rolling element, inner ring and outer ring faults are all single-point faults produced by electrical discharge machining (EDM). Each fault size covers three fault locations (inner ring, outer ring and rolling element); together with the normal state, this gives 7 classes in total.

4.1.2 Validation of the model under fixed operating conditions

To verify the effectiveness of the proposed method, it is compared with MSC-RCDN^[17], MMCNN^[18], MSC-MpresCNN^[19] and WKCNN^[20] on the CWRU dataset. The fault diagnosis results under fixed working conditions are shown in Table 2. The average recognition accuracy of the proposed model is higher than that of the four comparison methods. Its highest recognition accuracy, 99.43%, is obtained under working condition A-A; this is 0.58 percentage points higher than MSC-MpresCNN, the best of the comparison methods, and 3.49 percentage points higher than WKCNN, the worst, which verifies the effectiveness of the proposed method.

Table 2

Fixed condition fault identification results (identification accuracy/%)
Working condition	BICNN	MSC-RCDN	MMCNN	MSC-MpresCNN	WKCNN
A-A	99.43	97.75	96.68	98.85	95.94
B-B	99.20	97.97	96.82	99.00	96.10
C-C	99.38	97.84	96.96	98.79	95.88
Average	99.33	97.85	96.82	98.88	95.97

4.1.3 Fault diagnosis under variable load conditions

Bearing data under the three loads of 1–3 hp are used for experiments at a signal-to-noise ratio of −4 dB to verify the generalization performance of BICNN. The experimental results are shown in Fig. 6, where A-B denotes training on dataset A and testing on dataset B. The average recognition accuracy of BICNN across the different working conditions is 97.71%, which is higher than that of the other four methods: it is 1.5 percentage points higher than MSC-MpresCNN, which has the best diagnostic performance among the comparison methods, and 5.34 percentage points higher than the worst-performing method. This is because the bidirectional interactive feature extraction module of the BICNN model extracts richer feature information from the channel and spatial dimensions at the same time, while the feature enhancement module controls the retention of information and achieves the transmission and accumulation of more important information. The error bars in Fig. 6 also show that the fault diagnosis performance of BICNN is more stable. The reason is that the M-ReLU activation function addresses the problem of neuron death, improves the model's ability to learn useful features, and enhances its representation ability. In summary, BICNN has higher fault classification accuracy and stronger robustness under strong noise and variable load conditions.

To show the classification results of the model more clearly, the confusion matrix is introduced for visual analysis. With 700 test samples, Fig. 7 shows the confusion matrices of the fault classification results for the variable working conditions A-B and C-A. Fig. 7(a) shows that most faults are classified well in a strong noise environment: only 12 samples are misclassified, of which 9 are 0.18 mm outer ring faults misclassified as 0.36 mm rolling element faults. Fig. 7(b) shows that 20 samples are misclassified. These results show that BICNN can identify different types of bearing faults well under strong noise and variable load conditions and has good classification ability.
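As a small illustration of how such a confusion matrix relates to accuracy, the sketch below builds a 7-class matrix for 700 test samples with 12 errors. Only two facts come from the text: the total of 12 misclassified samples and the 9 outer ring 0.18 mm samples predicted as rolling element 0.36 mm. The class indices and the placement of the remaining 3 errors are hypothetical, since Fig. 7 is not reproduced here.

```python
def confusion_matrix(y_true, y_pred, n_classes):
    """Rows are true classes, columns are predicted classes."""
    cm = [[0] * n_classes for _ in range(n_classes)]
    for t, p in zip(y_true, y_pred):
        cm[t][p] += 1
    return cm


def accuracy(cm):
    """Share of samples on the diagonal of the confusion matrix."""
    correct = sum(cm[i][i] for i in range(len(cm)))
    total = sum(sum(row) for row in cm)
    return correct / total


N_CLASSES, PER_CLASS = 7, 100
OUTER_018, BALL_036 = 2, 6      # hypothetical class indices

y_true = [c for c in range(N_CLASSES) for _ in range(PER_CLASS)]
y_pred = list(y_true)

# From the text: 9 outer ring 0.18 mm samples predicted as rolling element 0.36 mm.
for i in range(9):
    y_pred[OUTER_018 * PER_CLASS + i] = BALL_036

# The other 3 errors are not located in the text; they are placed arbitrarily.
for i in range(3):
    y_pred[4 * PER_CLASS + i] = 5

cm = confusion_matrix(y_true, y_pred, N_CLASSES)
acc = accuracy(cm)
print(f"misclassified: {len(y_true) - round(acc * len(y_true))}, accuracy: {acc:.2%}")
```

With 12 errors in 700 samples the single-run accuracy is 688/700; the values reported for each condition in the paper are averages over 10 runs, so they need not match a single confusion matrix exactly.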

4.1.4 Ablation experiment

To verify the effectiveness of the proposed modules, ablation experiments are carried out at a signal-to-noise ratio of −4 dB. ReLU-BICNN and BICNN denote the BICNN model using the ReLU and M-ReLU activation functions, respectively. The M-ReLU-LeNet5 model replaces the bidirectional interactive feature extraction module with the classic LeNet5 model. The experimental results are shown in Table 3. Compared with the ReLU-BICNN model, the model using the M-ReLU activation function achieves a clear improvement in fault recognition accuracy, with the average recognition accuracy increasing by 2.85 percentage points. This verifies the effectiveness of the M-ReLU activation function and its advantage over the traditional ReLU activation function in extracting effective feature information. Compared with the M-ReLU-LeNet5 model, the accuracy of the BICNN model is higher under every working condition, and the average recognition accuracy increases by 3.86 percentage points. The improvement is largest under the B-A condition, at 4.60 percentage points, which demonstrates the effectiveness of the proposed bidirectional interactive feature extraction module.

Table 3

Ablation experiment results (identification accuracy/%)
Model	A-B	A-C	B-A	B-C	C-A	C-B	Average
ReLU-BICNN	95.43	94.29	95.01	94.43	95.29	94.71	94.86
M-ReLU-LeNet5	94.07	93.51	93.43	94.19	93.60	94.28	93.85
BICNN	97.89	97.41	98.03	97.59	97.77	97.54	97.71
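The averages and gains quoted for the ablation study can be recomputed directly from the per-condition values in Table 3. The sketch below does only this arithmetic; the variable names are illustrative.

```python
conditions = ["A-B", "A-C", "B-A", "B-C", "C-A", "C-B"]

# Identification accuracy in % for each working condition, copied from Table 3.
table3 = {
    "ReLU-BICNN":    [95.43, 94.29, 95.01, 94.43, 95.29, 94.71],
    "M-ReLU-LeNet5": [94.07, 93.51, 93.43, 94.19, 93.60, 94.28],
    "BICNN":         [97.89, 97.41, 98.03, 97.59, 97.77, 97.54],
}

means = {model: sum(vals) / len(vals) for model, vals in table3.items()}

# Gains of the full model in percentage points.
gain_vs_relu = means["BICNN"] - means["ReLU-BICNN"]
gain_vs_lenet = means["BICNN"] - means["M-ReLU-LeNet5"]

# Per-condition gain over M-ReLU-LeNet5 and the condition where it is largest.
per_condition_gain = {
    c: b - l
    for c, b, l in zip(conditions, table3["BICNN"], table3["M-ReLU-LeNet5"])
}
best_condition = max(per_condition_gain, key=per_condition_gain.get)

for model, mean in means.items():
    print(f"{model:<14} mean = {mean:.3f}")
print(f"gain vs ReLU-BICNN:    {gain_vs_relu:.3f}")
print(f"gain vs M-ReLU-LeNet5: {gain_vs_lenet:.3f}")
print(f"largest per-condition gain: {best_condition} "
      f"({per_condition_gain[best_condition]:.2f})")
```

Up to rounding in the second decimal place, the recomputed values agree with the averages in Table 3 and with the gains of 2.85, 3.86 and 4.60 percentage points discussed in the text.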

4.2 MFS dataset and experimental verification

4.2.1 Introduction of MFS dataset

The MFS test rig in our laboratory is shown in Fig. 8. The deep groove ball bearing is of type ER-16K, and the faults are produced by laser etching. The bearing data are collected by the signal collector at a sampling frequency of 15.36 kHz and at speeds of 1200 r/min, 1300 r/min and 1400 r/min. The bearing fault diameters are 0.6 mm and 1.2 mm. Each fault size is applied to the inner ring, outer ring and rolling element; together with the normal state, this gives 7 classes in total. The fault locations are shown in Fig. 9. The MFS datasets at 1200 r/min, 1300 r/min and 1400 r/min are denoted dataset D, dataset E and dataset F, respectively.

4.2.2 Variable speed condition fault diagnosis

The MFS bearing dataset from our laboratory is used to verify the generalization ability of the BICNN model at variable speeds. The experimental results are shown in Fig. 10. The proposed model achieves its highest recognition accuracy, 98.86%, under the D-E condition, which is 2.43 percentage points higher than MSC-RCDN and 6.15 percentage points higher than WKCNN. This is because the proposed bidirectional interactive feature extraction module enables the model to extract features more fully, and the M-ReLU activation function enhances the feature learning ability of the model while retaining the sparsity of the ReLU activation function. The average recognition accuracy of the proposed model is 98.25%, which is higher than that of the other comparison methods. This verifies that the proposed model has better fault diagnosis performance and generalization ability under strong noise and variable speed conditions.

To show clearly how the model separates the fault data, the t-SNE nonlinear dimensionality reduction algorithm^[21] is introduced. The visual analysis is conducted on working condition D-E at a signal-to-noise ratio of −4 dB, and the results are shown in Fig. 11. The raw data are disorganized. After the wide convolution layer, samples of the same fault type gradually gather, but different fault types are still difficult to distinguish. After feature extraction by the bidirectional interactive feature extraction module, samples of each fault type are largely aggregated, but the inter-class boundaries of some fault samples are not yet clear enough. As the feature enhancement module further highlights the more important feature information, the boundaries between classes gradually become clearer, although the inter-class distance is still small. After the final global average pooling layer, the different fault types are accurately separated, with the largest inter-class distance and the smallest intra-class distance. This again verifies the good generalization ability and strong fault diagnosis performance of the proposed model in strong noise environments.

5 Conclusion

This paper proposes a rolling bearing fault diagnosis method based on a bidirectional interactive convolutional neural network (BICNN) to address the poor noise resistance and insufficient generalization performance of models in complex and variable engineering environments. The specific conclusions are as follows:

(1) To address the “neuron death” phenomenon of the ReLU activation function during model training, an M-ReLU activation function is designed to improve the nonlinear expression ability and generalization performance of the model.

(2) This paper proposes a bidirectional interactive convolutional neural network, which can simultaneously extract channel and spatial information and improve the feature extraction ability of the model.

(3) A feature enhancement module is constructed to account for the different importance of different regions of the feature map. Its gating unit controls the retention of information, so that the model focuses on more important feature information.

(4) A comparative analysis is conducted between BICNN and other deep learning methods using the CWRU bearing dataset and MFS bearing dataset. The experimental results show that the proposed model is superior to other comparison methods in terms of fault recognition, anti-noise and generalization performance.

Declarations

Ethics approval

Not applicable.

Consent to participate

Not applicable.

Conflict of interest

The authors declare no competing interests.

Author contribution

Not applicable.

Acknowledgments

This work was supported by the National Natural Science Foundation of China (51465035) and the Natural Science Foundation of Gansu Province (20JR5RA466). These funds support Professor Chunli Lei.

Data availability

The data cannot be made publicly available upon publication because they contain sensitive personal information. The data that support the findings of this study are available upon reasonable request from the authors.

References

[1] Zhao M, Zhong S, Fu X et al (2019) Deep residual shrinkage networks for fault diagnosis[J]. IEEE Trans Industr Inf 16(7):4681–4690
[2] Wang Y, Yang M, Zhang Y et al (2021) A bearing fault diagnosis model based on deformable atrous convolution and squeeze-and-excitation aggregation[J]. IEEE Trans Instrum Meas 70:1–10
[3] Rai A, Upadhyay SH (2016) A review on signal processing techniques utilized in the fault diagnosis of rolling element bearings[J]. Tribol Int 96:289–306
[4] Sinitsin V, Ibryaeva O, Sakovskaya V et al (2022) Intelligent bearing fault diagnosis method combining mixed input and hybrid CNN-MLP model[J]. Mech Syst Signal Process 180:109454
[5] An Y, Zhang K, Liu Q et al (2022) Rolling bearing fault diagnosis method base on periodic sparse attention and LSTM[J]. IEEE Sens J 22(12):12044–12053
[6] Chen Z, Li W (2017) Multisensor feature fusion for bearing fault diagnosis using sparse autoencoder and deep belief network[J]. IEEE Trans Instrum Meas 66(7):1693–1702
[7] Hoang DT, Kang HJ (2019) Rolling element bearing fault diagnosis using convolutional neural network and vibration image[J]. Cogn Syst Res 53:42–50
[8] Yao D, Liu H, Yang J et al (2020) A lightweight neural network with strong robustness for bearing fault diagnosis[J]. Measurement 159:107756
[9] Cui J, Li Y, Zhang Q et al (2022) Multi-layer adaptive convolutional neural network unsupervised domain adaptive bearing fault diagnosis method[J]. Meas Sci Technol 33(8):085009
[10] Li F, Wang L, Wang D et al (2023) Transfer multiscale adaptive convolutional neural network for few-shot and cross-domain bearing fault diagnosis[J]. Meas Sci Technol 34(12):125002
[11] Chang M, Yao D, Yang J (2023) Intelligent Fault Diagnosis of Rolling Bearings Using Efficient and Lightweight ResNet Networks Based on an Attention Mechanism[J]. IEEE Sens J (9):9136–9145
[12] Zhang Q, Li J, Ding W et al (2023) Mechanical fault intelligent diagnosis using attention-based dual-scale feature fusion capsule network[J]. Measurement 207:112345
[13] Liu X, Lu J, Li Z (2023) Multi-Scale Fusion Attention Convolutional Neural Network for Fault Diagnosis of Aero-Engine Rolling Bearing[J]. IEEE Sens J
[14] Xu Q, Jiang H, Zhang X et al (2023) Multiscale Convolutional Neural Network Based on Channel Space Attention for Gearbox Compound Fault Diagnosis[J]. Sensors 23(8):3827
[15] Parhi R, Nowak RD (2020) The role of neural network activation functions[J]. IEEE Signal Process Lett 27:1779–1783
[16] Smith WA, Randall RB (2015) Rolling element bearing diagnostics using the Case Western Reserve University data: A benchmark study[J]. Mech Syst Signal Process 64:100–131
[17] Xu Z, Tang G, Pang B (2023) Multiscale cascade recurrent dilation convolution network for fault diagnosis of rolling bearing under cross-load conditions[J]. Meas Sci Technol 34(7):075101
[18] Zhang K, Wang J, Shi H et al (2021) A fault diagnosis method based on improved convolutional neural network for bearings under variable working conditions[J]. Measurement 182:109749
[19] Chao Z, Han T (2022) A novel convolutional neural network with multiscale cascade midpoint residual for fault diagnosis of rolling bearings[J]. Neurocomputing 506:213–227
[20] Song X, Cong Y, Song Y et al (2021) A bearing fault diagnosis model based on CNN with wide convolution kernels[J]. J Ambient Intell Humaniz Comput:1–16
[21] Belkina AC, Ciccolella CO, Anno R et al (2019) Automated optimized parameters for T-distributed stochastic neighbor embedding improve visualization and analysis of large datasets[J]. Nat Commun 10(1):5415


Reviewers agreed at journal: 15 May, 2024
Reviewers invited by journal: 15 May, 2024
Editor assigned by journal: 06 May, 2024
First submitted to journal: 04 May, 2024
