Through the above experimental results, DenseCapsNet deep learning framework has high sensitivity to COVID-19, and the network framework can indeed obtain sufficient features on small datasets to realize accurate detection of COVID-19. The results obtained in experiment I can show that capsule network can significantly improve the performance of CNN in COVID-19 chest X-ray detection. Experiment II proves that DenseCapsNet does not rely on data augmentation and pre-training, and the framework is robust. We also want to test DenseCapsNet's ability to detect three classifications (Normal, Non-COVID-19 and COVID-19), through Experiment III and compared with COVID-Net, the framework does achieve optimal performance. Next, we will fully discuss the above three groups of comparative experiments.
Experiment I: According to Table 1, when using a traditional CNN alone to detect COVID-19 chest X-rays, the detection abilities of DenseNet121 and ResNet50 can be compared. Other than precision, the indexes are significantly different, and the sensitivity reaches a difference of nearly 20%. This is very important; the higher the sensitivity is, the lower the possibility of the false detection of COVID-19 patients. According to the confusion matrix of Experiment I in Fig. 5, we can clearly see that the two CNN frameworks can achieve good detection for normal X-ray images and have similar performance, but in the detection of COVID-19, the detection capability of the two network frameworks is not satisfactory. DenseNet121 detected 43 COVID-19 patients as normal, ResNet50 detected 84 COVID-19 patients as normal, and the number of false detections by ResNet50 was almost twice that of DenseNet121. This fully proves that DenseNet121 is better than ResNet50 in detecting COVID-19. This may be due to the deepening of the network level; DenseNet 121 can mine deeper features. DenseNet 121 also has excellent feature transfer and feature reuse functions, which make it superior to ResNet 50. However, the performance of DenseNet121 is not satisfactory. In Table 1, we can also clearly see that the combination of a CNN and capsule network is significantly better than the CNN alone in detecting COVID-19. We combine DenseNet121 with CapsNet to form DenseCapsNet, combine ResNet50 with CapsNet to form ResCapsNet, and test the ability of these two models to detect COVID-19. The results of these two frameworks are better than those of the ResNet50 and DenseNet121 models used alone. As can be seen from that confusion matrix of Experiment I in Fig. 5, both DenseCapsNet and ResCapsNet have high sensitivity in detecting COVID-19. ResCapsNet misdetects 5 COVID-19 patients, while DenseCapsNet misdetects only 3 patients, so DenseCapsNet is better. The purpose of our work is not only to improve the detection sensitivity of COVID-19 but also to measure the training time. Under the same parameters and the same test equipment, we use the same dataset to train ResCapsNet and DenseCapsNet. The number of epochs is 30; ResCapsNet takes 12 hours, 47 minutes and 56 seconds, and DenseCapsNet takes 6 hours, 3 minutes and 24 seconds. The training time of DenseCapsNet is less than half that of ResCapsNet, and DenseCapsNet also has significant advantages in terms of the training time.
Experiment II: When we decided to use the DenseNet121 network as the CNN part of the composite network, we performed another group of experiments on the obtained DenseCapsNet. We wanted to explore whether data augmentation and pre-training would have some influence on DenseCapsNet and to compare it with COVID-CAPS. The experimental results are shown in Table 2, and the confusion matrix is shown in Experiment II in Fig. 5. According to the above chart, we can clearly see that the impact of data augmentation and pre-training on DenseCapsNet's detection capability is very small; that is, DenseCapsNet can achieve excellent results without relying on data augmentation and pre-training. When we use the same dataset to train DenseCapsNet without data augmentation and pre-training, the number of iterations is 30, and the time taken is 3 hours, 40 minutes and 35 seconds. The substantial shortening of the training time is mainly due to the elimination of data augmentation. We can increase the batch size to speed up training. We used the same type of dataset as COVID-CAPS, and the amount of data we used was smaller than that for COVID-CAPS. As seen from Table 2, DenseCapsNet's overall performance is better than that of COVID-CAPS, and it can detect COVID-19 patients more accurately, which is of great significance for reducing the screening pressure of medical workers and can significantly increase the efficiency of early diagnosis.
Experiment III: Through the first two experiments, we show that DenseCapsNet can indeed screen COVID-19 patients from chest X-rays of unknown patients. If DenseCapsNet can detect normal images in more detail, the detection of other pneumonia and COVID-19 will be even better. In this way, COVID-19 and other pneumonia could be directly screened from massive chest X-ray data. Doctors could then isolate COVID-19 patients and treat them in a targeted way. Other pneumonia patients would not need separate isolation but centralized treatment. Healthy people would go home for isolation to reduce hospital pressure. We used the same type of dataset B as COVID-Net but with a small amount of data. In Table 3, DenseCapsNet is shown to be significantly more sensitive to the three types of chest X-ray data than COVID-Net, especially for COVID-19, reaching an excellent result of 99.52%. As can be seen from that confusion matrix of Experiment III of Fig. 5 DenseCapsNet correctly detected almost all chest X-rays of COVID-19 patients (only one case was misdetected). However, for the PPV in Table 3, DenseCapsNet's prediction for COVID-19 patients is slightly lower than that of COVID-Net, which means DenseCapsNet predicted some non-COVID-19 chest X-rays as COVID-19. A total of 6 cases were misdiagnosed as COVID-19, of which 5 cases were patients with other pneumonia, mainly because other pneumonia and COVID-19 have a high similarity on chest X-ray images. The detection capability of DenseCapsNet may be further improved by increasing the data volume of the training datasets.
In this work, the combination of a CNN and capsule network is shown to be better than using a CNN alone in the detection of COVID-19. However, the combination of networks also brings some problems, such as the increase of the number of network parameters and the extension of the training time. At present, the number of COVID-19 patients is continuously increasing, and we need a more robust and faster network to perform mass screening of COVID-19. Our work can relieve the pressure of the mass screening of COVID-19 patients to a certain extent, but we will also strive to build better models to cope with the daily spread of the epidemic.