In this study, we aimed to identify four mosquito species that transmit yellow fever or other arboviruses by using a CNN (AlexNet). We also wanted to investigate whether the algorithm performance in classifying the mosquitoes changes according to the body regions shown on the pictures submitted. Our study found that the AlexNet can accurately identify mosquito pictures of the genus Aedes, Sabethes and Haemagogus with over 90% accuracy. Furthermore, the algorithm performance did not change according to the body regions shown.
Lorenz et al.[26] classified mosquitoes based on morphometric characteristics of their wings using neural networks, achieving accuracies ranging from 86–100%. However, an identification system based only on a body structure such as the wing is more fragile because if the structure is not present in the analyzed photo, the identification is compromised. Therefore, a good identification system should work with any part of the insect's body. Sauer et al.[27] showed that best-performing CNN configuration achieved a precision of 99% to discriminate between Aedes and non-Aedes mosquito species; the mean precision to predict the Aedes species was 91% for RGB pictures. Motta et al.[16] used three pre-trained networks to identify urban mosquitoes (Aedes and Culex) achieving an accuracy of 76.2% for the GoogleNet, 52.4% for LeNet, and 51.2% for AlexNet. Okayasu et al.[28] showed better accuracy results (92.3%) with the identification of Ae. albopictus, Anopheles stefensi, and Cx. pipiens pallens using AlexNet based on data augmentation and 12,000 training pictures. More recently, Motta et al[17] optimized the CNN hyperparameters and obtained 97.3% accuracy in distinguishing between the mosquitoes of the genus Aedes and the Culex mosquitoes. Similarly, Goodwin et al.[29] and Park et al.[13] achieved 97% accuracy rates for mosquito identification (Anopheles, Culex, Psorophora, and Aedes species) using deep learning neural networks. These networks rely on morphological features like those used by taxonomists [13]. Kittichai et al[15] using two YOLO v3 model identified Ae. aegypti, Ae. albopictus, and Cx. quinquefasciatus at a mean average accuracy of 98–100%. A recent study has shown that the accuracy and robustness of the CNN may reach 99% accuracy by incorporating spatial dropout layers to regularize the network and by modifying its structure to incorporate multi-view inputs [30]. Concatenating two YOLO v3 model exhibited the optimal performance in identifying mosquitoes, with mean average accuracy of 99%. The Swin MSI successfully identified 100% subspecieslevel in Culex pipiens Complex. Based on pictures of all body regions, AlexNet identified Ae. scapularis, Ae. serratus, Ha. leucocelaenus, and Sa.albiprivus at 94% accuracy on average (Fig. 2). Compared to previous studies that have used neural networks for mosquito identification, our accuracy rate of 94% is aligned with most results obtained by others.
In our study, we did not find a significant difference in AlexNet performance in identifying mosquitoes based on different body regions. In fact, other studies have shown that CNNs are able to detect morphological differences in various body regions of Aedes mosquitoes[13], some of which are consistent with the most used dichotomous keys [31]. Such results reveal that deep learning models learn the distinctive morphological features of mosquitoes body areas; these are the same ones used by taxonomists. For instance, Aedes scapularis can be identified by its serrated abdomen, a proboscis that is larger than the anterior femur, and the mesonotum with white scales forming a circle. Aedes serratus is identified by its serrated abdomen, a proboscis that is similar to or smaller than the anterior femur, and a mesonotum that may or may not have a longitudinal stripe of white scales. These two species bear a striking resemblance to each other. Sabethes albiprivus has medium-sized legs with bluish scales, a golden-scaled abdomen that forms quadrilaterals, and a proboscis that is much smaller than the anterior femur. Sabethes albiprivus and Ha. leucocelaenus are two species with different morphological characteristics. Sa. albiprivus can be distinguished from Ha. leucocelaenus by its predominantly dull, dark color and pleura with two vertical lines of silvery scales[7]. Other studies show that the accuracy of CNNs in identifying other insects is not significantly affected by the body region shown on the picture [22]. Our findings show that the morphological characteristics used for the identification of the mosquitoes included in this study are present in multiple regions of the body and therefore any of the body regions here studied allowed the AlexNet to accurately identify the mosquito species.
Deep learning neural networks consist of multiple convolutional layers, and databases with more pictures are more conducive to learning [21]. Additionally, many studies indicate that a larger picture bank improves the algorithm's performance [13, 14, 16, 17, 22, 32, 33]. Even though a database with thousands of pictures is always desired, using a database with only 1,343 pictures, we reached accuracy rates similar to those using databases 10x bigger than ours [17, 28] AlexNet accuracy to identify Sabethes and Haemogogus mosquitoes was similar to the accuracy obtained with other CNNs used to identify other genera [17, 28]. However, the accuracy of the AlexNet in identifying Ae. serratus and Ae. scapularis was below 90% and thus suboptimal when compared with the performance of other CNNs (VGG-16, ResNet-50, SqueezeNet) that apply data augmentation and fine-tuning techniques to identify Ae. aegypti [16, 17], Ae. albopictus and Ae. vexans [13].
These accuracy values may be due to differences in algorithm training [13]. For instance, optimization of CNN hyperparameters increase the accuracy of mosquito identification [17]. The poor performance of the algorithm in some cases may have been influenced by the state of preservation of the specimens. Analysis of the misidentified pictures in all experiments showed that the photographed specimens were not well preserved, especially in the pronotum area, where bristles and scales were missing, as well as the legs. Due to their size and the presence of scales and bristles, mosquitoes are easily damaged during capture, freezing, and drying, resulting in the loss of critical morphological features necessary for proper identification. The state of preservation of the mosquitoes was a limiting factor in this work, and good preservation of specimens is important for optimal algorithm performance.
In this study we successfully identified four species of mosquitoes that transmit yellow fever or other arboviruses using AlexNet. Our results support the idea of applying CNNs to the AI-driven identification of mosquito vectors of tropical diseases. This approach can potentially improve the surveillance of yellow fever vectors by health services and the population as well. Additional studies applying algorithms that identify mosquitoes may clarify which visual information is most relevant for the AI-driven identification of mosquito species.