2.1. Overview
The method proposed in this paper to study the regurgitation behavior of fruit flies is divided into three main parts:
1. Detect and recognize the regurgitation behavior of fruit flies with a behavior recognition network.
2. Segment the regurgitated spots with a Unet network combined with the CBAM attention mechanism (and comparison networks), so that the spots can be extracted precisely; the area of each spot is then calculated with OpenCV, allowing the total amount of regurgitation to be estimated.
3. To study insect regurgitation more comprehensively, track the insects' movement with Yolov5 combined with DeepSort, so that the number of insects and their trajectories during regurgitation can be recorded at the same time.
2.2. Experimental equipment and environment
The computer used for the behavior recognition experiments had an Intel(R) Core(TM) i9-9900K CPU @ 3.60 GHz and an NVIDIA GeForce RTX 2080Ti GPU with 11 GB of video memory. The software environment was Ubuntu 20.04.1, Python 3.7, Cuda 11.3, and the deep learning framework Pytorch 1.10.0.
The computer used for the regurgitated spot extraction and insect trajectory tracking experiments had an 11th Gen Intel(R) Core(TM) i5-11400H CPU @ 2.70 GHz and an NVIDIA GeForce RTX 3060 GPU with 6 GB of video memory. The software environment was a Chinese-language operating system, Python 3.8, Cuda 11.5, and the deep learning framework Pytorch 1.10.0.
2.3. Model performance metrics
The first part, the behavior recognition experiment, is a classification task, and Top-1 Accuracy is used to evaluate model accuracy. Top-1 Accuracy and Top-5 Accuracy are both common metrics for evaluating classification models. Top-1 Accuracy takes the category with the highest predicted probability as the prediction; the prediction is judged correct if it matches the actual label. Top-5 Accuracy takes the five categories with the highest predicted probabilities as the predictions; the prediction is judged correct if any of them matches the actual label. Since the behavior recognition experiment distinguishes only fruit fly regurgitation behavior from other behaviors, Top-1 Accuracy was chosen as the evaluation metric.
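As a concrete illustration, Top-1 Accuracy can be computed from prediction scores in a few lines of NumPy. This is a minimal sketch: the function name and the score values are hypothetical, chosen only to show the calculation.

```python
import numpy as np

def top1_accuracy(scores: np.ndarray, labels: np.ndarray) -> float:
    """Top-1 Accuracy: the highest-probability class must match the true label."""
    return float((scores.argmax(axis=1) == labels).mean())

# Two-class case (regurgitation vs. other behavior), invented scores:
scores = np.array([[0.9, 0.1],
                   [0.3, 0.7],
                   [0.6, 0.4]])
print(top1_accuracy(scores, np.array([0, 1, 1])))  # ≈ 0.667 (2 of 3 correct)
```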
In the second part, the regurgitated spot extraction experiment, the main purpose is to evaluate the semantic segmentation results by calculating the MIoU (Mean Intersection over Union). In semantic segmentation, the intersection over union of a single category is the ratio of the intersection to the union of the true labels and the predicted values of that category (Fig. 1).
Here the positive cases refer to regurgitated spots and the negative cases to non-regurgitated spots.
MIoU is the average of the intersection over union for each class of labels in the dataset. The calculation formula is as follows:
$$MIoU=\frac{1}{k+1}\sum _{i=0}^{k} \frac{{p}_{ii}}{\sum _{j=0}^{k} {p}_{ij}+\sum _{j=0}^{k} {p}_{ji}-{p}_{ii}}$$
1
Where \(i\) denotes the true value, \(j\) denotes the predicted value, and \({p}_{ij}\) denotes pixels of true class \(i\) predicted as class \(j\). This is equivalent to
$$MIoU=\frac{1}{k+1}\sum _{i=0}^{k} \frac{TP}{FN+FP+TP}$$
2
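Equations (1) and (2) can be computed directly from a confusion matrix. The helper below is a minimal NumPy sketch; the matrix values are invented for the example.

```python
import numpy as np

def mean_iou(conf_matrix: np.ndarray) -> float:
    """Compute MIoU from a (k+1)x(k+1) confusion matrix.

    conf_matrix[i, j] counts pixels of true class i predicted as class j,
    so the diagonal holds the per-class true positives (p_ii).
    """
    tp = np.diag(conf_matrix)           # p_ii
    fn = conf_matrix.sum(axis=1) - tp   # sum_j p_ij - p_ii
    fp = conf_matrix.sum(axis=0) - tp   # sum_j p_ji - p_ii
    iou = tp / (tp + fn + fp)           # TP / (FN + FP + TP) per class
    return float(iou.mean())

# Two classes: background (non-regurgitated) and regurgitated spots.
cm = np.array([[950, 50],
               [ 30, 70]])
print(round(mean_iou(cm), 4))  # 0.6945
```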
In the third part, the trajectory tracking experiment, two metrics, precision and recall, were used to evaluate the effectiveness of Yolov5 in detecting fruit flies. Precision is a measure of accuracy that describes how many of the predicted positive cases are true positives. Here positive cases refer to fruit flies and negative cases to non-fruit flies; precision is expressed as follows:
$$\text{Precision }=\frac{TP}{TP+FP}$$
3
Recall is a coverage metric that describes, from the perspective of the true outcomes, how many of the positive cases were found, with the following expression:
$$\text{Recall }=\frac{\text{T}\text{P}}{\text{T}\text{P}+\text{F}\text{N}}$$
4
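Equations (3) and (4) reduce to two one-line functions; the detection counts below are hypothetical, used only to illustrate the arithmetic.

```python
def precision(tp: int, fp: int) -> float:
    """Fraction of predicted fruit-fly detections that are true fruit flies (Eq. 3)."""
    return tp / (tp + fp)

def recall(tp: int, fn: int) -> float:
    """Fraction of actual fruit flies that the detector found (Eq. 4)."""
    return tp / (tp + fn)

# Invented counts: 98 correct detections, 2 false alarms, 4 misses.
print(precision(98, 2))  # 0.98
print(recall(98, 4))     # ≈ 0.961
```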
The loss function serves as another evaluation metric during the training of each model; it estimates the degree of inconsistency between the model's predicted and true values and is a non-negative real-valued function. The smaller the loss, the better the robustness of the model.
2.4. Data collection and processing
In the process of data collection, this paper focused mainly on Bactrocera minax and Bactrocera tau. Both species were photographed at the Insect Ecology Laboratory of the College of Agriculture, Yangtze University. Bactrocera minax affects almost all fruits of the genus Citrus in the family Rutaceae, and its individuals are relatively large; Bactrocera tau is smaller than Bactrocera minax and mainly affects squash, cucumber, tomato and other fruits.
In order to induce regurgitation, Bactrocera minax was fed 5% honey water and then placed individually in closed petri dishes. Video of the regurgitation behavior and other actions of the fruit flies was obtained by vertical filming with a Sony video camera (FDR-AX60) at a resolution of 1920×1080 and a frame rate of 50 fps.
In the behavior recognition experiment, the video needed to be edited into clips. The video of Bactrocera minax regurgitating was edited into 50 clips of 10 s each, and the video of other actions (various grooming behaviors and resting states) was likewise edited into 50 clips of 10 s each.
In the semantic segmentation experiment, one image was extracted every 20 frames from the regurgitation video as the dataset for semantic segmentation, yielding 200 images in total.
In the trajectory tracking experiments, videos containing insects at rest, walking, and regurgitating were selected; one image was extracted every 50 frames, yielding 300 images. This part of the experiment used Bactrocera tau, which is much smaller than Bactrocera minax. Its small size makes it harder to track and therefore a better test of whether the network meets the criteria for tracking insects. In addition, since the petri dish is limited in size, the smaller flies move more randomly, which makes the tracking test more demanding and the results more convincing.
2.5. Regurgitation behavior recognition experiment
The behavior recognition task is to identify different behavioral actions from video; the actions can occur continuously or intermittently. Behavior recognition can be seen as an extension of image classification to multi-frame detection, aggregating the predictions for each frame. Traditional behavior recognition focuses on feature extraction from video: it extracts local high-dimensional visual features from video regions, combines them into fixed-size video-level descriptions, and finally uses classifiers for prediction. With the development of deep learning, 2D convolutional neural networks (2DCNN) were applied to behavior recognition. A 2DCNN takes a two-dimensional matrix as input, so the input video must be transformed into images, and the sliding-window operation can only be performed within a single frame. This approach cannot take into account inter-frame motion information in the time dimension, so the application of 2DCNN to behavior recognition is not satisfactory. With 3D convolutional neural networks (3DCNN), however, behavior recognition can be done more effectively. A 3D convolution operates over three dimensions, image width, image height, and time (the frame sequence), and the kernel can move in all three directions; the input video is mapped to an output volume that retains the temporal information, better capturing the temporal and spatial information in the video.19–23
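The shape difference can be illustrated with a short PyTorch sketch: a 3D convolution keeps the frame dimension in its output, so inter-frame motion information survives the operation. The tensor sizes below are invented for the example.

```python
import torch
import torch.nn as nn

# A 16-frame RGB clip: (batch, channels, frames, height, width).
clip = torch.randn(1, 3, 16, 112, 112)

# A 3x3x3 kernel slides over time as well as space, so the output
# is still a video-shaped feature volume, not a single feature map.
conv3d = nn.Conv3d(in_channels=3, out_channels=64, kernel_size=3, padding=1)
out = conv3d(clip)
print(out.shape)  # torch.Size([1, 64, 16, 112, 112]) -- frame axis preserved
```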
In this paper, three typical networks are used for the experiments: 3D Convolutional Networks (C3D),24 Inflated 3D ConvNet (I3D)25 and Expanding Architectures for Efficient Video Recognition (X3D).26 C3D can be regarded as a breakthrough, as it was one of the earliest proposals to apply 3DCNN methods to behavior recognition. It applies 3D convolutional operations to extract spatial and temporal features from video data; these 3D feature extractors operate in both the spatial and temporal dimensions, thus capturing the motion information in the video stream. The structure generates information channels from adjacent video frames and performs convolution and subsampling in each channel separately, combining the information from all channels to obtain the final features. Compared with a 2DCNN, the C3D network is better suited to learning spatio-temporal features: it can model temporal information through 3D convolution and 3D pooling, whereas 2D convolution can only learn features spatially. The I3D network inflates 2D architectures into 3D by temporally inflating all filters and pooling kernels. The main advantage of this method is that the model parameters can be extended to 3D from pre-trained 2D image models, which solves the problem of lacking 3D pre-trained parameters. X3D is a relatively new network model that improves on the previous networks. Earlier 3D networks mainly expanded a 2D convolutional neural network along the time dimension, but expanding along the time scale is not necessarily the best choice; it is worth trying to expand along other axes, such as the total frame length of the input data, the frame rate, the size of the input frames, and the network width and depth.
X3D eventually outperforms the previous networks in accuracy while requiring only one-fifth of their computation and parameters, and it was found that the network can keep the number of channels low while maintaining high input resolution.
The above networks were designed for human action recognition datasets such as Kinetics, UCF101, and HMDB-51. For example, Kinetics contains 400 classes; each sample comes from a different Youtube video, and the corresponding human action is extracted from the video into a clip of about 10 seconds.
The main idea of this paper is to extend human behavior recognition to insect behavior recognition. Because insect movements are much smaller in scale than human movements, it is uncertain whether the network models can extract the fine-grained features of insect actions well when detecting their behaviors. We therefore investigated this question experimentally: we labeled the prepared video clips, labeling each short video as one action, and then fed them into the C3D, I3D and X3D networks for training respectively. (Fig. 2)
Table 1
Values of the hyperparameters for the three different network models evaluated in the study.

| Model | Batch | Momentum | Optimizer | Initial learning rate | Training epochs |
| --- | --- | --- | --- | --- | --- |
| C3D | 16 | 0.9 | SGD | 1e-4 | 50 |
| I3D | 16 | 0.9 | SGD | 1e-4 | 50 |
| X3D | 16 | 0.9 | SGD | 1e-4 | 50 |

SGD, stochastic gradient descent.
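The hyperparameters in Table 1 (SGD, momentum 0.9, initial learning rate 1e-4, batch size 16, 50 epochs) translate into a standard PyTorch training setup. The sketch below is illustrative only: the tiny `nn.Sequential` model is a placeholder, not the actual C3D/I3D/X3D architecture, and random tensors stand in for the labeled clips.

```python
import torch
import torch.nn as nn

# Placeholder stand-in for C3D/I3D/X3D; the real networks come from
# their respective published implementations.
model = nn.Sequential(
    nn.Conv3d(3, 8, 3, padding=1),
    nn.AdaptiveAvgPool3d(1),
    nn.Flatten(),
    nn.Linear(8, 2),          # regurgitation vs. other behavior
)

# Hyperparameters from Table 1: SGD, momentum 0.9, initial lr 1e-4.
optimizer = torch.optim.SGD(model.parameters(), lr=1e-4, momentum=0.9)
criterion = nn.CrossEntropyLoss()

clips = torch.randn(16, 3, 16, 56, 56)   # batch of 16 dummy clips
labels = torch.randint(0, 2, (16,))
for epoch in range(1):                   # 50 epochs in the actual experiments
    optimizer.zero_grad()
    loss = criterion(model(clips), labels)
    loss.backward()
    optimizer.step()
```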
2.6. Regurgitation spots extraction experiment
After detecting regurgitation of fruit flies, we need to semantically segment their regurgitated spots and then calculate the spot area by threshold segmentation, which provides a quantitative assessment for regurgitation studies.
In segmenting the regurgitated spots, the Unet network was used first. Why was Unet chosen? We were inspired by medical image segmentation. Medical image semantics are simpler and more fixed in structure: organs have a fixed structure and are not particularly rich in semantic information, so both high-level semantic information and low-level features are important. The skip connections and U-shaped structure of Unet combine high-level semantic information with low-level features, making it well suited to medical semantic segmentation. The features of fruit fly regurgitation images are similar to those of medical images: the regurgitated spots resemble a group of ellipse-shaped cells, their structure is relatively fixed, and the semantic structure is relatively simple, so all that is needed is accurate segmentation.27–29
In order to obtain better segmentation results, we modified the backbone network of Unet, using Vgg16 and ResNet50 respectively, and then added the CBAM attention mechanism, which further improved the segmentation. To make the experiments more rigorous, ablation experiments were also performed: the semantic segmentation network DeeplabV3+ was used, with Xception and MobileNetv2 as its backbone networks.30–37 The training hyperparameter settings and training results are shown in Table 2.
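The CBAM module mentioned above applies channel attention followed by spatial attention to a feature map. The sketch below is a compact illustration of that idea, not the exact module used in the experiments; the reduction ratio and kernel size follow the common defaults from the CBAM paper.

```python
import torch
import torch.nn as nn

class CBAM(nn.Module):
    """Convolutional Block Attention Module: channel then spatial attention."""
    def __init__(self, channels: int, reduction: int = 16, kernel: int = 7):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(channels, channels // reduction),
            nn.ReLU(inplace=True),
            nn.Linear(channels // reduction, channels),
        )
        self.spatial = nn.Conv2d(2, 1, kernel, padding=kernel // 2)

    def forward(self, x):
        b, c, _, _ = x.shape
        # Channel attention from average- and max-pooled descriptors.
        avg = self.mlp(x.mean(dim=(2, 3)))
        mx = self.mlp(x.amax(dim=(2, 3)))
        x = x * torch.sigmoid(avg + mx).view(b, c, 1, 1)
        # Spatial attention from channel-wise average and max maps.
        s = torch.cat([x.mean(1, keepdim=True), x.amax(1, keepdim=True)], 1)
        return x * torch.sigmoid(self.spatial(s))

feat = torch.randn(1, 64, 32, 32)
print(CBAM(64)(feat).shape)  # torch.Size([1, 64, 32, 32]) -- shape unchanged
```

Because CBAM preserves the feature map's shape, it can be inserted after a backbone stage (e.g. a Vgg16 or ResNet50 block) without altering the rest of the Unet.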
Table 2
Model performance metrics at different training hyperparameter settings for the two convolutional neural networks evaluated in the study

| Model | Backbone | Optimizer | Initial learning rate | Miou | Loss |
| --- | --- | --- | --- | --- | --- |
| Unet | Vgg16 | adam | 1e-4 | 89.5 | 0.056 |
| Unet + CBAM | Vgg16 | adam | 1e-4 | 90.96 | 0.055 |
| Unet | ResNet50 | adam | 1e-4 | 85.20 | 0.096 |
| Unet + CBAM | ResNet50 | adam | 1e-4 | 85.95 | 0.077 |
| deeplabV3+ | Xception | SGD | 7e-3 | 80.69 | 0.3144 |
| deeplabV3+ | MobileNetv2 | SGD | 7e-3 | 80.66 | 0.3211 |
As shown in Table 2, Unet's accuracy is highest when Vgg16 is used as the backbone and combined with the CBAM attention mechanism. Therefore, the training weights of this model were chosen to segment randomly selected fruit fly regurgitation images. Before segmentation, a one-square-millimeter piece of labeled paper was placed in the petri dish as a "scale" and photographed together with the fruit fly, so that the base area of the regurgitated spots could be derived from their pixel counts via the known area and pixel count of the paper.
After segmentation, the extracted regurgitated spots can be seen clearly, but the segmented image contains impurities, such as the fruit flies themselves and tiny impurities on the petri dish, which not only cause visual disturbance but also affect the next step of calculating the spot area. Threshold segmentation can therefore be used to remove the impurities and background. Since only the regurgitated spots and the marker paper need to be retained, we chose binarization, the simplest form of threshold segmentation, to assign black to all impurities and background and to keep and deepen the color of the regurgitated droplets and marker paper, thereby obtaining a cleanly extracted image of the regurgitated spots. The extraction process is shown in Fig. 3.
The number of closed shapes in the image and the pixel count of each closed shape are calculated with OpenCV, and the area of each spot is then obtained from the pixel count and known area of the marker paper.
2.7. Trajectory tracking experiment
In this paper, the Yolov5 object detection algorithm is combined with the DeepSort algorithm, which currently offers good tracking performance, to track the trajectories of fruit flies during regurgitation; it also enables counting of the flies. The most important feature of the DeepSort network is its use of the Kalman filter algorithm and the Hungarian algorithm, both of which greatly improve the accuracy and speed of multi-object tracking. The Kalman filter algorithm consists of two steps, prediction and update. Prediction: as the target moves, the target-box position, velocity and other parameters in the current frame are predicted from the position and velocity parameters of the previous frame. Update: the two normally distributed states, the predicted and observed values, are linearly weighted to obtain the current predicted system state. In other words, the Kalman filter can predict the position at the current moment based on the target's position at the previous moment, and can estimate the target's position more accurately than the sensor alone. The Hungarian algorithm mainly computes similarities to obtain a similarity matrix between two consecutive frames, so as to determine whether a target in the current frame is the same as a target in the previous frame.38–44
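The predict/update cycle described above can be illustrated with a minimal one-dimensional constant-velocity Kalman filter in NumPy. This is a didactic sketch of the general algorithm, not DeepSort's actual multi-dimensional bounding-box filter; the noise matrices and measurements are invented.

```python
import numpy as np

# Minimal 1-D constant-velocity Kalman filter.
F = np.array([[1.0, 1.0],    # state transition: position += velocity per frame
              [0.0, 1.0]])
H = np.array([[1.0, 0.0]])   # we observe only the position
Q = np.eye(2) * 1e-3         # process noise covariance
R = np.array([[0.1]])        # measurement noise covariance

x = np.array([[0.0], [1.0]])  # initial state: position 0, velocity 1
P = np.eye(2)                 # initial state covariance

for z in [1.1, 2.0, 2.9]:     # noisy per-frame position measurements
    # Predict: propagate the previous state one frame forward.
    x = F @ x
    P = F @ P @ F.T + Q
    # Update: linearly weight prediction and observation via the Kalman gain.
    K = P @ H.T @ np.linalg.inv(H @ P @ H.T + R)
    x = x + K @ (np.array([[z]]) - H @ x)
    P = (np.eye(2) - K @ H) @ P

print(x[0, 0])  # estimated position, close to the last measurement
```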
Although the DeepSort network achieves high accuracy and speed in multi-object tracking, it is mostly used for tracking and counting pedestrians and vehicles. It achieves good results on relatively large targets with obvious features, but it is seldom used for insect trajectory tracking and counting.45,46 This is because insects are small, their features are relatively inconspicuous, and their trajectories are much messier than the straight-line movements of vehicles and pedestrians, with no clear movement pattern. In this paper, we attempt to use the DeepSort network to track insects and explore whether a network model can meet the requirements of insect tracking.
Therefore, 270 images of fruit flies were used to train the Yolov5 network, and 30 images were used to verify its performance. After 50 training iterations, the accuracy of the network reached 99.8 percent. The best weights from Yolov5 training were used as the detection weights for DeepSort object tracking. Two Bactrocera tau flies in a video were detected and tracked.