2.1 Ethical Statement
The trial was carried out in accordance with D.Lgs. 26/2014 and EU Directive 2010/63/EU concerning experiments on animals, and the experimental protocol was approved by the animal welfare committee (Organismo Preposto al Benessere Animale – OPBA – official number 167326) of Padova University according to D.Lgs. 26/2014. All methods were performed in accordance with the OPBA’s guidelines and regulations in compliance with D.Lgs. 26/2014. The research adhered to the ARRIVE guidelines for both study design and reporting. The protocol consisted of the observation, over 27.3 h, of 12 healthy, randomly chosen mid-lactation dairy cows wearing a tri-axial accelerometer, in order to assess the accuracy of a deep learning model in predicting cow behaviour.
2.2 Data collection
Animal husbandry and data collection are described in detail by Balasso and colleagues in a paper reporting the use of classical ML to classify dairy cow behaviour [21]. Briefly, the trial was carried out on an Italian dairy farm raising Italian Red-and-White cows in loose housing conditions. Twelve randomly selected healthy cows with 2.87 ± 0.91 lactations and 180 ± 35 days in milk (mean ± SD) were observed by two trained operators for on average 136 ± 29 min per cow over a period of 12 days. Animals were observed approximately between 1100 h and 1500 h, to capture as wide a variety of behaviours as possible; the operators recorded cow behaviour in real time using Microsoft Excel 2010 (Microsoft, Redmond, WA, USA) on a computer synchronized with the sensor. Inter-observer reliability, computed through Cohen’s kappa [27], was 0.91. During the observation sessions the cows wore a tri-axial (X, Y, Z) accelerometer (model MSR145W, PCE Italia srl, Capannori, LU, Italy), applied to the center of the left paralumbar fossa with an elastic band [21]. The sensor was set to collect data at 5 Hz [21], a frequency high enough to capture short-term behaviours while preserving battery life. The accelerometer was fixed so that, on a standing animal, the axes were in a pre-set position: X vertical, Y parallel to the ground, and Z orthogonal to the cow’s flank. Five behaviours were considered: moving, standing still, feeding, ruminating, and resting, as reported in Table 2.
Table 2. Behaviour description for dairy cow.
Behaviour | Definition1
Standing still | Cows stand still without moving their legs or showing any sign of activity
Feeding | Cows ingest the feed and chew it at the feed bunk
Moving (walking or moving slightly) | Cows walk across the pen or, while standing, perform behaviours other than those described here, characterized by at least one step every 10 seconds
Ruminating | Chewing that begins upon regurgitating a bolus and ends when the bolus is swallowed, in either a standing or lying position
Resting | Cows lie on the floor, not moving nor ruminating
1 Adapted from Balasso et al. [21].
2.3 Dataset preparation
Acceleration data on the X, Y and Z axes were exported as a .csv file using the MSR 5.12.04 software (PCE Italia srl, Capannori, LU, Italy). Data were then imported into Excel 2010 (Microsoft, Redmond, WA, USA), where each row reported the collection time (date, h, min, s, hundredths of a second), the acceleration values on the X, Y and Z axes, and the corresponding behaviour in separate columns. Statistical analyses were performed using R, version 3.2.1 (R Core Team 2013). Tri-axial accelerations were recorded every 0.2 s, corresponding to 27.3 h of observation (n = 490,900). Observations during which the behaviour was unclear were excluded from the dataset, leaving 25.4 h of observation suited for analyses (n = 456,730), including feeding (4.68 h; n = 84,206), moving (4.69 h; n = 84,400), resting (7.84 h; n = 141,055), ruminating (2.98 h; n = 53,744), and standing still (5.18 h; n = 93,325).
A list of metrics was computed over a rolling window of 15 observations: standard deviation (sd), average (avg), percentage change between an observation and the previous one, and the binary value derived from it (0 if the percentage change is negative, 1 otherwise), applied to the X, Y and Z acceleration data, for a total of 15 variables. As reported in Fig. 2, each interval of 40 observations (8 s) with a sliding step of 13 observations (33%) was summarized into one observation unit and associated with a specific behaviour, yielding a dataset of 211,720 observation units and 15 columns. All intervals spanning two different behaviours were excluded. The 8 s interval was chosen because it offered the best compromise for differentiating very short behaviours, such as walking, from the others.
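The rolling metrics and sliding-window segmentation described above can be sketched in NumPy. This is a minimal illustration on synthetic data, not the authors' code: the array sizes, the `rolling_features` helper, and the random input are all hypothetical; only the window of 15 observations, the 40-observation (8 s) segments, and the 13-observation slide come from the text.

```python
import numpy as np

rng = np.random.default_rng(0)
acc = rng.normal(size=(200, 3))  # hypothetical X, Y, Z accelerations sampled at 5 Hz

WIN = 15  # rolling window of 15 observations, as in the text

def rolling_features(a, win=WIN):
    """Per-axis rolling sd and average, plus percentage change and its binary sign."""
    # sliding windows over time: shape (n - win + 1, 3, win)
    windows = np.lib.stride_tricks.sliding_window_view(a, win, axis=0)
    sd = windows.std(axis=-1)           # rolling standard deviation per axis
    avg = windows.mean(axis=-1)         # rolling average per axis
    pct = np.diff(a, axis=0) / a[:-1]   # percentage change vs previous observation
    binary = (pct >= 0).astype(int)     # 0 if the change is negative, 1 otherwise
    return sd, avg, pct, binary

sd, avg, pct, binary = rolling_features(acc)

# 8 s segments (40 observations at 5 Hz) with a sliding step of 13 observations (~33%)
SEG, STEP = 40, 13
starts = np.arange(0, acc.shape[0] - SEG + 1, STEP)  # start index of each observation unit
```

Each start index in `starts` marks one 8 s segment that would then be summarized into a single observation unit and labelled with the behaviour observed in that interval.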
To build a predictive model, the dataset was randomly split into training (80% of the observations, n = 169,376) and testing (20% of the observations, n = 42,344) datasets. The latter was used to estimate the performance of the model. All variables were normalized using the mean and standard deviation of the training dataset.
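The split-and-normalize step can be sketched as follows. The data here are synthetic and the 1,000-row size is hypothetical; the point of the sketch is that the mean and standard deviation come from the training split only, so no test-set statistics leak into the model.

```python
import numpy as np

rng = np.random.default_rng(42)
X = rng.normal(loc=3.0, scale=2.0, size=(1000, 15))  # hypothetical observation units x 15 variables
y = rng.integers(0, 5, size=1000)                    # five behaviour classes

# random 80/20 split into training and testing sets
idx = rng.permutation(len(X))
cut = int(0.8 * len(X))
train_idx, test_idx = idx[:cut], idx[cut:]
X_train, X_test = X[train_idx], X[test_idx]
y_train, y_test = y[train_idx], y[test_idx]

# normalize both splits with the TRAINING mean and standard deviation
mu, sigma = X_train.mean(axis=0), X_train.std(axis=0)
X_train = (X_train - mu) / sigma
X_test = (X_test - mu) / sigma
```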
2.4 Data Modelling
As reported in Table 3, a CNN model made of 8 layers was built using 5 kinds of layers: convolution (n = 3), dropout (n = 1), max-pooling (n = 1), flattening (n = 1), and dense (n = 2).
- Convolution is a process in which a small matrix (the kernel, or filter) is slid across the input data, which are transformed on the basis of the filter values. As reported in Table 3, the number of filters in the Conv1d_1, Conv1d_2 and Conv1d_3 layers was set to 128, 64 and 32, respectively. For all three layers, the kernel size was set to 3 and the activation function was the rectified linear unit (ReLU).
- The dropout layer randomly selects neurons that are ignored during training, which helps prevent overfitting. The fraction of neurons dropped at each step is set by a rate; in this model the rate was set to 0.3.
- Max-pooling was used to reduce the size of the tensor and accelerate calculations. It downsamples the input representation by taking the largest value over a window defined by the pool size, which in our case was set to 2.
- Flattening reduces the data to a one-dimensional array, removing every dimension but one so that the subsequent dense layers can read it. As reported in Table 3, the output shape of the layer is 544, which equals the product of 17 and 32, the two dimensions of the previous layer.
- A dense layer consists of a finite number of neurons (mathematical functions) which receive one vector as input and return another as output. The first dense layer was made of 100 neurons with a ReLU activation function and was connected to the last dense layer, with a softmax activation function and a length of 5, equal to the number of activities to be classified by the model. The model was deployed in Python using Keras [28] with a TensorFlow backend.
It is noteworthy that the final layer’s output shape is 5, given that there are 5 behaviours to classify.
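The architecture described above can be sketched in Keras. This is a minimal reconstruction, not the authors' code: the (40, 15) input shape is inferred from the output shapes and parameter counts in Table 3 (40 time steps per 8 s window, 15 variables) rather than stated explicitly, and the optimizer and loss are assumptions.

```python
from tensorflow import keras
from tensorflow.keras import layers

# Sketch of the 8-layer CNN in Table 3; the (40, 15) input shape is an
# inference from the reported output shapes, not stated in the text.
model = keras.Sequential([
    layers.Input(shape=(40, 15)),
    layers.Conv1D(128, kernel_size=3, activation="relu"),  # -> (None, 38, 128)
    layers.Conv1D(64, kernel_size=3, activation="relu"),   # -> (None, 36, 64)
    layers.Conv1D(32, kernel_size=3, activation="relu"),   # -> (None, 34, 32)
    layers.Dropout(0.3),                                   # rate = 0.3
    layers.MaxPooling1D(pool_size=2),                      # -> (None, 17, 32)
    layers.Flatten(),                                      # -> (None, 544)
    layers.Dense(100, activation="relu"),
    layers.Dense(5, activation="softmax"),                 # 5 behaviours
])
# optimizer and loss are assumptions; the paper does not report them
model.compile(optimizer="adam", loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
```

Built this way, the model reproduces the 91,709 trainable parameters reported in Table 3.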
Table 3. Summary of the Deep Learning model architecture, with description of the layers used, output shape, and the number of parameters used in the model for each layer.
Layer (type) | Output Shape | Parameters
Conv1d_1 (Conv1D) | (None, 38, 128) | 5,888
Conv1d_2 (Conv1D) | (None, 36, 64) | 24,640
Conv1d_3 (Conv1D) | (None, 34, 32) | 6,176
dropout_1 (Dropout) | (None, 34, 32) | 0
max_pooling1d_1 (Max-pooling) | (None, 17, 32) | 0
flatten_1 (Flatten) | (None, 544) | 0
dense_1 (Dense) | (None, 100) | 54,500
dense_2 (Dense) | (None, 5) | 505
Total parameters: 91,709
Trainable parameters: 91,709
Non-trainable parameters: 0
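The parameter counts in Table 3 follow from standard formulas: a Conv1D layer has filters × (kernel_size × input_channels + 1) parameters (one bias per filter), and a dense layer has units × (inputs + 1). The short check below reproduces the table, assuming 15 input variables per time step (an inference from the 5,888 parameters of the first layer, not stated explicitly).

```python
def conv1d_params(filters, kernel_size, in_channels):
    # kernel_size * in_channels weights per filter, plus one bias per filter
    return filters * (kernel_size * in_channels + 1)

def dense_params(units, inputs):
    # one weight per input per neuron, plus one bias per neuron
    return units * (inputs + 1)

counts = [
    conv1d_params(128, 3, 15),   # Conv1d_1: 5,888 (15 input variables assumed)
    conv1d_params(64, 3, 128),   # Conv1d_2: 24,640
    conv1d_params(32, 3, 64),    # Conv1d_3: 6,176
    0, 0, 0,                     # dropout, max-pooling and flatten add no parameters
    dense_params(100, 544),      # dense_1: 54,500 (544 = 17 * 32 after flattening)
    dense_params(5, 100),        # dense_2: 505
]
print(sum(counts))  # 91709, matching the total in Table 3
```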
This CNN model was chosen out of three CNN models and one CNN Long Short-Term Memory (LSTM) network, since it gave the best performance. Training took about 90 minutes per model using Google Colaboratory, a cloud-based notebook environment for writing, executing, and sharing code in Google Drive. Google Colaboratory gives free access to GPUs (Graphics Processing Units) and TPUs (Tensor Processing Units) with the characteristics and performance reported in Table 4.
Table 4. Summary of the Graphics Processing Unit (GPU) characteristics and performance made available in Google Colaboratory.
Parameter | Value
GPU | Nvidia K80 / T4
GPU Memory | 12 GB / 16 GB
GPU Memory Clock | 0.82 GHz / 1.59 GHz
Performance | 4.1 TFLOPS / 8.1 TFLOPS
Mixed Precision Support | No / Yes
GPU Release Year | 2014 / 2018
No. CPU Cores | 2
Available RAM | 12 GB (upgradable to 26.75 GB)
CPU, central processing unit; RAM, random access memory.
Fig. 3 reports the learning curve of the model, a line plot showing how model accuracy increases over training. Models are trained over a large number of epochs, allowing the learning algorithm to run until the error from the model has been sufficiently reduced. An epoch is one complete pass in which each sample in the training dataset has had an opportunity to update the internal model parameters; the number of epochs is a hyperparameter defining how many times the learning algorithm works through the whole training dataset [29].
2.5 Model Assessment
Average accuracy (macro and weighted), recall, precision, and F1-score were calculated in order to measure the CNN’s capability in predicting cow behaviour [22]. Once the numbers of true positives (TP), true negatives (TN), false positives (FP), and false negatives (FN) have been determined, accuracy is calculated as accuracy = (TP + TN) / (TP + FP + FN + TN) and gives an overall measure of correctly identified behaviours [21]. In the macro average, all classes are assigned equal weight when contributing to the total; this can be misleading when there is a large class imbalance, in which case a weighted average is more informative, with weights given by the frequency of each class in the ground truth. The other parameters were calculated as follows: Recall = TP / (TP + FN); Precision = TP / (TP + FP); F1-score = (2 × Precision × Recall) / (Precision + Recall). The latter is a single score that balances precision and recall in one number, as reported in the literature [22].
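The metrics above, and the difference between macro and weighted averaging, can be illustrated with a small numeric sketch. The confusion matrix here is entirely hypothetical (it is not the study's result); it only serves to show how per-class TP, FP and FN yield recall, precision, F1, and the two averaging schemes.

```python
import numpy as np

# Hypothetical confusion matrix for the five behaviours
# (rows = observed class, columns = predicted class).
cm = np.array([
    [80,  5,  3,  2, 10],
    [ 4, 70,  6,  8, 12],
    [ 2,  3, 90,  1,  4],
    [ 5,  9,  2, 60,  4],
    [ 6,  8,  1,  3, 82],
])

tp = np.diag(cm).astype(float)   # correctly predicted, per class
fp = cm.sum(axis=0) - tp         # predicted as this class but observed as another
fn = cm.sum(axis=1) - tp         # observed as this class but predicted as another
support = cm.sum(axis=1)         # class frequency in the ground truth

recall = tp / (tp + fn)
precision = tp / (tp + fp)
f1 = 2 * precision * recall / (precision + recall)

macro_f1 = f1.mean()                                 # every class weighted equally
weighted_f1 = (f1 * support).sum() / support.sum()   # weighted by class frequency
overall_accuracy = tp.sum() / cm.sum()               # fraction of correct predictions
```

With an imbalanced ground truth, `macro_f1` and `weighted_f1` diverge: the weighted version pulls the score toward the performance on the most frequent behaviours.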