Data Quality Management and Risk Assessment of Dairy Farming with Feed Behaviour Analysis Using Big Data Analytics with YOLOv51 Algorithm

doi:10.21203/rs.3.rs-4519712/v1

Download PDF

Research Article

Data Quality Management and Risk Assessment of Dairy Farming with Feed Behaviour Analysis Using Big Data Analytics with YOLOv51 Algorithm

https://doi.org/10.21203/rs.3.rs-4519712/v1

This work is licensed under a CC BY 4.0 License

Version 1

posted

You are reading this latest preprint version

Dairy farming is a vital sector of agriculture that plays a significant role in the global food supply chain. It provides essential dairy products such as milk, cheese, and yoghurt, contributing to both economic stability and food security. However, the dairy industry faces a multitude of challenges, including environmental concerns, animal health and welfare, and economic fluctuations. Amidst these challenges, optimizing dairy farm operations is crucial to ensure sustainability and profitability. The objective of this work is a comprehensive approach to address data quality management and risk assessment within the context of dairy farming, with a specific focus on feed behaviour analysis. The study begins by addressing the proliferation of big data necessitates paradigm shifts from conventional approaches in applying machine learning techniques to this huge quantity of data with varying velocity. The research proposed Apache Spark HDFS is designed to process a huge volume of data. Proper nutrition management is essential to prevent ketosis. Enhancing context across multiple scales modules was developed to rage the structures of ResNet and YOLOv5, allowing for improved extraction of contextual information from images through cross-connected semantic feature extraction modules and backbone networks. Providing a balanced diet that meets the energy requirements of the cows is important in preventing negative energy balance. Additionally, monitoring feed intake and adjusting the diet as needed can help prevent ketosis in dairy cows. This study aimed to forecast the likelihood of ketosis occurrence in dairy cows through the use of machine learning algorithms of Cascade feedforward artificial neural network. In this work, the study applies the (BOA) to the process of Stacking ensemble to generate domain-specific configurations based on non-invasive prenatal indicators of parity, body condition score, dystocia score, daily activity, daily rumination time, and season of calving, drinking time, eating time, bolus, drinking gulps, chews per minute. The simulation of this experiment is implemented using Python software. The findings exhibited the proposed algorithm positions out with an imposing accuracy rate of 95.5%, highlighting its capability for precise classifications. These findings can improve dairy farm sustainability, profitability, and the welfare of cattle, benefiting the global food supply chain.

Risk Assessment

Dairy Farming

Feed Behaviour Analysis

Yolov51 Algorithm

Ketosis and Mastitis and Data Quality Management

The practice of farm management is constantly evolving, and access to high-quality, real-time, integrated data is essential for herd managers to effectively identify feeding and health issues, optimize reproduction rates, and enhance overall production systems [1]. The utilization of new technologies on the precision livestock farming involves using the principles and techniques of process engineering in combination with IoT technology to manage livestock the processes involved in dairy farming production. [2]. Looking at each data stream in isolation, we can gather valuable insights into the various operations that occur on a dairy farm, including feeding management choices, genetic testing results, milk composition data, and heat detection activities. [3]. Farm organization evolves constantly, in real-time, and good-quality, combined data can support herd managers in identifying health problems and feeding, optimising reproduction, and increasing the complete production system [4]. There is a growing need to automatically merge various farm data sources into decision-making tools in order to provide farmers with integrated advice for more efficient herd management. [5].

Technology is being developed to hold vast quantities of complex data from diverse sources [6–7]. The terms "Big Data" and "Decision Support Systems" (DSS) computerized multidimensional data management systems assist stakeholders in utilizing data-driven methods to identify and address issues, ultimately facilitating improved decision-making processes. [8]. Big Data is the term used to describe vast amounts of intricate online information compiled quickly from various sources such as research, industry, organizations, online databases, and social media. There is a growing trend in food safety research towards utilizing Big Data and decision support systems (DSS) for conducting assessments of food safety risks. [9]. Differences in characteristics between organic and conventional dairy farms in the United States include smaller herd sizes, non-free-stall housing, and diets based on grazing. [11]. These management factors can also play a role in determining the occurrence and prevalence of various diseases, potentially leading to a distorted understanding of the actual impact of organic management practices on disease incidence. [12]. It is difficult to determine how the definition and perception of disease by animal caregivers influence the incidence and detection of disease [13]. Farmers' attitudes towards mastitis can impact the rate of clinical mastitis on their farms. Treatment options for diseases, including mastitis, differ between organic and conventional farming methods, potentially shaping how farmers perceive and address the issue. [14]. Researchers associating rates of clinical mastitis on CON and ORG farms often report less disease on organic farms. The occurrence of less clinical mastitis among ORG farms has been attributed to reduced milk production and improved cow cleanliness. The farmer's understanding and definition of mastitis may lead to a discrepancy in the reported rate of clinical mastitis. [15] As such, this work proposes to investigate data management and risk analysis in dairy farming. The remaining sections are arranged as follows: The literature review was described in Section 2, the study problem identification and motivation were described in Section 3, the proposed technique was described in Section 4, the results were discussed in Sections 5 and 6, and Section 6 ends with the conclusion.

El Bas et al [16] introduced a new framework for managing environmental supply chain risks in the situation of Industry 4.0. This framework utilizes data mining techniques to detect, evaluate, and address environmental risks effectively. As a result with the growing volume of data, data mining techniques and utilizing business intelligence have become crucial for effectively managing supply chain risk. Bellato et al [17] proposed to estimate the removal of a fresh cow from the herd. To account for unmeasured herd-level risk factors, mixed-effect generalized linear models were employed for the analysis. The results indicate that milk analysis could be also useful for predicting mastitis, its cure rate, and ketosis. Roussaki et al [18] proposed an orientation architecture that tackles the main experiments in the agricultural domain, the system can be further enhanced to provide a wider range of services due to improved interoperability. Luna et al [19] suggested that it would be beneficial to broaden studies on the perception of risk in aquaculture to include not only producers, but also other key stakeholders in the global value chain, such as policymakers, feed producers, and financial entities. All this provides a clear picture of risk sources in aquaculture and more information and knowledge pointing to some blind spots in producers’ perceptions which could result in inappropriate risk management decisions. Gruber et al [20] investigated the suitability of MIR spectral data for the prediction of bovine clinical mastitis and clinical ketosis with different methods, as well as various milk components, and outlined particularly relevant wavelengths for predictions, respectively. Studies that seek to predict traits such as mastitis and ketosis in dairy cows should consider utilizing spectral regions beyond the suggested ones, as valuable information can be found in these additional areas.

Kandhro et al [21] Suggested arsenic, cadmium, and lead in whey milk samples from cattle and human subjects in various regions were analyzed in Sindh, Pakistan were assessed based on their consumption of drinking water and surface water. The toxic elements were analyzed using inductively coupled plasma-optical emission spectrometry following both the proposed and conventional heating methods. Feyissa et al [22] This study examines the factors manipulating the acceptance of enhanced dairy farming practices (IDFP) such as improved breed, feed, and feeding conditions, and their impact on household food security in the central highlands of Ethiopia. Castillo Rodríguez et al [23] proposed determine if ADA and redox status dimensions in saliva can serve as effective indicators of disease conditions compared to the traditional method of using serum samples. Bradfield et al [24] This study examines the impact of land lease durations on investment decisions on dairy farms in Ireland between 2015 and 2018. The analysis considers the low levels of land sales and rentals, in addition to the presence of liberal market regulations. Specifically, the study explores the relationship between land leases and the likelihood of making capital or herd investments, taking into account factors such as the percentage of rented land and the length of leases. Eshete et al [25–28] suggested farm owners, managers, and attendants should focus on improving their awareness of proper feed and feeding management, precise heat detection, and optimal insemination timing [29–32]. This will help decrease the number of recurrence breeders and related reproductive issues, ultimately reducing economic loss on dairy farms. Additionally, it is necessary to enhance housing, health and reproductive management to further mitigate these challenges [33–36].

Dairy cows have a highly efficient system for converting roughage feeds into milk, making milk a nutritious and essential food source for humans. Milk products are crucial in providing necessary nutrients, especially in less developed regions and hot climates. Ensuring high-quality and safe milk products is a key priority in meeting the nutritional needs of people worldwide. Cows are kept and managed in different systems, where maintaining animal health, following proper milking practices, and ensuring hygiene in the milking parlour are crucial for reducing microbial contamination in raw milk. Proper housing, feeding, and equipment are important for ensuring the well-being of the cows and facilitating effective management of the animals. Another predictive model that is crucial for the dairy industry is the prediction of cow health. By analyzing various data points such as movement patterns, rumination, and body temperature, it is probable to predict potential health issues like mastitis or metabolic disorders before they become severe. This can help farmers take preventive measures and provide timely treatment, ultimately improving the overall health and welfare of the cows. It can also assist in selecting dairy cows earlier for genetic improvements and breeding programmes. The prevalence of Big Data in food safety has grown significantly. Information related to the food supply chain is widely dispersed and diverse in format, scale, and geographical location. Additionally, the interconnected relationships between environmental factors, food contamination, and foodborne illnesses are intricate, ever-changing, and difficult to forecast. Farm management is a sequence of complex processes, which require decision-making while taking into explanation many financial and social factors.

Cutting-edge technology is being created to manage extensive, intricate data from a variety of sources. Big Data and Decision Support Systems (DSS) are computerized systems that handle multidimensional data to assist stakeholders in utilizing modern data-driven methods for problem-solving and improved decision-making. The importance of Smart Farming is increasing as Internet of Things (IoT) technologies play a serious role in farm management. With a significant quantity of data being generated from various sources such as sensors, Information Systems (IS), and human observations, making timely and informed decisions is essential for effectively managing and operating farms. However, this process is hindered by technical and socio-economic limitations.

Figure 1 depicts the block diagram of the proposed work. The study aims to utilize Apache Spark (HDFS) for efficiently handling massive amounts of data. The model enhancement includes integrating ResNet backbone network with the YOLOv51 algorithm, leveraging multiple feature scales and a cross-connected semantic feature extraction module to enhance semantic feature interactions. This method proposes to improve classification accuracy and enhance the utilization of accumulated semantic information and constraints. Finally achieving the recognition of cow feeding behaviour in the farm feeding environment. The focus of this research was to identify key factors in increase of the dairy industry at the herd level, with a particular emphasis on the integration of production and health management systems. Monitoring and evaluation of ketosis risk are essential components of health management for dairy cows on farms. This work endeavored to predict the hazard of the use of machine learning models such as the Cascade feedforward artificial neural network to predict ketosis in dairy cows. In this work, the study applies the Butterfly Optimization Algorithm (BOA) to the process Stacking ensemble construction produce domain-specific configurations based on non-invasive prenatal indicators of parity, eating time, body condition score, dystocia score, daily rumination time, daily activity, and bolus, season of calving, drinking gulps, drinking time, chews per minute.

4.1 Big Data Analytics

Big data analytics can also help businesses make better decisions by providing insights that traditional methods may overlook. By analyzing large datasets, companies can identify trends, patterns, and correlations that can lead to more informed decision-making. This farming big data is way too large to be stored on a single node, but it should be distributed across multiple nodes. Therefore, numerous companies postpone the process of compiling and merging huge quantities of data due to their worries about how using big data will affect the ability to extract valuable information from the data and the quality of decision-making within the firm. Accordingly, the research proposed Apache Spark HDFS is designed to process a huge volume of data.

4.1.1 Apache Spark Hadoop Distributed File System (HDFS)

HDFS cannot store a vast quantity of data on a single node, so Hadoop utilizes a different file system known as Apache Spark HDFS. This file system divides data into smaller segments and disperses each segment across multiple nodes. The HDFS is created to accommodate massive data sets and efficiently transmit data to user applications. In extensive clusters, numerous servers host storage and run user applications. HDFS have two types of nodes in HDFS for the DataNodes (Master) and the NameNode (Worker). They support operations to read, write and delete files, and operations to create and delete directories. The Name Node is contacted to request access permission. The Name Node, if approved, will convert the HDFS filename into a compilation of the HDFS block IDs that make up the file and a roster of Data Nodes that house each block, before sending these compilations back to the client. Hadoop Distributed File System (HDFS) generally divides the file systems into data and metadata. HDFS has two important benefits in comparison with the traditional distributed file system.

One key advantage of this system is its high level of mistake tolerance, as it retains multiple copies of data across various data clients, allowing for easy recovery in the occasion of errors. The second benefit it allows to use of big data sizes because the Hadoop clusters can residence data sets in petabytes. Apache Spark is a versatile data analysis model that can be executed on both single nodes and distributed nodes, similar to Hadoop. One key advantages is its in-memory computation capabilities, which significantly enhance data processing speed. Moreover, it can interact with Hadoop data storage since it operates seamlessly atop the existing Hadoop node. Apache Spark is a high-performance framework for analysing large datasets. Apache Spark consists of a driver program (SparkContext), workers also called executors, a cluster manager, and the HDFS. The driver program is the main program of Spark. SparkContext is the object that gets created through the performance of the spark program and is in charge for the entire execution of the job. The SparkContext interfaces with the cluster manager to allocate and manage resources across the cluster. The cluster manager assigns Executors to execute the logic and store application data. Each application gets its processes for the period of the whole application run tasks in multiple threads and must be network addressable from worker nodes.

4.2 Feed Behaviour Prediction Model

The analysis and processing of big data necessitate intricate structures and advanced methods to excerpt valued insights from the vast quantity of information. The visualization of this data in real-time is essential to effectively utilize the semantics and classifications utilized in the processing algorithms. Is essential for ensuring optimal health and efficiency of dairy cows. By monitoring feeding behaviour in real-time, farmers can quickly identify any changes or abnormalities that may indicate health issues or management challenges. The machine learning and feature extraction enhancement of the model with the ResNet backbone network based on the YOLOv51 algorithm using multiple feature scales and the cross-connected semantic feature extraction module structure to enrich the scale semantic feature interactions, for classifying and improving coalition of accumulated information-based semantics and constraints. Finally achieving the recognition of cow feeding behaviour in the farm feeding environment.

4.2.1 ResNet Backbone Network-Based Yolov51 Algorithm for Multiple Feature Scales

The groundbreaking YOLOv51 algorithm is a cutting-edge innovation in the dominion of feed behaviour prediction models. Built upon the robust ResNet backbone network, this algorithm associates the strengths of deep learning and feature extraction to revolutionize how to understand and predict the feeding behaviours of dairy cows. YOLOv51 arrangement a fresh standard in precision and efficiency, offering invaluable insights for optimized dairy farming practices. Explore alternative architectures with enhanced multi-scale feature extraction capabilities by replacing the conventional group of 3×3 filters with smaller sifter groups. Attach these filter groups in a hierarchical residual-like style to maintain a similar computational load while achieving superior feature extraction. The proposed neural network module, named ResNet, incorporates residual-like connections within a single residual block. The variances between the bottleneck block and the proposed ResNet module lie in the implementation of these connections. The proposed neural network module, named ResNet, incorporates residual-like connections within a single residual block. The variances between the bottleneck block and the proposed ResNet module lie in the implementation of these connections. After the 1×1 convolution, evenly split the feature maps into $s$ feature map subsets, denoted by ${x}_{i}$ where$i\in \left\{\text{1,2},\dots .,s\right\}$. Each feature subset ${x}_{i}$has the same spatial size but $1/s$ number of channels compared with the input feature map. Except for ${x}_{1}$, each ${x}_{i}$ has a corresponding $3 \times 3$ convolution, denoted by ${K}_{i}\left({x}_{i}\right)$. Denote by ${y}_{i}$ the output of ${K}_{i}\left({x}_{i}\right)$. The feature subset ${x}_{i}$ is additional with the output of ${K}_{i-1}\left(\right)$, and then fed into ${K}_{i}\left({x}_{i}\right)$. To decrease parameters while increasing $s$, omit the 3 × 3 convolution for ${x}_{1}$. Thus, ${y}_{i}$ can be written as:

$${y}_{i}=\left\{\begin{array}{cc}{x}_{i}& i=1\\ \begin{array}{c}{K}_{i}\left({x}_{i}\right)\\ {K}_{i}({x}_{i}+{y}_{i}-1)\end{array}& \begin{array}{c}i=2\\ 2<i\le s\end{array}\end{array}\right.$$

Each $3\times 3$ convolutional operator ${K}_{i}\left({x}_{i}\right)$ could potentially collect feature information from all feature splits $\left\{{x}_{i,j}\le i\right\}$. Each time a feature split ${x}_{j}$ goes through a 3 × 3 convolutional operator, the output consequence can have a longer receptive field than ${x}_{j}$. The ResNet module generates outputs with varying receptive field sizes and combinations due to the combinatorial blast effect. This multi-scale processing allows for the removal of both local and global information within the splits of the network. To improve the fusion of information across different scales, concatenate all splits and pass them through a $1\times 1$ convolution. This concatenation and split strategy enhances the effectiveness of feature processing through convolutions. To trim the parameter, count, and eliminate the convolution operation for the primary split, which can also be professed as a form of feature reprocessing. The YOLOv51 algorithm represents an advancement in object detection techniques by introducing the utilization of multiple feature scales and the incorporation of a cross-connected semantic feature extraction module structure. This advanced approach improves the model's competence to detect objects of varying sizes and complexities in real-world scenarios.

The inclusion of multiple feature scales allows YOLOv51 to process images with different resolutions and aspect ratios more effectively. This is crucial in object detection tasks where objects may appear at various scales within the same image. By incorporating feature pyramids or feature maps at different scales, the algorithm ensures that both small and large objects can be accurately detected. This approach significantly improves the model's versatility and robustness, making it fit for a varied range of applications, from pedestrian detection in surveillance to identifying various objects in autonomous driving scenarios. Furthermore, the cross-connected semantic feature extraction module structure enriches the scale semantic feature interactions. This inventive module facilitates the model to capture not only spatial information but also semantic relationships between objects at different scales. By fusing information from different feature scales in a meaningful way, YOLOv51 can better understand the context of objects in the scene, improving objective localization and classification accuracy. This is particularly valuable in complex scenes where objects of interest may be partially occluded or appear at varying distances from the camera. YOLOv51's incorporation of multiple feature scales and the cross-connected semantic feature extraction module structure represents a significant step forward in object detection algorithms. The ResNet backbone network according to the YOLOv51 algorithm incorporates cross-connected semantic feature extraction modules to enhance interactions between semantic features at multiple scales.

4.2.2 Cross-Connected Semantic Feature Extraction Module

The cross-connected (CSFEM) enhances the classification performance by extracting high-level context cross-connected semantic features and utilizing spectral-spatial shuffle attention features from the encoder phase. This allows for better guidance of the spectral-spatial frequency attention features, suppression of noisy boundaries, and restoration of category boundaries. Additionally, the CSFEM strengthens the classification performance by fully exploiting the diverse spectral-spatial features. Employ global average pooling and global max pooling to produce two different spectral-spatial descriptors, which are denoted by ${F}_{gap}$and ${F}_{gmp}$. The aggregation of two descriptors through element-wise summation allows for the study of global context features, leading to more refined features. To make the DSFNet model more manageable and improve its ability to generalize, a gating module with two fully connected layers (FCs) and one ReLU activation function has been added. After the sigmoid operation, Obtain the global context attention features $p\in {R}^{1\times 1\times C}$. The equation of $C$ can be provided as follows:

$$p=\sigma \left({W}_{1}\left(\delta \left({W}_{0}\left({F}_{gap}+{F}_{gmp}\right)\right)\right)\right)$$

Where, ${W}_{0}\in {R}^{1\times 1\times \left(C/4\right)}$ and ${W}_{1}\in {R}^{1\times 1\times C}$ signify convolutional kernels of FCs. $\sigma$ refers to the sigmoid function.$\delta$ denotes the ReLU activation function. Subsequently, $3\times 3$ convolution is achieved on the low-level feature to obtain $T\in {R}^{H\times W\times C}$. Next, matrix increase is performed between $P$ and $T$ to acquire $L\in {R}^{H\times W\times C}$. In conclusion, an elementwise summation is used between the high-level feature and $L$ to succeed in the final output. The cross-connected semantic feature extraction module structure enriches the scale semantic feature interactions, for classifying and improving a coalition of accumulated information-based semantics and constraints. It is essential for dairy farms to regularly assess and monitor ketosis risk to effectively manage the dairy cows' health.

4.3 Ketosis and Mastitis Risk Management

Assessing and managing risk in the source chain is complex because it requires consideration of all stages, from production to distribution, and utilization of data generated at each step. Ketosis and Mastitis are the most significant metabolic diseases that can impact dairy herds, exceeding ruminal acidosis and milk fever in importance. However, the collection and detection of these indicators are complex and may lead to stress reactions in cows. Overall, this study demonstrates the potential of machine learning models in predicting health risks in highlights and dairy cows and the importance of utilizing advanced technology to improve animal welfare and productivity in the agriculture industry. In this work, the study applies the BOA to the process of Stacking ensemble construction to generate domain-specific configurations based on non-invasive prenatal indicators of parity, dystocia score, body condition score, daily rumination time, daily activity, eating time, drinking time, season of calving, drinking gulps, bolus, and chews per minute.

4.3.1 Cascade Feedforward Artificial Neural Network

Predicting the ketosis jeopardy in dairy cows is a critical endeavour for efficient herd management. Machine learning models, such as the CFANN, offer a powerful solution to this challenge. CFANN's cascading architecture allows it to extract complex patterns from diverse data sources, including cow behaviour, nutritional intake, and health history. By training on historical data and non-invasive indicators, CFANN can learn to detect early signs of ketosis, enabling dairy farmers and veterinarians to take timely preventive measures. This predictive model enhances dairy farm efficiency, reduces health risks for cows, and ultimately contributes to enhanced animal welfare and milk production. A common type of feed-forward ANN consists of a layer of inputs, a layer of output neurons, and one or more hidden layers of neurons. The cascade type of feed-forward ANN includes input, output, and hidden layers, with weights connected from the input to the first layer. Each subsequent layer in all of the four cascade feed-forward ANNs weights is derived from the input layer and all previous layers. Additionally, biases are included in each layer. The last layer of each ANN represents the network output. Initialization of weights and biases is necessary for all layers in each ANN. These four ANNs are used to estimate four suggested core parameters. A supervised training method is used to train considered cascade feed-forward ANNs. A net input $\left({V}_{j}\right)$to a neuron in a hidden layer $\partial$ is calculated by this formula Eq. (3).

$${V}_{j}=\sum _{i=1}^{n}{W}_{ji}{\theta }_{i}+{\theta }_{j}$$

Where $n$ is the number of $k-1$ layer neurons for a common type of feed-forward ANNs and the number of all of the earlier layer neurons for a cascade type of feed-forward ANNs. Weights are noted by ${W}_{ji}$; and the threshold offset by ${\theta }_{j}$. The output of the neuron ${O}_{j}$. is given by an activation function. It can provide dairy farmers and veterinarians with valuable insights into individual cow health, enabling timely intervention and tailored dietary adjustments to mitigate the risk of ketosis. Ultimately, CFANN plays a crucial role in modern dairy management by improving cow welfare and optimising milk production through proactive risk assessment and prevention. The study applies the (BOA) to the Stacking ensemble construction method to produce domain-specific configurations based on non-invasive prenatal indicators.

4.3.2 Butterfly Optimization Algorithm (BOA) With Stacking Ensemble Model

The BOA imitates the foraging and social habits of butterflies, which include using their senses to locate food, seeking out a mate, migrating between locations, and evading predators. BOA harnesses the power of fragrance as a key element of its physical intensity stimulus program based on Eq. (4)

$${f}_{i}={cI}^{a}$$

Where $c = 0.01$ is a sensory modality, $I$ is the stimulus intensity typically upper-lower bound, $a$ is the power proponent linearly updated from 0.1 to 0.2. The global candidate solution update is given by Eq. (5)

$${x}_{i}^{\left(t+1\right)}={X}_{i}\left(t\right)+\left({r}^{2}\times {X}_{best}-{X}_{i}\left(t\right)\right)\times {f}_{i}$$

Where, ${X}_{i}\left(t\right)$ is the solution vector ${x}_{i}$ for ${i}^{th}$ butterfly in iteration $t$. ${X}_{best}$represents the current best solution found among all the answers in the current iteration. ${f}_{i}$ signifies the fragrance of ${i}^{th}$ butterfly and $r$ is a random number between [0, 1]. Supplementing the global candidate solution update, BOA describes local candidate information as in Eq. (6)

$${x}_{i}^{\left(t+1\right)}={X}_{i}^{\left(t\right)}+\left({r}^{2}\times {x}_{i}^{\left(t+1\right)}\right)\times {f}_{i}$$

Where, ${X}_{j}\left(t\right)$ and ${X}_{k}\left(t\right)$are ${j}^{th}$ and ${k}^{th}$ butterflies from the solution space. Constructing a stacking ensemble model for predicting domain-specific configurations based on non-invasive prenatal indicators and various cow-related features involves a systematic method to harness the predictive power of multiple base models.

Table 1

Butterfly Optimization Algorithm (BOA) With Stacking Ensemble Model Algorithm
Algorithm: Stacking Ensemble with BOA Algorithm
Initialize the population of n Butterflies ${x}_{i}=(i=\text{1,2},\dots n)$ Define the objective function $f\left(x\right)$ Define $c, a$ and $p$ for BOA While stopping criteria are not met do // Stacking Ensemble Model// For each butterfly ${b}_{f}$ in population do Calculate the fragrance for ${b}_{f}$ using Eq. (4) end for Find the best butterfly (${b}_{{f}_{best}})$ Initialize a new population For each butterfly ${b}_{f}$ in the population do Generate a random number $r$ from [0, 1] If $r< p$ then Move towards the best butterfly (${b}_{{f}_{best}})$ using Eq. (5) else Move randomly using Eq. (6) end if Evaluate the objective function $f\left(x\right)$ for the new position if $f\left({new}_{position}\right)<f\left({b}_{f}\right)$ then Replace bf with ${new}_{position}$ in ${new}_{population}$ else Add ${b}_{f}$ to ${new}_{population}$ end if end for Update the population with ${new}_{population}$ Train the Stacking Ensemble Model on the current population end while return the best solution

These models are trained using historical data, where the features include non-invasive prenatal indicators (e.g., maternal health during pregnancy) and behavioural data (e.g., daily rumination time, activity levels, eating and drinking behaviour), as well as seasonal information. Each base model learns to predict specific configurations associated with the given domain, such as the likelihood of certain parity or body condition score categories. The class distribution vector over $c$ classes for the ${j}^{th}$classifier by a $1 x c$ vector as follows:

$${\varDelta }_{j}=\left[{\delta }_{{1}_{j}}{\delta }_{{2}_{j}}\dots {\delta }_{{c}_{j}}\right] 1\le j\le n$$

$$0\le {\delta }_{ij}\le 1\forall 1\le i\le c$$

$$\sum _{i}{\delta }_{ij}=1$$

The class distribution vectors for the $n$ classifiers can then be represented by an $n x c$ matrix as follows:

$$\varDelta ={\left[{\varDelta }_{1}{\varDelta }_{2}\dots {\varDelta }_{n}\right]}^{T}$$

The meta-classifier assigns different weighting to individual classifiers based on their relative importance. The weight distribution vector over $n$ classifiers is represented as follows:

$$\left[{\theta }_{1}{\theta }_{2}\dots {\theta }_{n}\right]$$

$$0\le {\theta }_{j}\le 1$$

$$\sum _{j}{\theta }_{j}=1$$

Given the class distribution matrix and the weight distribution vector, the meta-classifier calculates apiece instance of the test set by using the following $1 x c$ class distribution vector in Eq. (14)

$$\varDelta =\left[{\delta }_{1}^{{\prime }}{\delta }_{2}^{{\prime }}\dots {\delta }_{c}^{{\prime }}\right]$$

$${\delta }_{1}^{{\prime }}=\sum _{j}{\theta }_{i}{\delta }_{ij}$$

The stacking ensemble combines these base models' predictions to create a highly accurate and robust meta-classifier, which considers class distribution vectors for each classifier and weight distribution vectors for combining their outputs. This meta-classifier evaluates test instances using these weight distributions, ultimately providing precise predictions for domain-specific configurations. By incorporating a BOA-inspired fragrance-based movement and combining the strengths of multiple base models, this approach empowers dairy farmers and veterinarians to make knowledgeable decisions regarding individual cow health and tailored management practices. It contributes significantly to overall herd management, animal welfare, and the dairy industry's productivity while minimizing health risks for pregnant cows.

4.4 Non-Invasive Prenatal Indicators

Generating domain-specific configurations using non-invasive prenatal indicators and various cow-related features is essential for optimizing dairy herd management. These indicators, daily activity, including parity, dystocia score, body condition score, daily rumination time, seasonal factors and daily activity such as calving, eating, and drinking behaviour, offer valuable insights into cow health and behaviour. By systematically analysing and combining this data, dairy farmers and veterinarians can tailor their management practices to ensure the well-being and productivity of each cow. For instance, data-driven insights may reveal that cows with specific body condition scores or activity levels during certain seasons are more prone to certain health issues. This knowledge allows for the manufacture of domain-specific configurations that are categorised based on their cow risk levels and requirements. Implementing these configurations into dairy herd management practices. For example, cows identified as having a higher risk of health issues during certain seasons may receive targeted dietary supplements or additional monitoring. Those with specific prenatal indicators may require different calving procedures or postnatal care. By tailoring management practices to these domain-specific configurations, dairy farmers can optimize the health, welfare, and productivity of their herds, ultimately benefiting both the cows and the overall dairy operation.

In the dominion of modern dairy farming, the integration of advanced technologies has brought about a paradigm shift in data-driven decision-making. This experimentation and result discussion section explores the application of big data analytics, coupled with the YOLOv5 algorithm, from the perspective of data quality management and risk assessment within dairy farming. The simulation of this experiment is implemented using Python software. Through a meticulous examination of feed behaviour analysis, this work proposes to shed light on the value of these tools in improving the productivity and efficiency of dairy farming operations. The study collected various images from (https://dataverse.nl/dataset.xhtml?persistentId=doi:10.34894/7M108F) for risk and feed behaviours analysis purposes.

Figure 2 provides an assessment of a machine learning or classification model's presentation when relating its predictions to the actual ground truth within a training dataset. Notably, in Fig. 2(a) the model correctly identified and classified 17,418 cases as true positives, accurately recognizing instances as they confirmed their positive stance on the matter. In Fig. 2(b), the model observes the model's performance evaluation by comparing its predictions against the actual dataset. The model correctly predicted 4,262 instances as positive, aligning with the actual positive cases in the dataset, representing accurate positive predictions. Furthermore, the model misclassified 2,330 cases as undesirable when they were reframed, in reality, positive in the actual dataset. Finally, the model correctly identified 3,023 instances as negative, recognizing cases as negative in the actual dataset. The misunderstanding matrix serves as a crucial tool for evaluating the presentation of classification models, commonly employed in the domains of machine learning and statistics.

Figure 3 explores the relationship between feed intake and the predicted probability, highlighting specific values of interest. The highest value of predicted probability, 0.0477, signifies a significant likelihood of a particular outcome or event related to feed intake. This value holds particular importance within the dataset, suggesting a potential focus area for further investigation or decision-making. Moving to the next highest value, which is 0.0417, find another notable point in the data where the predicted probability remains relatively high. This value could indicate a different level of significance or a potential threshold for decision-making. On the lower end, observe a value of 0.0250, representing a lower predicted probability. This lower probability could indicate a different scenario or outcome related to feed intake. Furthermore, the dataset presents yet another lower value of 0.0230.

Figure 4 presents a comprehensive feature importance plot that showcases the percentage gain attributed to various features in a dataset. Each feature's importance is assessed concerning the outcome or variable of interest, which in this case appears to be related to factors affecting cows, such as Eating time, Daily activity time, Body condition score, Daily rumination time, Ketosis risk, Drinking gulps, Dystocia score, Bolus, Mastitis risk, Chews per minute, and the season of calving. The plot provides valuable insights into which features have the most significant influence on the outcome variable, as indicated by their gain percentages. A higher gain percentage implies that a feature plays a more crucial role in influencing the outcome, making it a top priority for further investigation or consideration. Analysing this material can be pivotal in decision-making processes, as it helps identify key factors that contribute to specific outcomes related to cows.

Figure 5 explores the relationship between eating time, a variable of interest, and the associated label or outcome. Scattering plots are commonly used to display the distribution and potential patterns or correlations between two variables. The eating time represents the duration or frequency of eating behaviour in a dataset, while the label could signify a specific classification or outcome related to this behaviour. By plotting these variables on the same graph, can visually assess whether there are any discernible trends, clusters, or outliers in the data. Analysing this scattering plot can offer insights into how eating time may relate to the label or outcome, aiding in the understanding of any potential associations or patterns.

Figure 6 illustrates the relationship between daily rumination time and a labelled variable of interest. Scatter plots are valuable visual tools for exploring the correlation or patterns between two variables. In this case, Daily Rumination Time is plotted on one axis, while the labelled variable is represented on the other axis. This scatter plot allows us to assess any potential trends, clusters, or associations between Daily Rumination Time and the labelled variable. By investigating the dispersal of data points, can gain insights into whether there is a discernible relationship between these two factors and whether daily rumination time has any predictive value for the labelled variable.

Figure 7 represents the relationship between drinking time, the expected variable of interest, and a corresponding label or outcome. This information is valuable for identifying potential associations or dependencies between drinking time and the label, which can have implications in different domains, such as agriculture, health monitoring, or behavioural analysis, depending on the environment of the label and its relevance to the drinking behaviour of interest.

Figure 8 visualized the relationship between daily activity time and a corresponding label or variable of interest. Scatter plots are effective for understanding how one variable might impact or relate to another. Daily activity time is likely a measure of some aspect of the subject under study, such as an animal's behaviour or health status. The plot provides a graphical demonstration of daily activity time that varies across different values of the label. By examining this scatter plot, researchers and analysts can gain insights into any patterns, trends, or correlations between daily activity time and the label.

Figure 9(a) the comparison between predicted and actual values of eating time showcases a highly accurate predictive model. The actual eating time, recorded at 3.0688, closely aligns with the predicted value of 4.93791, as indicated by a remarkably low RMSE and MAE, both measuring at 0.01. Moreover, the R-squared value of 1.00 signifies a perfect suitability of the model to the observed data, emphasizing the exceptional precision of the predictive algorithm. Figure 9(b) observes a remarkable agreement between the predicted and actual values of daily rumination time, as indicated by an impressively high R-squared value of 1.00. This near-perfect fit showcases the precision of the predictive model, suggesting that it accurately captures and reproduces the variations in rumination time. Furthermore, the RMSE of 0.23 and MAE of 0.19 reflect the small discrepancies between the actual and predicted values. Figure 12(c) shows the assessment between predicted and actual values of drinking time behaviour revealing intriguing insights into the predictive exactness of the model. The remarkable closeness between the actual and predicted values, with a negligible difference of 0.07 in RMSE and 0.06 in MAE, demonstrates the robustness of the predictive model in capturing the intricacies of drinking behaviour.

Figure 10 presents a visual representation of the cumulative percentage of declines for both sick and healthy cows, along with their respective Kolmogorov-Smirnov (KS) statistics. The KS statistic for sick cows is calculated at 0.00959, signifying a greater divergence in the distribution of declines for this group. On the other hand, healthy cows exhibit a lower KS statistic of 0.00586, suggesting a relatively closer resemblance in the decline distribution among them. The overall KS statistic for both groups stands at 0.00420, signifying that there is some overlap in the cumulative percentage of declines among the two groups, but they still exhibit differences in their distribution patterns. These findings are crucial for understanding and comparing the health standing of cows in the given dataset.

The Receiver Operating Characteristic (ROC) curve, depicted in Fig. 11, illustrates the presentation of the Ketosis jeopardy prediction model in terms of specificity versus Sensitivity. This graph provides valued insights into the model's ability to discriminate between individuals in jeopardy of ketosis and those who are not. The training data area under the curve (AUC) of 0.77 and the test data AUC of 0.75 point out the model's reasonable predictive power, with the training data showing slightly better discrimination. This proposes that the model exhibits good overall performance in identifying individuals susceptible to ketosis, while also demonstrating its robustness on unseen data, making it a promising tool for ketosis risk assessment and management.

Figure 12 presents the ROC analysis results for predicting ketosis and mastitis in a dataset. During training, the model achieved a reasonably good performance in distinguishing between ketosis-positive and ketosis-negative cases with an AUC of 0.72 for ketosis, indicating its potential in this aspect. However, when evaluated on a separate testing dataset, the model's presentation in predicting ketosis dropped to an AUC of 0.53, suggesting that it performed only slightly better than random chance during testing. Moreover, the model's performance in predicting mastitis during training was relatively poor, with an AUC of 0.19. This indicates that the model struggled to distinguish between mastitis-positive and mastitis-negative cases during the training phase. When assessed on a separate testing dataset for mastitis, the model's presentation improved slightly but remained close to random chance, with an AUC of 0.48. These findings highlight the need for substantial improvements in the model's ability to accurately detect both ketosis and mastitis in dairy cattle, especially during testing, to enhance its practical utility in real-world scenarios.

Figure 13 indicates the risk scores for ketosis and mastitis in a dairy group. The percentages represent the likelihood or severity of these two health issues. Ketosis has a risk score of 46%, while mastitis has a higher risk score of 51%. This suggests that there is a higher probability of mastitis occurring compared to ketosis in the herd, and both conditions require attention and management to maintain the health of the cows.

5.1 Comparison Analysis

The comparison analysis presented in this work proposes to provide a complete evaluation of the performance of various classification algorithms, including the Decision Tree algorithm, Support Vector Machine (SVM), and a proposed method. Through an examination of key metrics such as precision and accuracy, this analysis offers valuable perceptions of the strengths and weaknesses of each approach. By assessing their respective capabilities in addressing the research problem, can identify the most suitable algorithm for the given task, ultimately guiding informed decision-making and the advancement of the research objectives.

In this comparison of different techniques, the study evaluates their performance based on key metrics: precision, accuracy and recall. The "Multiple Machine Learning" approach achieved a respectable accuracy of 90.9%, with a precision of 96.7% and a recall of 87.6%. Moving to the "Efficient DenseNet" technique, we observe a notable improvement in accuracy, reaching 97.2%, with precision at 98.09% and an impressive recall of 99.28%. However, the "Proposed" technique stands out as the clear frontrunner, boasting an exceptional accuracy rate of 99.8%, coupled with a precision of 99.2% and a remarkable recall of 99.4%. These results highlight the superior performance of the suggested technique, which excels in accurately classifying cases, reducing false positives, and effectively identifying actual positive cases, making it a highly promising approach for the task under consideration.

In conclusion, this research has demonstrated the significant potential of leveraging Big Data Analytics, specifically employing the YOLOv5 algorithm, for data quality management and risk assessment in dairy farming. The objective of enhancing feed behaviour analysis has been effectively realized, leading to more informed decision-making and proactive risk mitigation strategies. The findings reveal that by binding the power of big data analytics, dairy farmers can make more informed decisions, optimize feeding schedules, and proactively identify and address risks, leading to healthier cows, increased milk production, and reduced operational costs. Furthermore, the YOLOv5 algorithm has proven effective in accurately detecting and analysing feed behaviour, providing valuable perceptions into the nutritional needs and well-being of dairy cattle. The simulation of this experiment is implemented using Python software. Through detailed analyses of feeding behaviour and risk level assessments, this research has provided valuable insights into improving feeding strategies on dairy farms. Farmers can now make informed conclusions regarding feed intake, optimizing nutrition for their cattle, and ultimately improving milk production and herd health. The combination of machine learning and data analysis techniques empowers dairy farmers with the ability to make informed, data-driven decisions with improved accuracy of 99.8%, precision of 99.2% and a remarkable recall of 99.4%. This shift from traditional approaches to more modern, evidence-based practices contributes to the sustainability and long-term success of dairy farming operations. This directly translates into practical benefits for dairy farmers, their operations, and the international food supply chain, financial stability, fostering sustainability, and enhanced animal welfare in the dairy farming sector.

Conflict of Interest

The writers have no battles of interest to disclose.

Data Availability Statement

Data sharing is not relevant to this article as no datasets were created or analyzed in this work.

Wang, L., Sun, H., GAO, H., Xia, Y., Zan, L. and Zhao, C.: A meta-analysis on the effects of probiotics on the performance of pre-weaning dairy calves. Journal of Animal Science and Biotechnology, 14(1), 3 (2023)
DeLay, N.D., Boehlje, M.D. and Ferrell, S.: The economics of property rights in digital farming data: Implications for farmland markets. Applied Economic Perspectives and Policy (2023)
Alshurideh, M., Al Kurdi, B.H., Alzoubi, H.M. and Salloum, S. eds.: The Effect of Information Technology on Business and Marketing Intelligence Systems. Springer Nature, 1056 (2023)
Tukamuhabwa, B.R.: Supply Chain Orientation and Supply Chain Risk Management Capabilities: Mechanisms for Supply Chain Performance of Agro-Food Processing Firms in Uganda. Journal of African Business, 1-24 (2023)
Theodorou, J.A. and Tzovenis, I.: A framework for risk analysis of the shellfish aquaculture: The case of the Mediterranean mussel farming in Greece. Aquaculture and Fisheries, 8(4), 375-384 (2023)
Yu, J.J., Hu, Y.L., Liu, C.Z., Wu, S.B., Zheng, Z.J., Cui, Z.H., Chen, L., Wei, T., Sun, S.K., Ning, J. and Wen, X.: ARSCP: An antimicrobial residue surveillance cloud platform for animal-derived foods. Science of the Total Environment, 858, 159807 (2023)
Tiwari, S., Sharma, P., Choi, T.M. and Lim, A.: Blockchain and third-party logistics for global supply chain operations: Stakeholders’ perspectives and decision roadmap. Transportation Research Part E: Logistics and Transportation Review, 170, 103012 (2023)
Adnan, K.M., Sarker, S.A., Tama, R.A.Z., Shan, T.B., Datta, T., Monshi, M.H., Hossain, M.S. and Akhi, K.: Catastrophic risk perceptions and the analysis of risk attitudes of Maize farming in Bangladesh. Journal of Agriculture and Food Research, 11, 100471 (2023)
Pearce, S.D., Parmley, E.J., Winder, C.B., Sargeant, J.M., Prashad, M., Ringelberg, M., Felker, M. and Kelton, D.F.: Evaluating the efficacy of internal teat sealants at dry-off for the prevention of new intra-mammary infections during the dry-period or clinical mastitis during early lactation in dairy cows: A systematic review update and sequential meta-analysis. Preventive Veterinary Medicine, 105841 (2023)
Hernandez, M.C., Alvarez, A.N.R. and Anguiano, F.I.S.: Project management and supply chain 4.0 improvement: the case of infant formulas in the face of the challenge of COVID-19. Procedia Computer Science, 217, 278-285 (2023)
Aiassa, E., Mosbach-Schulz, O., Canali, E. and Authority, S.: Risk assessment for beef cattle at slaughter: a method for performing a risk assessment based on empirical (2011)
Thangamayan, S., Pradhan, K., Loganathan, G.B., Sitender, S., Sivamani, S. and Tesema, M.: Blockchain-Based Secure Traceable Scheme for Food Supply Chain. Journal of Food Quality (2023)
Ncibi, K., Hamed, Y., Hadji, R., Busico, G., Benmarce, K., Missaoui, R. and Wederni, K.: Hydrogeochemical characteristics and health risk assessment of potentially toxic elements in groundwater and their relationship with the ecosystem: case study in Tunisia. Environmental Science and Pollution Research, 1-18 (2023)
Ida, J.A., Wilson, W.M., Nydam, D.V., Gerlach, S.C., Kastelic, J.P., Russell, E.R., McCubbin, K.D., Adams, C.L. and Barkema, H.W.: Contextualized understandings of dairy farmers' perspectives on antimicrobial use and regulation in Alberta, Canada. Journal of Dairy Science, 106(1), 547-564 (2023)
Fernandes, S., Pereira, G. and Bexiga, R.: Bimodal milk flow and overmilking in dairy cattle: risk factors and consequences. Animal 100716 (2023)
El Baz, J., Cherrafi, A., Benabdellah, A.C., Zekhnini, K., Beka Be Nguema, J.N. and Derrouiche, R.: (2023) Environmental Supply Chain Risk Management for Industry 4.0: A Data Mining Framework and Research Agenda. Systems, 11(1), 46.
Bellato, A., Tondo, A., Dellepiane, L., Dondo, A., Mannelli, A. and Bergagna, S.: Estimates of dairy herd health indicators of mastitis, ketosis, inter-calving interval, and fresh cow replacement in the Piedmont region, Italy. Preventive Veterinary Medicine 105834 (2023)
Roussaki, I., Doolin, K., Skarmeta, A., Routis, G., Lopez-Morales, J.A., Claffey, E., Mora, M. and Martinez, J.A.: Building an interoperable space for smart agriculture. Digital Communications and Networks 9(1), 183-193 (2023)
Luna, M., Llorente, I. and Luna, L.: A conceptual framework for risk management in aquaculture. Marine Policy, 147, 105377 (2023)
Gruber, S., Rienesl, L., Köck, A., Egger-Danner, C. and Sölkner, J.: Importance of Mid-Infrared Spectra Regions for the Prediction of Mastitis and Ketosis in Dairy Cows. Animals, 13(7), 1193 (2023)
Kandhro, F., Kazi, T.G., Afridi, H.I., Baig, J.A., Lashari, A.A. and Lashari, A.: Determination of toxic elemental levels in whey milk of different cattle and human using an innovative digestion method: risk assessment for children< 6.0 months to 5 years. Environmental Science and Pollution Research, 1-14 (2023)
Feyissa, A.A., Senbeta, F., Tolera, A. and Guta, D.D.: Unlocking the potential of smallholder dairy farm: Evidence from the central highland of Ethiopia. Journal of Agriculture and Food Research 11, 100467 (2023)
Castillo Rodríguez, C., Sotillo Mesanza, J., Muiño Otero, R., Benedito Castellote, J.L., Gutiérrez Montes, A.M., Arana Sánchez, R., Matas Quintanilla, M. and Gutiérrez Panizo, C.: Is adenosine deaminase (ADA) activity in saliva and serum a more accurate disease detection tool than traditional redox balance parameters in early-lactating dairy cows (2023)
Bradfield, T., Butler, R., Dillon, E.J., Hennessy, T. and Loughrey, J.: The impact of long-term land leases on farm investment: Evidence from the Irish dairy sector. Land Use Policy 126, 106553 (2023)
Eshete, T., Demisse, T., Yilma, T. and Tamir, B.: Repeat Breeding and Its’ Associated Risk Factors in Crossbred Dairy Cattle in Northern Central Highlands of Ethiopia. Veterinary Medicine International (2023)
Shiva Shankar, R., MNSSVKR, V., Gupta, Priyadarshini, V., Neelima, P.: P-S protocol to detect fire in forest and fire alert system using sensors, 3rd International Conference on Innovations in Communication Computing and Sciences: ICCS-2021 AIP Conf. Proc. 2576, (2021) 020002-1–020002-11; https://doi.org/10.1063/5.0105702
Reddy Shiva Shankar, Pilli Neelima, Voosala Priyadarshini, Swaroop Ravi Chigurupati.: An approach to classify distraction driver detection system by using mining techniques, Indonesian Journal of Electrical Engineering and Computer Science 27(3), 1670-1680, (September 2022) DOI: 10.11591/ijeecs. v27.i3
Shiva Shankar, R., Priyadarshini, V., Neelima, P., Raminaidu, C.H.: Analyzing Attrition and Performance of an Employee using Machine Learning Techniques, 2021 5th International Conference on Electronics, Communication and Aerospace Technology (ICECA) (2021) DOI: 10.1109/ICECA52323.2021.9676102
Reddy Shiva Shankar, Neelima, P., Priyadarshini, V., and Murthy, K. V. S. S. R.: Comprehensive Analysis to Predict Hepatic Disease by Using Machine Learning Models, Mobile Computing and Sustainable Informatics, Lecture Notes on Data Engineering and Communications Technologies 126 (2022) https://doi.org/10.1007/978-981-19-2069-1_33
Shiva Shankar, R., Srinivas, L.V., Sivarama Raju, V.V., Murthy, KVSS.: A Comprehensive Analysis of Deep Learning Techniques for Recognition of Flower Species, Proceedings of the Third International Conference on Intelligent Communication Technologies and Virtual Mobile Networks (ICICV 2021). IEEE Xplore Part Number: CFP21ONG-ART; 978-0-7381-1183-4, 10.1109/ICICV50876.2021.9388503
Rajeswarappa, G., Vasundra, S.: Red Deer and Simulation Annealing Optimization Algorithm-Based Energy Efficient Clustering Protocol for Improved Lifetime Expectancy in Wireless Sensor Networks. Wireless Pers Commun 121, 2029–2056 (2021). https://doi.org/10.1007/s11277-021-08808-2
Lokavarapu V. Srinivas, Chitri Raminaidu, Devareddi Ravibabu, Shiva Shankar Reddy, A framework to recognize the sign language system for deaf and dumb using mining techniques, Indonesian Journal of Electrical Engineering and Computer Science 29(2) 1006~1016 (February 2023), ISSN: 2502-4752, DOI: 10.11591/ijeecs.v29.i2.pp1006-1016
Shiva Shankar, R., Raminaidu, CH., Ravibabu D., and Gupta, VMNSSVR.: A Survey to Raise the Awareness of Road Accidents Due to NotWearing Helmet, International Journal of Industrial Engineering & Production Research 31(3) 367-377 (September 2020) DOI: 10.22068/ijiepr.31.3.367
Shiva Shankar, R., Deshai, N., Murthy, K.V.S.S., Gupta, VMNSSVKR.: The Source of Growing Knowledge by Cognitive Artificial Intelligence, 2019 IEEE International Conference on System, Computation, Automation and Networking (ICSCAN), DOI: 10.1109/ICSCAN.2019.8878732
Deshai, N., Shiva Shankar, R., Sravani, K., Ravibabu, D.: A Developed Task Allotments Policy for Apache Hadoop Executing in the Public Clouds, 2019 IEEE International Conference on System, Computation, Automation and Networking (ICSCAN), DOI: 10.1109/ICSCAN.2019.8878857
Ravi Babu Devareddi, R., Shiva Shankar, K., Murthy, VSSR., Ch. Raminaidu.: Image segmentation based on scanned document and hand script counterfeit detection using neural network , 3rd International Conference on Innovations in Communication Computing and Sciences: ICCS-2021 AIP Conf. Proc. 2576, 050001-1–050001-11; https://doi.org/10.1063/5.0105808 Published by AIP Publishing. 978-0-7354-4253-5/$30.00

No competing interests reported.

Download PDF

Version 1

posted

You are reading this latest preprint version

Data Quality Management and Risk Assessment of Dairy Farming with Feed Behaviour Analysis Using Big Data Analytics with YOLOv51 Algorithm

Status:

Version 1

Abstract

Figures

1. INTRODUCTION

2. LITERATURE SURVEY

3. RESEARCH PROBLEM DEFINITION AND MOTIVATION

4. PROPOSED RESEARCH METHODOLOGY

4.1 Big Data Analytics

4.1.1 Apache Spark Hadoop Distributed File System (HDFS)

4.2 Feed Behaviour Prediction Model

4.2.1 ResNet Backbone Network-Based Yolov51 Algorithm for Multiple Feature Scales

4.2.2 Cross-Connected Semantic Feature Extraction Module

4.3 Ketosis and Mastitis Risk Management

4.3.1 Cascade Feedforward Artificial Neural Network

4.3.2 Butterfly Optimization Algorithm (BOA) With Stacking Ensemble Model

4.4 Non-Invasive Prenatal Indicators

5. EXPERIMENTATION AND RESULTS DISCUSSION

5.1 Comparison Analysis

6. RESEARCH CONCLUSION

Declarations

References

Additional Declarations

Status:

Version 1