Predicting nitrogen use efficiency, nitrogen loss and dry matter intake of individual dairy cows in late lactation by including mid-infrared spectra of milk samples

doi:10.21203/rs.3.rs-1722648/v1

Download PDF

Research Article

Predicting nitrogen use efficiency, nitrogen loss and dry matter intake of individual dairy cows in late lactation by including mid-infrared spectra of milk samples

https://doi.org/10.21203/rs.3.rs-1722648/v1

This work is licensed under a CC BY 4.0 License

Journal Publication

published 10 Jan, 2023

Read the published version in Journal of Animal Science and Biotechnology →

You are reading this latest preprint version

Background

Nitrate leaching to groundwater and surface water and ammonia volatilization from dairy farms have negative impacts on the environment. Meanwhile, the increasing demand for dairy products will result in more pollution if N losses are not controlled. Therefore, a more efficient, and environmentally friendly production system is needed, in which nitrogen use efficiency (NUE) of dairy cows plays a key role. To genetically improve NUE, extensively recorded and cost-effective proxies are essential, which can be obtained by including mid-infrared (MIR) spectra of milk in prediction models for NUE. This study aimed to develop and validate the best prediction model of NUE, nitrogen loss (NL) and dry matter intake (DMI) for individual dairy cows in China.

Results

A total of 86 lactating Chinese Holstein cows were used in this study. After data editing, 704 records were obtained for calibration and validation. Six prediction models with three different machine learning algorithms and three kinds of pre-processed MIR spectra were developed for each trait. Results showed that the coefficient of determination (R²) of the best model in within-herd validation was 0.66 for NUE, 0.58 for NL and 0.63 for DMI. For external validation, reasonable prediction results were only observed for NUE, with R² ranging from 0.58 to 0.63, while the R² of the other two traits was below 0.50. The infrared waves from 973.54 to 988.46 cm^− 1 and daily milk yield were the most important variables for prediction.

Conclusion

This study developed reasonable prediction models for N related traits of Chinese Holstein cows. Models with MIR spectra outperformed the other models indicating that MIR spectra have additional value for predicting these target traits. Pre-processing MIR spectra by mathematical algorithms may fail to improve prediction accuracies. The most important wave ranges of MIR (973.54 to 988.46 cm^− 1) for prediction are associated with particular chemical compounds, which can be useful to unravel the biological mechanisms of nitrogen utilization in dairy cows. These models will be applied to large-scale data to further investigate the genetic architecture of nitrogen efficiency and further reduce the adverse impacts on the environment after more data is collected.

nitrogen use efficiency

mid-infrared spectra

environment

prediction

dairy cow

The utilization of nutrients is not considered sustainable enough in the dairy production systems of China [1, 2]. Ammonia emissions and nitrate leaching to groundwater and surface water lead to adverse impacts on the surrounding environment of farms. Previous research indicated that average NUE values in China were 16% at the dairy cow level [3], which is relatively low compared with what is potentially possible (30–40%) [4, 5]. Meanwhile, China does not produce enough dairy products to be self-sufficient. In 2019, the inventory of milking cows in China reached 5.7 million heads, which is more than half of the US (9.3 million heads), while the average production per cow (5,600 kg/head/year) was about only 53% of the US [6]. Besides, the milk self-sufficiency of China decreased during the last decade, while the quantity of imported milk reached a new peak (0.8 million tons) in 2019 [6]. The increasing demand for animal products is expected to result in higher production levels. This development is expected to result in more intensive dairy farming with higher total emissions of nitrogen (N) when N losses are not controlled. Therefore, a more efficient, and environmentally friendly production system is needed, in which NUE of dairy cows plays a key role. Among all the potential strategies to improve the efficiency of cows, genetic improvement is cumulative and permanent, whereas other improvements, such as better feeding, mostly require sustained efforts and inputs. If efficient cows are selected, the fraction of intake that is ending up in faeces or urine will be lower, which will contribute to lower N losses to the environment.

Generally, the NUE is difficult to measure for individual cows (Lopez-Villalobos et al., 2018). To calculate individual NUEs, daily feed intake (N intake) is required, which is costly for regular assessment. To genetically improve NUE, routinely recording and cost-effective proxies are essential to initiate genetic evaluations. Fourier-transform mid-infrared (MIR) spectra played a significant role in the phenotyping of milk composition. Applications are traits related to the nutritional value of milk and the processability of milk into products such as cheese. Some of these traits, such as milk fat percentage and protein percentage, are used in the milk payment systems to farmers and therefore used in genetic evaluations as well to increase fat and protein content by genetic improvement. Other more novel applications of MIR are traits related to animal health, reproductive status and the environment [8, 9], as well as the heat production of animals [10]. Recently, Grelet et al. [11] obtained reasonable proxies for N related traits such as NUE, NL and DMI by including MIR spectra of dairy cows in their prediction models. A maximum coefficient of determination (R²) of 0.82 was observed in the within-herd validation of their report, which indicated the proxies were promising for further genetic analysis. Chen et al. [12] further applied the same model to a large dataset and estimated the genetic parameters of predicted NUE and NL, indicating the possibility of genetic improvement of N related traits.

In studies regarding prediction questions, many researchers [13, 14] have addressed the so-called dimensionality problem, where many input variables for prediction models but few samples are available. This issue is more likely to show up when spectroscopy data (e.g., MIR spectra) are used to predict traits with few records (e.g. feeding data). Meanwhile, including more variables in the prediction models may increase the risk of including noise (noninformative variables), which potentially will reduce the predictive ability. Therefore, it was suggested to reduce the data dimensionality by selecting most relevant variables from the original space to obtain more robust prediction models [15]. Meanwhile, the contribution of those relevant variables will help the interpretation of prediction results from a biological perspective.

To our knowledge, published prediction models were only based on records from Holstein cows in early lactation. Individual NUE and NL in other lactation stages have not yet been predicted with MIR data. Additionally, these models have not yet been generalized sufficiently to be used in a totally different population with different diets and rearing conditions [11]. Hence, developing prediction models for Chinese Holstein cows is necessary. Literature also indicated that non-informative signals (such as high-frequency noise and baseline shift) may exist in original MIR data, which will decrease the relation between phenotypes and MIR spectra [15]. Pre-processed MIR data may be beneficial for constructing robust prediction models. Therefore, the objective of this study was to develop and validate the best prediction model of NUE, NL and DMI for individual dairy cows in China. Subobjectives included: (1) to compare different prediction models and machine learning algorithms within each trait; (2) to compare different pre-processing methods of MIR spectra; (3) to identify the most important input variables.

Animals

The two trials used in the current study were conducted in one Holstein dairy farm of the Sunlon Livestock Development Co. Ltd. in Beijing, China (39.6˚ N, 116.2˚ E). All the experimental animals were kept in the same ventilated barn with a free-stall design and were milked 3 times/d at 07:00h, 14:00h, and 21:00h in milking parlours. Cows were in mid and late lactation stage, with days in milk (DIM) ranging from 154 to 452 and parities ranging from 1 to 4. The total mixed ration (TMR) was offered 3 times a day, and the animals had ad libitum access to TMR and water.

Feeding trials and diet analysis

The first feeding trial (T1) was conducted from spring to autumn in 2017, in which a total of 56 Chinese Holstein cows were divided in 4 subgroups and offered different diets [16]. This experiment was designed to evaluate the feed efficiency of cows by adding different levels of yeast culture (Table 1). The second feeding trial (T2) was conducted in the winter of 2019, in which a total of 30 Chinese Holstein cows were randomly divided in 3 subgroups and offered different diets [17]. This experiment was designed to evaluate the milking performance, feed intake and rumination by offering different levels of roughage (Table 1).

Table 1

Description of diets used in this study.
Trial	Description	Diet components¹	Animals
T1	4 subgroups with small amounts of a yeast culture (different levels), which do not affect the ratio of different components in the ration: subgroup 1 includes no yeast culture, subgroup 2 includes 1%² of yeast culture A, subgroup 3 includes 2% of yeast culture A, and subgroup 4 includes 1% of yeast culture B	DM: 58.8%, CP: 17.0% C:R = 56:44	56
T2	3 subgroups with different levels of roughage offered	group 1: DM: 61.5%, CP: 17.0% C:R = 61:39	30
		group 2: DM: 55.7%, CP: 17.0% C:R = 59:41
		group 3: DM: 51.0%, CP: 17.2% C:R = 56:44
Regular	Regular diet offered in the experimental farm	DM: 56.8%, CP: 16.8% C:R = 55:45	-
¹ DM = dry matter, CP = crude protein, C:R = ratio of concentrate to roughage on a dry matter basis. The main roughages for all the diets were maize silage and alfalfa, and the concentrates were mainly constituted by maize and soyabean meal.
² 1% indicates that the weight of added yeast culture is equal to the 1% DM of concentrates of subgroup 1. The diets in 4 subgroups of T1 were adjusted to keep the DM, CP, and C:R consistent.

Daily feed intake of individual cows was recorded by an automatic system (Roughage Intake Control System, Insentec B.V., Marknesse, the Netherlands). Samples of each diet were dried in an oven for 48 hours at 65 ℃ once per two weeks for the determination of dry matter content and nutrient composition. Daily DMI was calculated for each cow based on fresh matter intake and dry matter content of the diet. Afterwards, a 3-day moving average of DMI (DMI_a) was calculated for all cows to avoid biased measurements. Individual N intake was crude protein/ 6.25 [18]. Additionally, each cow was evaluated monthly for body condition score (BCS, 1 ~ 5 scale) by two technicians, and days in pregnancy (DIP) was calculated based on DIM and the last insemination date.

Milk analysis and MIR spectra

Daily milk yield (MY) for each cow was recorded by the milking system. Individual milk samples were tested at the Beijing Dairy Cattle Centre, and MIR spectra were obtained from Fourier transform spectrometer (Bentley Instruments Inc., Chaska, USA). Fat, lactose, total protein content and milk urea nitrogen (MUN) of milk samples were also derived from MIR analysis. Daily N output in milk was calculated based on daily protein output in milk divided by 6.38 [19].

Data editing

Individual daily NUE was defined as ratio of total N output in milk to total N intake from feed, and NL was defined as total N intake from feed minus total N output in milk [5]. Records with DMI_a below 5 kg/d were treated as outliers and discarded. Parities were divided in two groups (primiparous and multiparous cows), and DIMs were clustered into groups every 5 days (DIM_g). In addition, quality control criteria were applied to milk information data: MY (5 to 80 kg/d), protein percentage (2.5 to 5.0%), fat percentage (3.0 to 5.0%) and MUN (5 to 20 mg/dl). Thereafter, feeding trial data, milk information data, and MIR spectra were merged together, providing 600 records for T1 and 104 records for T2.

Pre-processing spectral data is a common strategy that helps to mitigate undesirable signals in the raw data, maximizing the relationship between the infrared spectrum and the target phenotype [15, 20]. In the present study, two pre-processing methods were applied to the original MIR data of each trial to reduce the influence of noise in the MIR spectra [15]. One method was used to remove noninformative effects of spectra by correcting differences in the baseline and the trend, i.e., multiplicative scatter correction (MSC). The other was used to remove the multiplicative effects of scatter and particle size, giving the sample a unit standard deviation, i.e., standard normal variate (SNV). Subsequently, wavenumbers induced by water and other noise were omitted, resulting in 215 wavenumbers for each record, from 968.1 to 1,577.5 cm^− 1, 1,731.8 to 1,762.6 cm^− 1, 1,781.9 to 1,808.9 cm^− 1, and 2,831.0 to 2,966.0 cm^− 1 [11, 21].

Model development

Data that passed editing steps were used to develop models predicting NUE, NL and DMI_a. Six model equations were developed for each trait in this study (Table 2). Model 1 included MIR spectra only. This model was included to test whether the information in MIR spectra only was sufficient to perform accurate prediction. Model 2 included MIR spectra, MY and parity, which was reported as the optimal model in previous studies [11, 12]. This model was therefore used as a reference model. Model 3 additionally included monthly BCS to investigate the potentially valuable information provided by body condition, due to its close relation with metabolic status. DIM_g was added in Model 4 to account for the possible impacts of lactation status. Similarly, DIP was further added in Model 5 to check whether pregnancy status affected the prediction. Model 6 only included non-MIR predictors to evaluate the additional value of MIR spectra when comparing results of model 1–5 with model 6.

MIR spectra were included in prediction either as original spectra or after pre-processing (MSC-spectra or SNV-spectra). Furthermore, three different machine learning algorithms in scikit-learn [22] were applied for prediction: partial least squares (PLS), ridge regression (RR), and support vector machine (SVM) regression. For PLS, the number of latent variables (LV) was selected based on the inspection of the root mean square prediction error (RMSPE), where including a new LV did not reduce the RMSPE. The RR and SVM algorithms were used in default settings [22], and SVM was used after a PLS compression reducing the dimension of input variables to 7 (optimal number of LVs for most models in this study). Consequently, a total of 48 models were used for predicting each trait (Table 2). All input variables were adjusted to the same scale before model development as required in machine learning algorithms.

Table 2

Prediction models for nitrogen use efficiency, nitrogen loss and dry matter intake.
Models	Predictors¹	Number of input variables	Spectra	Algorithms	Count²
Model 1	MIR	215	Original, MSC, SNV	PLS, RR, SVM	9
Model 2	+ MY, parity	217			9
Model 3	+ MY, parity, BCS	218			9
Model 4	+ MY, parity, BCS, DIM_g	219			9
Model 5	+ MY, parity, BCS, DIM_g, DIP	220			9
Model 6	MY, parity, BCS, DIM_g, DIP, Protein, fat, lactose, MUN (excluding MIR)	9	No MIR		3
MIR = mid-infrared, MY = milk yield, BCS = body condition score, DIM_g = days in milk grouped by 5 days, DIP = days in pregnancy, MUN = milk urea nitrogen, MSC = multiplicative scatter correction, SNV = standard normal variate, PLS = partial least squares, RR = ridge regression, SVM = support vector machine.
¹ For Model 2 to 5, the additional predictor of next model is based on Model 1, and Model 6 includes all additional predictors, except for MIR spectra.
² Number of prediction models developed using this set of predictors: 3 algorithms times 3 types of MIR spectra for models 1–5 = 9 models.

Validation

Prediction of N related traits is significantly affected by the diet [11, 23]. In this study, the different proportions of roughage in T2 affected the digestibility of diets. The diets in T1 were relatively similar to regular diets of farms in Beijing (Table 1). Therefore, developing models only with records of T1 is more likely to cover the practical situation on the dairy farm, and to obtain accurate prediction values when the models are applied to a large dataset of these farms. Consequently, in the current research, records of T1 were used to develop prediction models and conduct within-herd validation, while records of T2 were used as a validation set of external (across-herd) validation. The results of the external validation were used to evaluate the generalization of the developed models.

For within-herd validation, dataset T1 was randomly split in test and training sets in a ratio of 1 to 3, and a cow could be either in the test set or in the training set. Prediction models were constructed using the training sets and validated using the test sets, in which true values were masked. For external validation, true values of dataset T2 were masked to validate the performance of developed models, and the training set was the same as within-herd validation. The performance metrics included R², relative error (RE, calculated as RMSPE/ mean of the global data), ratio of standard deviation of the global data to RMSPE from the validation (RPD), and the Spearman correlation coefficient (SpearR) between true values and predictions. Additionally, the prediction model was further investigated by splitting the mean square prediction error (MSPE) into 3 parts: (1) the error due to bias, (2) the error due to the deviation from the slope of the 1:1 line, and (3) random errors [24]. The equations were:

$${Error}_{bias}={\left(\frac{{\sum }_{i=1}^{n}{X}_{i}-{\sum }_{i=1}^{n}{Y}_{i}}{n}\right)}^{2}$$

$${Error}_{slope}=\frac{{\sum }_{i=1}^{n}{({X}_{i}-\stackrel{-}{X})}^{2}}{n}\times {(1-\beta )}^{2}$$

$${Error}_{random}=(1-{R}^{2})\times \frac{{\sum }_{i=1}^{n}({Y}_{i}-\stackrel{-}{Y})}{n}$$

where: ${X}_{i}$ is the i^th predicted value; ${Y}_{i}$ is the i^th true value; $\stackrel{-}{X}$ is the average value of predictions; $\stackrel{-}{Y}$ is the average value of true data; $n$ is the number of samples; $\beta$ is the slope; ${R}^{2}$ is the coefficient of determination. These three sources of error were expressed as percentage of MSPE.

These metrics were calculated based on the predictions and true values of the validation datasets (test set of T1, and T2). The steps of splitting datasets and validations were repeated for five times, and average values of each performance metrics were presented.

Variable importance

The variable importance in this study is based on the absolute value of the regression coefficient (b) of the PLS model [25]. The coefficient is a measure of association between each input variable and the response variable, and higher b values indicate higher importance. This method has been used previously in wavelength selection for infrared spectra [26, 27]. In this study, the b of each input variable was derived and ranked. Afterwards, the top 10 variables were selected to identify the most important predictors in the best models of within-herd validation for each trait.

All the data editing steps and statistics were carried out with the pandas and numpy in Python 3.7 [28].

Descriptive statistics

For the T1 dataset, individual daily NUE was on average 26.4%, with a standard deviation of 8.6%, NL was on average 0.51 kg/day, with a standard deviation of 0.15 kg/day, and DMI_a was on average 25.2 kg/day, with a standard deviation of 6.0 kg/day (Fig. 1). For the T2 dataset, individual daily NUE was on average 23.7%, with a standard deviation of 7.8%, NL was on average 0.48 kg/day, with a standard deviation of 0.11 kg/day, and DMI_a was on average 22.9 kg/day, with a standard deviation of 4.1 kg/day (Fig. 1).

The average values of predicted traits NUE, NL, DMI_a and the non-MIR predictors MY, protein%, DIM and DIP of different diet groups are shown in Table 3. For the diets in T2, the proportion of roughage increased from group 1 to group 3, and the roughage on dry matter basis (C:R) of group 3 was the same as that of T1. The average NUE increased, while the average NL decreased when more roughage was added in the diets of T2. The DMI_a reached the lowest value when the diet of group 2 was supplied to the cows. The average NUE and DMI_a in T1 were higher than those in T2, whereas average NL of T1 was relatively comparable with the NL of T2 (Table 3).

Table 3

The average values for individual nitrogen use efficiency, nitrogen loss, 3-day moving average dry matter intake, and other predictors in each diet group¹.
Trial	Group	C:R	N	NUE (%)	NL (kg)	DMI_a (kg)	MY (kg)	Protein (%)	DIM (day)	DIP (day)
T1		56:44	600	26.4	0.51	25.2	32.5	3.4%	253.3	136.1
T2	1	61:39	34	21.0	0.54	24.8	24.1	3.7%	267.2	144.3
	2	59:41	34	23.8	0.45	21.7	24.0	3.7%	318.3	172.5
	3	56:44	36	26.1	0.45	22.1	27.1	3.6%	310.5	136.3
¹ C:R = ratio of concentrate to roughage on a dry matter basis, NUE = nitrogen use efficiency, NL = nitrogen loss, DMI_a = 3-d moving average of dry matter intake, MY = milk yield, DIM = days in milk, DIP = days in pregnancy.

Most of the average values of predictors in T1 were different from those in T2, e.g. daily MY in T1 was at least 5.4 kg higher than T2, while the protein content in T1 was at least 0.2% lower than T2. Additionally, the cows in T1 had lower DIM and DIP compared to T2 (Table 3).

Within-herd validation

The average R², RE and RPD of within-herd validation results for different traits using the PLS algorithm are shown in Fig. 2. The R² was higher when pre-processed MIR spectra, especially MSC-spectra, were included regardless of the models and traits. The values of RE were the lowest for most models when using SNV-spectra to predict the traits. The values of RPD in each trait and model were relatively comparable when different MIR spectra were used.

In most cases, Model 2, 3,4 and 5 generated comparable results for each trait when the same MIR spectra were used (Fig. 2). Model 6 was the least accurate for NL and DMI_a, regardless of the performance metrics. For NUE, the results produced by Model 6 were close to those produced by Model 2, 3, 4 and 5, whereas the predictive ability of Model 1 was lowest.

For the other two machine learning algorithms, similar distribution patterns of performance metrics (compared to PLS) were obtained for each trait. (Suppl. Figures 1 and 2).

The R²s of the best models for NUE were higher (0.62 to 0.66) than of those for NL (0.53 to 0.58) and DMI (0.60 to 0.63; Table 4). For NUE, Model 5 with the SVM algorithm outperformed the other models, with highest R² (0.66), RPD (2.15), SpearR (0.82) and smallest RE (0.15). For NL and DMI, performance metrics RE, RPD and SpearR were comparable among different algorithms, whereas Model 3 and Model 2 with the RR algorithm generated the highest R² for NL (0.58) and DMI (0.63). Meanwhile, pre-processed MIR spectra (MSC- and SNV-spectra) were incorporated in the best models for all the traits (Table 4). Although the best model varied (Model 1 to Model 5) when different traits or algorithms were included, the predictive abilities of all these best models including MIR spectra were better than the model without MIR spectra (Model 6).

Table 4

Performance metrics of the best prediction models in within-herd validation for each trait¹. Values between brackets indicate the standard deviation.
Trait	Algorithm	Model	MIR	R²	RPD	SpearR
NUE	PLS	5	MSC	0.62(0.01)	1.55(0.25)	0.80(0.01)
	RR	2	MSC	0.62(0.01)	1.48(0.25)	0.80(0.03)
	SVM	5	SNV	0.66(0.01)	2.15(0.15)	0.82(0.03)
NL	PLS	1	SNV	0.56(0.04)	1.41(0.11)	0.79(0.01)
	RR	3	MSC	0.58(0.02)	1.39(0.07)	0.79(0.05)
	SVM	2	MSC	0.53(0.004)	1.35(0.03)	0.74(0.04)
DMI_a	PLS	3	MSC	0.63(0.02)	1.39(0.12)	0.82(0.01)
	RR	2	MSC	0.63(0.02)	1.43(0.09)	0.80(0.03)
	SVM	4	MSC	0.60(0.03)	1.40(0.09)	0.78(0.04)
¹ NUE = nitrogen use efficiency, NL = nitrogen loss, DMI_a = 3-d moving average of dry matter intake, PLS = partial least squares, RR = ridge regression, SVM = support vector machine, MIR = mid-infrared, MSC = multiplicative scatter correction, SNV = standard normal variate, R² = coefficient of determination, RPD = ratio of standard deviation of the global data to root mean square error from the validation, SpearR = Spearman correlation coefficient.

The prediction errors of the best models were further investigated by dividing the MSPE into three sources of error (Table 5). For all the models, random error (random%) accounted for the largest proportion of the MSPE (89.0 to 97.1%), while the error due to mean bias (bias%) and deviation from the slope (slope%) only accounted for a small part of the MSPE (1.1 to 6.5%). The MSPE of the best model for NUE (model 5 with SVM algorithm) was approximately half of the other two models (16.1 vs. 32.4/ 35.6), whereas similar MSPEs were observed among different models for the other traits.

Table 5

The bias, slope, and random proportions of the mean square prediction error of best prediction models in with-herd validation for each trait¹. Values between brackets indicate the standard deviation.
Trait	Algorithm	Model	MIR	MSPE²	RE	bias%	slope%	random%
NUE	PLS	5	MSC	32.4(9.2)	0.21(0.03)	3.7(3.9)	4.2(6.6)	92.1(8.3)
	RR	2	MSC	35.6(9.4)	0.23(0.03)	2.3(2.0)	1.8(2.8)	95.9(3.2)
	SVM	5	SNV	16.1(2.3)	0.15(0.01)	5.7(5.1)	2.1(1.3)	92.2(5.7)
NL	PLS	1	SNV	1.1e-02(1.8e-03)	0.21(0.02)	4.9(5.0)	2.7(3.3)	92.5(3.7)
	RR	3	MSC	1.2e-02(1.2e-03)	0.21(0.01)	4.5(4.5)	6.5(6.4)	89.0(3.3)
	SVM	2	MSC	1.2e-02(5.7e-04)	0.22(0.004)	3.4(4.9)	6.4(4.4)	90.2(6.0)
DMI_a	PLS	3	MSC	18.7(3.5)	0.17(0.01)	4.8(4.1)	3.1(1.3)	92.1(4.7)
	RR	2	MSC	17.5(2.1)	0.17(0.01)	1.3(1.4)	1.6(1.6)	97.1(2.5)
	SVM	4	MSC	18.4(2.6)	0.17(0.01)	1.1(1.6)	6.1(4.3)	92.7(4.6)
¹ NUE = nitrogen use efficiency, NL = nitrogen loss, DMI_a = 3-d moving average of dry matter intake, PLS = partial least squares, RR = ridge regression, SVM = support vector machine, MIR = mid-infrared, MSC = multiplicative scatter correction, SNV = standard normal variate, MSPE = mean square prediction error, RE = relative error, bias% = proportion of error due to mean bias, slope% = proportion of error due to deviation of the slope from 1, random% = proportion of error explained by random error.
² The unit of MSPE: %×% for NUE; kg×kg for NL and DMI_a.

Overall, all three machine learning algorithms generated comparable and reasonable results for different traits. All the best prediction models included MSC- or SNV-MIR spectra, which indicates that pre-processed MIR spectra increased the predictive ability of these traits in within-herd validation.

External validation

The R²s of the external validation were slightly lower for NUE (0.58 to 0.63) than the R²s of the within-herd validation, but considerably lower for NL (0.09 to 0.35) and DMI (0.10 to 0.47; Table 6). For NL, Model 3 was the best model regardless of algorithms, while different best models were observed for NUE and DMI when different machine learning algorithms were included. Additionally, original MIR spectra were used for most of the best models in the external validation (Table 6).

Table 6

Performance metrics of the best prediction models in external validation for each trait.
Trait	Algorithm	Model	MIR	R²	RPD	SpearR
NUE	PLS	2	Original	0.63(0.02)	1.80(0.06)	0.81(0.01)
	RR	4	Original	0.58(0.01)	1.69(0.03)	0.73(0.04)
	SVM	3	Original	0.62(0.01)	1.77(0.02)	0.80(0.02)
NL	PLS	3	Original	0.19(0.03)	1.52(0.03)	0.37(0.05)
	RR	6	No	0.35(0.01)	1.15(0.03)	0.64(0.04)
	SVM	3	Original	0.09(0.02)	1.43(0.01)	0.24(0.02)
DMI_a	PLS	6	No	0.22(0.07)	0.87(0.04)	0.56(0.13)
	RR	6	No	0.47(0.02)	1.29(0.06)	0.74(0.05)
	SVM	3	Original	0.10(0.02)	1.52(0.02)	0.34(0.04)
¹ NUE = nitrogen use efficiency, NL = nitrogen loss, DMI_a = moving average of dry matter intake, PLS = partial least squares, RR = ridge regression, SVM = support vector machine, MIR = mid-infrared, R² = coefficient of determination, RPD = ratio of standard deviation of the global data to root mean square error from the validation, SpearR = Spearman correlation coefficient.

Three sources of MSPE for each model are listed in Table 7. Generally, most of the prediction error was due to random error (79.7 to 98.2%), and a more varied range was observed for the bias% and slope% (0.4 to 15.3%). The model with highest R² for NUE (model 2 with the PLS algorithm) generated the smallest MSPE compared to the other two models. Additionally, higher MSPEs and lower random% were observed for the models using no MIR spectra (model 6) than for models using MIR spectra, regardless of the traits and algorithms.

Table 7

The bias, slope, and random proportions of the mean square prediction error of best prediction models in the external validation for each trait¹. Values between brackets indicate the standard deviation.
Trait	Algorithm	Model	MIR	MSPE	RE	bias%	slope%	random%
NUE	PLS	2	Original	22.7(1.4)	0.19(0.01)	11.3(5.5)	6.0(4.3)	82.7(5.4)
	RR	4	Original	25.7(0.9)	0.20(0.004)	7.8(1.2)	1.4(1.0)	90.9(0.6)
	SVM	3	Original	23.4(0.5)	0.18(0.003)	2.1(1.8)	3.2(4.0)	94.7(5.3)
NL	PLS	3	Original	9.6e-03(3.2e-04)	0.20(0.003)	1.2(0.7)	0.7(0.6)	98.2(1.3)
	RR	6	No	1.4e-02(3.9e-04)	0.26(0.01)	4.3(3.7)	8.9(3.3)	86.9(2.7)
	SVM	3	Original	1.1e-02(1.8e-04)	0.21(0.004)	0.8(1.1)	2.2(0.9)	97.0(1.2)
DMI_a	PLS	6	No	47.4(4.2)	0.28(0.01)	0.8(0.3)	15.3(17.0)	83.9(16.9)
	RR	6	No	20.6(0.7)	0.18(0.01)	9.8(2.1)	10.5(3.3)	79.7(1.8)
	SVM	3	Original	15.3(0.3)	0.16(0.002)	0.4(0.3)	3.7(0.8)	96.0(0.8)
¹ NUE = nitrogen use efficiency, NL = nitrogen loss, DMI_a = 3-d moving average of dry matter intake, PLS = partial least squares, RR = ridge regression, SVM = support vector machine, MIR = mid-infrared, MSC = multiplicative scatter correction, SNV = standard normal variate, MSPE = mean square prediction error, RE = relative error, bias% = proportion of error due to mean bias, slope% = proportion of error due to deviation of the slope from 1, random% = proportion of error explained by random error.
² The unit of MSPE: %×% for NUE; kg×kg for NL and DMI_a.

The best model for individual NUE in external validation (Model 2 with the PLS algorithm and original MIR, Table 6) was further inspected by calculating the R² of each diet group separately. The average R² (and standard deviation) was 0.37 (0.07), 0.50 (0.04) and 0.76 (0.02) for group 1, 2 and 3 in T2, respectively. It was noted that the separate R² of group 3 was relatively high compared to the R² of the other groups and the overall R² of T2.

External validation generated comparable results for NUE, but less accurate results for NL and DMI_a compared to within-herd validations. None of the best models included pre-processed MIR spectra in the external validation (Table 6), which means pre-processing of MIR did not contribute to better predictions in external validations. However, including the information of original MIR spectra reduced RE in the external validation (Table 7). Meanwhile, detailed inspection on diet groups of T2 indicated that R² varied between different diets.

Variable importance

The importance score of MIR spectral regions and other predictors was obtained from within-herd validations (Fig. 3). The best model for NUE when using the PLS algorithm was Model 5, which includes more predictors compared to the best model for NL (Model 1) and DMI_a (Model 3; Table 6). Wavenumbers 973.54 to 988.46 cm^− 1 were the top 10 important predictors for all the traits, while wavenumbers around 1354.00 cm^− 1 and MY were as well important predictors to predict NUE and DMI_a.

The current study aimed to develop the best prediction model for NUE, NL and DMI_a of individual dairy cattle in China. Different pre-processing methods of MIR spectra, machine learning algorithms, predicting equations, and validations (within-herd and external) were investigated. The results indicated that the best prediction model was different for each trait. Reasonable performance metrics were obtained for within-herd validation, while only NUE could be predicted with a relatively high accuracy in the external validation. The results of different diet groups in T2 indicated that diet composition may have considerable impacts on the predictive ability. Additionally, variables that significantly contribute to the prediction were assessed for each trait, which can be helpful for the interpretation of prediction results.

Individual nitrogen use efficiency

The average value of individual NUE in this research (Fig. 1, Table 3) is comparable with that in previous studies [5, 29], which reported ranges from 15 to 40%, but lower in studies that investigated cows in early lactation, in which individual NUE ranged from 34.4–36.9% [11, 12]. This variation may be due to the coverage of a relatively long period (about 300 DIM) for individual NUE in the present study, as well as differences in animals, diets, rearing conditions, and the lactation stage. Grelet et al. [11] observed that more efficient animals have greater negative energy balance in early lactation. Cows mobilize fat tissue and lose weight when the energy balance is negative. The additional protein from tissue mobilization may have resulted, therefore, in a higher NUE in early lactation. Additionally, the variation of NUE in different lactation stages may be explained by the dilution effect of protein requirements for maintenance as a result of the high MY in early lactation. With an increasing stage of lactation, the efficiency decreases, as an increasing fraction of protein (N) is allocated to maintenance and gestation, instead of to milk production [30].

It should be noted that the methodology used to calculate NUE in this study neglected changes in body weight (e.g., fat reserves, fetus, and supporting tissues) and the associated increase or decrease in body N, because these changes are relatively small compared to the N output via milk production. The NUE in this study is expected to be slightly lower than the true NUE considering all N flows in the animal.

Within-herd validation and important variables

The current study developed reasonable prediction models for daily NUE, NL and DMI of individual cows by comparing different prediction algorithms and pre-processing methods for MIR data. Furthermore, the important scores of input variables for different prediction models were evaluated. The performance metrics of the best models for NUE and NL (Tables 4 to 6) were comparable with those in the study of Grelet et al. [11], who reported R² ranging from 0.59 to 0.68, RE ranging from 0.14 to 0.23, and RPD ranging from 1.57 to 2.07. Lahart et al. [23] included both MIR spectra and near-infrared spectra to predict the DMI of individual cows in grazing system and reported R²s ranging from 0.60 to 0.81 in cross-validation. The best R² of DMI_a in the present study ranged from 0.60 to 0.63 in the within-herd validation (Table 4), which was comparable to the results of Lahart et al. [23]. In addition, the prediction accuracy for NL was relatively low compared to that for NUE in our study. This was observed in previous research [11, 12] as well. This may be due to the different nature of NUE and NL. NUE was calculated as the ratio of N output in milk to N intake, while NL was subtracting N output in milk from N intake. The prediction accuracy of N output (obtained from protein yield) in milk is substantially higher than the prediction accuracy of NL or N intake because the MIR profile is capturing N-bounds in the milk, but the prediction of NL or N intake is likely to be indirect and therefore the prediction accuracy is lower. Furthermore, the prediction accuracy of NL is lower than of NUE, because NUE is a ratio, and a ratio has more possibilities to remain stable when both numerator and denominator change. The detailed analysis of MSPE (Table 5) showed that most of the model error was random error, which indicated the established models were unbiased and can capture most of the variability in the input data [24, 31].

In this study, adding MIR spectra in the best models increased the R² and RPD by 10 to 30%, as well as reduced the RE by 0.03 to 0.10 compared to model 6 (Fig. 2). Meanwhile, this improvement was more obvious when MIR spectra were pre-processed in within-herd validation. These results indicate that MIR includes additional information for better prediction of NUE, NL and DMI. Pre-processed MIR spectra were better than raw MIR spectra for developing accurate prediction models in within-herd validation (Table 4).

Milk yield and several MIR wavenumbers were featured variables for predicting N related traits. MY was highly correlated with N output (phenotypic correlation = 0.96), and thus contributed substantially to the models. The high-score wave range of MIR at 973.54 to 988.46 cm^− 1 (Fig. 3) is associated with C-H, C-N and N-N stretching [32–34]. These chemical compounds are related to proteins, unravelling what proteins affect NUE may further increase understanding of the N metabolism in dairy cows. In this study, the top 10 critical variables were selected to develop a new PLS model and predict the target traits using the same procedure. However, the predictive abilities of these reduced models were compromised for 10 to 20% (results not shown), which suggested that the feature selection procedure did not have additional value in this study.

External validation

In the current research, a dataset with three different diets was used for external validation, and relatively less accurate performance metrics were generated for all target traits (Table 6). Similar findings were reported in the research of Grelet et al. [11], where the R² for NUE varied from 0.06 to 0.68 in external validations, which reflected the potential decrease of predictive ability. Lahart et al. [23] also found that the accuracy of external validation of DMI was lower than the cross-validation and that the R² varied considerably among models (0.16 to 0.68). Lower accuracies in external validation may be explained by variations in the validation dataset not covered in the calibration dataset. [11, 35]. In the present study, different R²s for NUE were obtained in different diet groups of T2, and the variation of diet components was impossible to be included in the prediction models due to the identical diet formulation in the calibration dataset (Table 1). Furthermore, T1 and T2 were conducted in different seasons. Extreme heat in summer is expected to affect the metabolic status and further affect the milk production and N utilization of dairy cows [36]. The occurrence of summer heat in T1 may have resulted in lower R²s for predictions of N intake (DMI) and N output than in T2, which was conducted in winter. In addition, as discussed in previous section, NUE may be more robust when both N intake and N output were changed because it is a ratio trait. The prediction accuracies for NUE were relatively stable and comparable.

Predicting cows consuming completely different diets without taking the diets into account usually generate lower accuracies [35]. However, it is still worthwhile to test the robustness of prediction models by external validation, especially for large-scale data prediction. Large-scale data prediction requires much more variation in diets in the training set and much higher numbers of farms and animals to end up with more accurate external validations. In this study, reasonable results were obtained for NUE in the external validation (Table 6), which means the models were robust enough to predict the NUE without considering diet composition. Moreover, the R² of 3 subgroups in T2 indicated that the diet components do affect the prediction accuracy. The average NUE of animals in subgroup 3 was close to that of the calibration dataset (26.1 vs 26.4), and the ratio of concentrate to C:R for subgroup 3 and calibration dataset was the same (56:44). The similar NUE and the equal C:R may be the reason for the accurate predictions (R² = 0.76) by Model 2. In this study, animals in group 1 and 2 of T2 were offered diets with higher C:R ratios than the standard, which significantly reduced milk yield and NUE (Table 3). Therefore, it is expected that the current model would perform even better (or more reasonable, for NL and DMI_a) on those Chinese farms that feed the cows with the regular diet, in which the C:R is 55:45 in most cases (Table 1). As long as feeding regimes are very similar on other farms, the prediction equations may facilitate genetic evaluation on a larger data set of MIR spectra. For instance, Chen et al. [12] applied the prediction models of Grelet et al. [11] to a large dataset for genetic evaluation of predicted NUE and NL in early lactation. It also should be noted that the equations in Grelet et al. [11] were based on three farms in three countries and therefore they might be more robust than prediction equations from the present study with the data from one farm.

MIR spectra do have additional value for predicting N-related traits. Higher MSPEs, and higher bias% and slope% were observed when MIR spectra were not included in the prediction model for DMI_a and NL (Table 7), even though higher R²s were obtained for these models (Table 6). However, as several studies indicated [15, 33, 37], pre-processed MIR cannot always provide more accurate results. In the current research, it was observed that models including pre-processed MIR performed better than models with original MIR in within-herd validation, but performed worse in external validation (Tables 4, 5, 6 and 7). The different seasons and diet compositions may affect the profile of original MIR spectra. Mathematical treatments could further amplify the error if the data points of the calibration data set and validation data set are already different. Thus, the final spectra may strongly affect the quality of prediction. Therefore, it is suggested to pre-test the model with preprocessed and original MIR spectra before using the prediction equations for nationwide prediction.

Future implications

The results in this research showed that predictivities of NUE, NL and DMI_a by milk MIR spectra were affected by diet composition. Changes in these target traits were observed even though we did not perform a detailed dietary analysis in this study (Table 3). There is a potential opportunity to combine the knowledge of animal nutrition and MIR spectra to improve the predictive ability of N related traits, as well as to understand the biological mechanisms underlying these traits. Nevertheless, these results may provide insights in the farm management strategy in China. The improved model and biological understanding could be used to improve feeding management on dairy farms. For example, a suitable ration can result in a higher nitrogen use efficiency for individual cows, which would be beneficial for mitigating the negative environmental impacts of dairy farms.

A reasonable prediction model, with R² of 0.63, RE of 0.19 and RPD of 1.80 in the external validation, was developed for NUE of individual cows (Table 6). It is possible to perform genetic analysis for NUE in a large-scale dataset with MIR records. However, our model was based on a Holstein population in a typical farm in the north of China, which means it might not be applicable to farms in a completely different environment (e.g., climate conditions, diets, management strategies). Therefore, a more comprehensive dataset, which accounts for the variation in environment, is needed to develop a nationwide generalized model to predict N related traits in the Chinese dairy population.

The results of this study indicated that reasonable prediction models for NUE of Chinese Holstein cows were successfully developed across different feeding diets. The prediction models for NL and DMI_a were also reasonable given the similar diet. MIR spectra increased the predictive ability of these traits, but pre-processed MIR spectra cannot always provide more accurate results in external validation compared to original MIR spectra. Among all the potential predictors, milk yield and the specific wave MIR-range from 973.54 to 988.46 cm^− 1 were the most important to predict target traits. This wave range may comprise chemical bonds associated with protein in milk. This can be useful to unravel the biological mechanisms affecting NUE, DMI and NL in dairy cows. More data are needed to improve the generalization of models before conducting large-scale prediction. This prediction will be helpful for mitigating the negative impacts of dairy production on environment by breeding more efficient animals or to optimize feeding management.

BCS: body condition score; C:R: ratio of concentrate to roughage on a dry matter basis; DIM: days in milk; DIM_g: days in milk grouped by 5 days; DIP: days in pregnancy; DMI: dry matter intake; DMI_a: 3-day moving average of dry matter intake; LV: latent variable; MIR: mid-infrared spectra; MSC: multiplicative scatter correction; MSPE: mean square prediction error; MUN: milk urea nitrogen; MY: milk yield; N: nitrogen; NL: nitrogen loss; NUE: nitrogen use efficiency; PLS: partial least squares; R²: coefficient of determination; RE: relative error; RMSPE: root mean square prediction error; RPD: ratio of standard deviation of the global data to root mean square prediction error from the validation; RR: ridge regression; SNV: standard normal variate; SpearR: Spearman correlation coefficient; SVM: support vector machine; T1: the first feeding trial; T2: the second feeding trial; TMR: total mixed ration.

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Availability of data and materials

The datasets used and/or analyzed during the current study are available from the corresponding author upon reasonable request.

Competing interests

The authors declare that they have no competing interests.

Funding

This research was supported by China Agriculture Research System (CARS-36), National Agricultural Genetic Improvement Program (2130135), Beijing Sanyuan Breeding Technology Ltd. Co. funded project (SYZYZ20190005), the Program for Changjiang Scholar and Innovation Research Team in University (IRT_15R62), China Scholarship Council (No. 201913043) and Hainan University. The funders had no role in the study design, data collection and analysis, preparation of the manuscript or in the decision to publish the manuscript.

Authors’ contributions

RS, BD, AvdL, HAM and YW designed the study. RS, WL, BD, AvdL, HAM, SJO and YW analyzed and interpreted data. RS wrote the manuscript. RS, BD, AvdL, HAM and SJO substantively revised the manuscript. WL and SL contributed to accessing tools and materials. All authors read and approved the final manuscript.

Acknowledgements

We thank the Beijing Dairy Cattle Center for providing data, and the members of the cattle nutrition research team of China Agricultural University who participated in the feeding data collection.

Ma L, Ma WQ, Velthof GL, Wang FH, Qin W, Zhang FS, et al. Modeling Nutrient Flows in the Food Chain of China. J Environ Qual. 2010;39:1279–89.
Bai Z, Ma L, Ma W, Qin W, Velthof GL, Oenema O, et al. Changes in phosphorus use and losses in the food chain of China during 1950–2010 and forecasts for 2030. Nutr Cycl Agroecosyst. 2016;104:361–72.
Bai ZH, Ma L, Oenema O, Chen Q, Zhang FS. Nitrogen and Phosphorus Use Efficiencies in Dairy Production in China. J Environ Qual. 2013;42:990–1001.
Chase L. Nitrogen utilization in dairy cows-what are the limits of efficiency? Proc Cornell Nutr Conf. 2003;233–44.
Calsamiglia S, Ferret A, Reynolds CK, Kristensen NB, van Vuuren AM. Strategies for optimizing nitrogen use by ruminants. Animal. 2010;4:1184–96.
FAO (Food and Agriculture Organization of the United Nations). 2022. https://www.fao.org/faostat/en/#data. Accessed 1 June 2022.
Lopez-Villalobos N, Correa-Luna M, Burke J, Sneddon N, Schutz M, Donaghy D, et al. Genetic parameters for milk urea concentration and milk traits in New Zealand grazing dairy cattle. N Z J Anim Sci Prod. 2018;78:56–61.
Tiplady KM, Lopdell TJ, Littlejohn MD, Garrick DJ. The evolving role of Fourier-transform mid-infrared spectroscopy in genetic improvement of dairy cattle. J Anim Sci Biotechnol. 2020;11:39.
Lou W, Zhang H, Luo H, Chen Z, Shi R, Guo X, et al. Genetic analyses of blood β-hydroxybutyrate predicted from milk infrared spectra and its association with longevity and female reproductive traits in Holstein cattle. J Dairy Sci. 2022;105:3269–81.
Mesgaran SD, Eggert A, Höckels P, Derno M, Kuhla B. The use of milk Fourier transform mid-infrared spectra and milk yield to estimate heat production as a measure of efficiency of dairy cows. J Anim Sci Biotechnol. 2020;11:43.
Grelet C, Froidmont E, Foldager L, Salavati M, Hostens M, Ferris CP, et al. Potential of milk mid-infrared spectra to predict nitrogen use efficiency of individual dairy cows in early lactation. J Dairy Sci. 2020;103:4435–45.
Chen Y, Vanderick S, Mota RR, Grelet C, Gengler N. Estimation of genetic parameters for predicted nitrogen use efficiency and losses in early lactation of Holstein cows. J Dairy Sci. 2021;104:4413–23.
Helland I. Some theoretical aspects of partial least squares regression. Chemom Intell Lab Syst. 2001;58:97–107.
Chun H, KeleÅ S. Sparse partial least squares regression for simultaneous dimension reduction and variable selection. J R Stat Soc Ser B (Stat Methodol). 2010;72:3–25.
Bresolin T, Dórea JRR. Infrared Spectrometry as a High-Throughput Phenotyping Technology to Predict Complex Traits in Livestock Systems. Front Genet. 2020;11:923.
Zhou D, Yao K, Xie S, Li B, Zhou F, Li S, et al. Nutrient Apparent Digestibility and Serum Indices of Lactating Dairy Cows. Chin J Anim Nutr. 2018;30:2741–8.
Liu J. Effects of Dietary Whole Corn Silage Levels on Milk Performance, Feeding Behavior and Rumination Behavior of late lactating dairy cows. Master’s thesis, China Agricultura University. 2020.
FAO (Food and Agriculture Organization of the United Nations). Food and nutrition paper 77, Food energy—Methods of analysis and conversion factors. Rome: FAO; 2003.
WHO and FAO (World Health Organization and Food and Agriculture Organization of the United Nations). Codex Alimentarius: Milk and Milk Products. 2nd ed. Rome: WHO FAO; 2011.
McParland S, Berry DP. The potential of Fourier transform infrared spectroscopy of milk samples to predict energy intake and efficiency in dairy cows. J Dairy Sci. 2016;99:4056–70.
Grelet C, Bastin C, Gelé M, Davière J-B, Johan M, Werner A, et al. Development of Fourier transform mid-infrared calibrations to predict acetone, β-hydroxybutyrate, and citrate contents in bovine milk through a European dairy network. J Dairy Sci. 2016;99:4816–25.
Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, et al. Scikit-learn: Machine Learning in Python. J Mach Learn Res. 2011;12:2825–30.
Lahart B, McParland S, Kennedy E, Boland TM, Condon T, Williams M, et al. Predicting the dry matter intake of grazing dairy cows using infrared reflectance spectroscopy analysis. J Dairy Sci. 2019;102:8907–18.
Bibby J, Toutenburg H. Prediction and Improved Estimation in Linear Models. John Wiley and Sons; 1977.
Saeys Y, Inza I, Larranaga P. A review of feature selection techniques in bioinformatics. Bioinformatics. 2007;23:2507–17.
Osborne S, Künnemeyer R, Jordan R. Method of Wavelength Selection for Partial Least Squares. Analyst. 1997;122:1531–7.
Xu H, Liu Z, Cai W, Shao X. A wavelength selection method based on randomization test for near-infrared spectral analysis. Chemom Intell Lab Syst. 2009;97:189–93.
Van Rossum G, Drake FL. Python 3 Reference Manual. Scotts Valley: CreateSpace; 2009.
Nadeau E, Englund J-E, Gustafsson A. Nitrogen efficiency of dairy cows as affected by diet and milk yield. Livest Sci. 2007;111:45–56.
Phuong HN, Friggens NC, de Boer IJM, Schmidely P. Factors affecting energy and nitrogen efficiency of dairy cows: A meta-analysis. J Dairy Sci. 2013;96:7245–59.
Souza MC, Oliveira AS, Araújo CV, Brito AF, Teixeira RMA, Moares EHBK, et al. Short communication: Prediction of intake in dairy cows under tropical conditions. J Dairy Sci. 2014;97:3845–54.
Grelet C, Fernández Pierna JA, Dardenne P, Baeten V, Dehareng F. Standardization of milk mid-infrared spectra from a European dairy network. J Dairy Sci. 2015;98:2150–60.
Shetty N, Løvendahl P, Lund MS, Buitenhuis AJ. Prediction and validation of residual feed intake and dry matter intake in Danish lactating dairy cows using mid-infrared spectroscopy of milk. J Dairy Sci. 2017;100:253–64.
Xia Y, Ugarte CM, Guan K, Pentrak M, Wander MM. Developing Near- and Mid‐Infrared Spectroscopy Analysis Methods for Rapid Assessment of Soil Quality in Illinois. Soil Sci Soc Am j. 2018;82:1415–27.
Dardenne P. Some Considerations about NIR Spectroscopy: Closing Speech at NIR-2009. Nir News. 2010;21.
West J, Mullinix B, Bernard J. Effects of Hot, Humid Weather on Milk Temperature, Dry Matter Intake, and Milk Yield of Lactating Dairy Cows. J Dairy Sci. 2003;86:232–42.
Cafferky J, Sweeney T, Allen P, Sahar A, Downey G, Cromie A, et al. Investigating the use of visible and near infrared spectroscopy to predict sensory and texture attributes of beef M. longissimus thoracis et lumborum. Meat Sci. 2019;159:107915.

Download PDF

Journal Publication

published 10 Jan, 2023

Read the published version in Journal of Animal Science and Biotechnology →

Reviewers invited by journal
07 Jun, 2022
Editor assigned by journal
06 Jun, 2022
First submitted to journal
03 Jun, 2022

You are reading this latest preprint version

Predicting nitrogen use efficiency, nitrogen loss and dry matter intake of individual dairy cows in late lactation by including mid-infrared spectra of milk samples

Status:

Journal Publication

Version 1

Abstract

Background

Results

Conclusion

Figures

Background

Methods

Animals

Feeding trials and diet analysis

Milk analysis and MIR spectra

Data editing

Model development

Validation

Variable importance

Results

Descriptive statistics

Within-herd validation

External validation

Variable importance

Discussion

Individual nitrogen use efficiency

Within-herd validation and important variables

External validation

Future implications

Conclusions

Abbreviations

Declarations

References

Supplementary Files

Status:

Journal Publication

Version 1