Disentangling data dependency using cross-validation strategies to evaluate prediction quality of cattle grazing activities using machine learning algorithms and wearable sensor data

Ribeiro, Leonardo Augusto Coelho; Bresolin, Tiago; Rosa, Guilherme Jordão de Magalhães; Casagrande, Daniel Rume; Danés, Marina de Arruda Camargo; Dórea, João Ricardo Rebouças

doi:10.1093/jas/skab206

Cited by 10 publications

(4 citation statements)

References 24 publications

Supporting

Mentioning

Contrasting

Order By: Relevance

“…In the first approach, an animal-based split [ 62 ] was chosen, meaning that all observations on an individual cow were either placed within the training dataset or test dataset. The dataset was randomly split by animal into a training dataset containing 80% of the records and a test dataset consisting of the remaining 20%.…”

Section: Methodsmentioning

confidence: 99%

The Combined Use of Automated Milking System and Sensor Data to Improve Detection of Mild Lameness in Dairy Cattle

Lemmens

Schodl

Fuerst-Waltl

et al. 2023

Animals

View full text Add to dashboard Cite

This study aimed to develop a tool to detect mildly lame cows by combining already existing data from sensors, AMSs, and routinely recorded animal and farm data. For this purpose, ten dairy farms were visited every 30–42 days from January 2020 to May 2021. Locomotion scores (LCS, from one for nonlame to five for severely lame) and body condition scores (BCS) were assessed at each visit, resulting in a total of 594 recorded animals. A questionnaire about farm management and husbandry was completed for the inclusion of potential risk factors. A lameness incidence risk (LCS ≥ 2) was calculated and varied widely between farms with a range from 27.07 to 65.52%. Moreover, the impact of lameness on the derived sensor parameters was inspected and showed no significant impact of lameness on total rumination time. Behavioral patterns for eating, low activity, and medium activity differed significantly in lame cows compared to nonlame cows. Finally, random forest models for lameness detection were fit by including different combinations of influencing variables. The results of these models were compared according to accuracy, sensitivity, and specificity. The best performing model achieved an accuracy of 0.75 with a sensitivity of 0.72 and specificity of 0.78. These approaches with routinely available data and sensor data can deliver promising results for early lameness detection in dairy cattle. While experimental automated lameness detection systems have achieved improved predictive results, the benefit of this presented approach is that it uses results from existing, routinely recorded, and therefore widely available data.

show abstract

Section: Methodsmentioning

confidence: 99%

The Combined Use of Automated Milking System and Sensor Data to Improve Detection of Mild Lameness in Dairy Cattle

Lemmens

Schodl

Fuerst-Waltl

et al. 2023

Animals

View full text Add to dashboard Cite

show abstract

“…Therefore, validation B assessed how the model would perform when predicting weekly average DMI for new cows. Data dependencies between calibration and test sets, which were reported when animals from the same herds are included in both calibration and test data sets, may inflate prediction performance (e.g., Wang and Bovenhuis, 2019;Coelho Ribeiro et al, 2021). However, this is not always the case, as in study by Lahart et al (2019), where the average accuracy of DMI prediction in test sets using only MIRS and MIRS combined with MY, F%, P%, BW, stage of lactation and parity, using within herd and across-herd crossvalidations, were 0.69, 0.87, 0.55, and 0.80, respectively.…”

Section: Cross-validation To Assess Model Robustnessmentioning

confidence: 99%

Predicting dry matter intake in Canadian Holstein dairy cattle using milk mid-infrared reflectance spectroscopy and other commonly available predictors via artificial neural networks

Shadpour

Chud²,

Hailemariam³

et al. 2022

Journal of Dairy Science

View full text Add to dashboard Cite

Dry matter intake (DMI) is a fundamental component of the animal's feed efficiency, but measuring DMI of individual cows is expensive. Mid-infrared reflectance spectroscopy (MIRS) on milk samples could be an inexpensive alternative to predict DMI. The objectives of this study were (1) to assess if milk MIRS data could improve DMI predictions of Canadian Holstein cows using artificial neural networks (ANN); (2) to investigate the ability of different ANN architectures to predict unobserved DMI; and (3) to validate the robustness of developed prediction models. A total of 7,398 milk samples from 509 dairy cows distributed over Canada, Denmark, and the United States were analyzed. Data from Denmark and the United States were used to increase the training data size and variability to improve the generalization of the prediction models over the lactation. For each milk spectra record, the corresponding weekly average DMI (kg/d), test-day milk yield (MY, kg/d), fat yield (FY, g/d), and protein yield (PY, g/d), metabolic body weight (MBW), age at calving, year of calving, season of calving, days in milk, lactation number, country, and herd were available. The weekly average DMI was predicted with various ANN architectures using 7 predictor sets, which were created by different combinations MY, FY, PY, MBW, and MIRS data. All predictor sets also included age of calving and days in milk. In addition, the classification effects of season of calving, country, and lactation number were included in all models. The explored ANN architectures consisted of 3 training algorithms (Bayesian regularization, Levenberg-Marquardt, and scaled conjugate gradient), 2 types of activation functions (hyperbolic tangent and linear), and from 1 to 10 neurons in hidden layers). In addition, partial least squares regression was also applied to predict the DMI. Models were compared using cross-validation based on leaving out 10% of records (validation A) and leaving out 10% of cows (validation B). Superior fitting statistics of models comprising MIRS information compared with the models fitting milk, fat and protein yields suggest that other unknown milk components may help explain variation in weekly average DMI. For instance, using MY, FY, PY, and MBW as predictor variables produced a predictive accuracy (r) ranging from 0.510 to 0.652 across ANN models and validation sets. Using MIRS together with MY, FY, PY, and MBW as predictors resulted in improved fitting (r = 0.679-0.777). Including MIRS data improved the weekly average DMI prediction of Canadian Holstein cows, but it seems that MIRS predicts DMI mostly through its association with milk production traits and its utility to estimate a measure of feed efficiency that accounts for the level of production, such as residual feed intake, might be limited and needs further investigation. The better predictive ability of nonlinear ANN compared with linear ANN and partial least squares regression indicated possible nonlinear relationships between weekly average DMI and the predictor variables. In ...

show abstract

“…And, the significance test based on p value faces a crisis of duplication, that is, the irreproducibility of research results. Strict and systematic use of machine learning cross-validation technology [ 44 ] can provide great potential for realizing the reproducibility of psychological research. Technology-based machine learning can construct learning models from massive amounts of data, more accurately identify the underlying laws of the data, and have stronger generalization capabilities.…”

Section: Related Workmentioning

confidence: 99%

Research on Students’ Mental Health Based on Data Mining Algorithms

Luo

2021

Journal of Healthcare Engineering

View full text Add to dashboard Cite

With the diversification and rapid development of society, people’s living conditions, learning and friendship conditions, and employment conditions are facing increasing pressure, which greatly challenges people’s psychological endurance. Therefore, strengthening the mental health education of students has become an urgent need of society and a hot issue of common concern. In order to solve the problems of high misjudgment rate and low work efficiency in the current mental health intelligence evaluation process, a mental health intelligence evaluation system based on a joint optimization algorithm is proposed. The joint optimization algorithm consists of an improved decision tree algorithm and an improved ANN algorithm. First, analyze the current research status of mental health intelligence evaluation, and construct the framework of mental health intelligence evaluation system; then collect mental health intelligence evaluation data based on data mining, use joint learning algorithm to analyze and classify mental health intelligence evaluation data, and obtain mental health intelligence evaluation results. Finally, through specific simulation experiments, the feasibility and superiority of the mental health intelligent evaluation system are analyzed. The results show that the system in the article overcomes the shortcomings of the existing mental health intelligence evaluation system, improves the accuracy of mental health intelligence evaluation, and improves the efficiency of mental health intelligence evaluation. It has good system stability and can meet the actual current situation, which are requirements for mental health intelligence evaluation.

show abstract

Disentangling data dependency using cross-validation strategies to evaluate prediction quality of cattle grazing activities using machine learning algorithms and wearable sensor data

Cited by 10 publications

References 24 publications

The Combined Use of Automated Milking System and Sensor Data to Improve Detection of Mild Lameness in Dairy Cattle

The Combined Use of Automated Milking System and Sensor Data to Improve Detection of Mild Lameness in Dairy Cattle

Predicting dry matter intake in Canadian Holstein dairy cattle using milk mid-infrared reflectance spectroscopy and other commonly available predictors via artificial neural networks

Research on Students’ Mental Health Based on Data Mining Algorithms

Contact Info

Product

Resources

About