Next Article in Journal
Group B Streptococcus-Induced Macropinocytosis Contributes to Bacterial Invasion of Brain Endothelial Cells
Next Article in Special Issue
Ecological and Anthropogenic Spatial Gradients Shape Patterns of Dispersal of Foot-and-Mouth Disease Virus in Uganda
Previous Article in Journal
Antiviral Activity of Contemporary Contact Lens Care Solutions against Two Human Seasonal Coronavirus Strains
Previous Article in Special Issue
Predicting Bovine Respiratory Disease Risk in Feedlot Cattle in the First 45 Days Post Arrival
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Predictive Models for Weekly Cattle Mortality after Arrival at a Feeding Location Using Records, Weather, and Transport Data at Time of Purchase

by
Lauren Wisnieski
1,2,*,
David E. Amrine
2,3 and
David G. Renter
2,3
1
Center for Animal and Human Health in Appalachia, College of Veterinary Medicine, Lincoln Memorial University, Harrogate, TN 37752, USA
2
Center for Outcomes Research and Epidemiology, College of Veterinary Medicine, Kansas State University, Manhattan, KS 66502, USA
3
Beef Cattle Institute, Kansas State University, Manhattan, KS 66502, USA
*
Author to whom correspondence should be addressed.
Pathogens 2022, 11(4), 473; https://doi.org/10.3390/pathogens11040473
Submission received: 22 February 2022 / Revised: 12 April 2022 / Accepted: 14 April 2022 / Published: 15 April 2022

Abstract

:
Feedlot mortality negatively affects animal welfare and profitability. To the best of our knowledge, there are no publications on predictive models for weekly all-cause mortality in feedlot cattle. In this study, random forest models to predict weekly mortality for cattle purchase groups (n = 14,217 purchase groups; 860,545 animals) from arrival at the feeding location (Day 1) to Day 42 and cumulative mortality from Day 43 until slaughter were built using records, weather, and transport data available at the time of purchase. Models were evaluated by calculating the root mean squared error (RMSE) and accuracy (as defined as the percent of purchase groups that had predictions within 0.25% and 0.50% of actual mortality). The models had high accuracy (>90%), but the RMSE estimates were high (range = 1.0% to 4.1%). The best predictors were maximum temperature and purchase weight, although this varied by week. The models performed well among purchase groups with low weekly mortality but performed poorly in high mortality purchase groups. Although high mortality purchase groups were not accurately predicted utilizing the models in this study, the models may potentially have utility as a screening tool for very low mortality purchase groups after arrival. Future studies should consider building iterative models that utilize the strongest predictors identified in this study.

1. Introduction

From 1994 to 2011, feedlot mortality has remained steady at around 1% [1]. Feedlot mortality has negative effects, including decreased animal welfare and reduced profitability [2]. For each percentage increase in mortality, added costs due to lower average daily gain and feed conversion ratio are estimated to amount to 1 USD/head [3]. A potential way to lower beef cattle mortality is to make management changes that may improve animal health, including the adjustment of processing procedures, preconditioning cattle, and improving the monitoring of high-risk groups of cattle [4,5]. One tool that can be used to improve the identification of cattle at-risk of mortality is predictive modeling, which is a statistical modeling method that can be used to predict future events [6]. Previous studies focused on building predictive models for bovine respiratory disease (BRD) morbidity [7,8], the number of days on feed (DOF) to first treatment [9], and identifying cattle that did not finish the production cycle normally after BRD treatment [10]. However, to the best of our knowledge, no previous studies developed predictive models for all-cause mortality in beef cattle.
Ideally, cattle mortality can be predicted as early as possible so that interventions can be promptly implemented. Previously, we showed that weather parameters and demographic factors measured on the day of purchase are associated with cumulative BRD mortality in the first 60 days on feed [11]. In addition, Amrine et al. 2019 [8] built predictive models for the BRD risk category and included multiple predictors, including weather variables pre- and post-arrival at a feeding location, and found that models that included information from the sale barn had similar accuracy compared to models built with lot arrival data. However, it is unknown if purchase data can be used to predict weekly all-cause mortality. Accurately predicting which groups of cattle are at risk of high mortality as early as the day of purchase may have benefits over only using information available when cattle arrive at the feeding location. Predictions of mortality risk generated at purchase can guide transportation and processing procedures, allowing for earlier intervention in groups of cattle that are expected to have high mortality shortly after arrival at a feeding location. For example, antimicrobials can be better utilized to target groups of animals with high-risk animals after arrival at the feedlot [4].
The objective of this study was to build a series of random forest models to predict weekly mortality from arrival at a feeding location (Day 1) to 42 days on feed (DOF) (Day 42) and cumulative mortality from Day 43 to slaughter in purchase groups from one large commercial beef cattle feeding system based on records, weather, and transport data available at the time of purchase.

2. Results

2.1. Descriptive Statistics

Descriptive statistics for the full sample are presented in Table 1 and Table 2. The training dataset contained 9967 purchase groups and the test dataset contained 4250 purchase groups. Ninety-eight percent of deaths were attributed to BRD. Most of the purchase groups were all steers, contained all calves or both calves and yearlings (versus all yearlings), and had weaned animals. The majority of purchase groups were purchased in winter, were purchased from the Southern United States (US) or were from mixed sources, and were purchased from the auction or had a mix of auction/contracted animals.

2.2. Model Results

The optimal random forest mtry parameters (number of randomly selected predictors to choose from at each split in the trees) identified from the grid search were 2 (Weeks 1–4 and Week 6) and 3 (Week 5 and Day 43 to slaughter). The optimal number of trees (which are pooled to form a forest of which predictions are made) for the models were 1000, 1000, 1300, 1300, 1300, 1500, and 1000 for Weeks 1–6 and for Day 43 to slaughter, respectively. The mean and median DOF was 100.5 days and 82 days, respectively. The final models included 17 variables, as described in Table 3.
Overall weekly model results are presented in Table 4. The accuracy of the models was highest for Week 1 and decreased as weeks increased. Accuracy was very high for Week 1–Week 6 (>90%) but was much lower for Day 43 to slaughter (<70%). Overall, the most important predictor variables were maximum temperature and purchase weight.
Table 5 and Table 6 describe average model performance (RMSE and accuracy) when stratified by different purchase group characteristics. In general, the models performed worse in high mortality purchase groups (>2% mortality) and in purchase groups with high-risk characteristics. For example, the models performed worse in purchase groups with calves and mixed groups compared to purchase groups with only yearlings. Model accuracy (the ability of the model to predict mortality within 0.25% of actual mortality) was low for purchase groups with high weekly mortality (Table 6). Model performance, as measured by root mean squared error (RMSE), among high mortality purchase groups (>5%) was best in the Week 4 and Week 5 models and worst in the Week 1 model. However, even in the best performing weeks for high mortality purchase groups, the RMSE estimates were high (0.083 and 0.081 for Week 4 and Week 5, respectively). RMSE estimates can be interpreted on the scale of outcome (% mortality) by multiplying by 100. Therefore, an RMSE of 0.083 indicates that the predicted mortality deviated, on average, 8.3% from observed mortality. The models had high accuracy and low RMSE estimates for purchase groups with low weekly mortality (0% to ≤2%).
In sensitivity and specificity analyses to assess the model’s ability to accurately detect purchase groups with low mortality (0% to 0.25% or 0% to 0.50%), Weeks 3–5 had the best balance of sensitivity and specificity, although specificity was low (Table 7). The Week 1 model had low specificity because the models performed poorly at identifying the few purchase groups that had higher mortality. The Day 43 to Slaughter model had 100% specificity because there were no purchase groups with low mortality, which the models accurately predicted.

3. Discussion

Predicting feedlot mortality risk at the time of purchase could help determine when changes in processing or management are needed for certain purchase groups. For example, purchase groups that are high-risk may be sent to a backgrounding facility or may have modified processing procedures that can reduce stress [5]. Intervening right after purchase is ideal since it is one of the earliest points for intervention. Likewise, identifying purchase groups that are low-risk could be beneficial for personnel management as-well-as judicial antimicrobial administration. Therefore, our objective was to build all-cause mortality predictive models using data available at the time of purchase.
Random forest models for weekly mortality of beef cattle after arrival at a feeding location were built using cattle records, weather information, and transport data available at the time of purchase. Even though the models had high accuracy (>90%), the RSMEs were large when considering the scale of weekly percent mortality at a feedlot. In order to illustrate this, mortality in the first 30 days of the feeding period averaged 0.40% and 0.35% in heifers and steers, respectively, in a study by Vogel et al. 2015 [12]. Therefore, an RMSE of approximately 1%, as reported in some of our models, would be a large margin around the actual percent mortality. The models performed more poorly among purchase groups with high weekly mortality. There are several possible reasons for this finding. One major reason is that predictions generated from predictive models tend to favor the majority class, and predictions are shrunk towards the sample mean [13]. Since the average overall weekly mortality was close to 0%, the predictions were shrunk towards 0%, and the models tended to under-predict for high-mortality purchase groups. In general, the models also performed worse for purchase groups with characteristics associated with a higher risk of morbidity and mortality. For example, purchase groups with known risk factors for mortality such as calves (versus yearlings), unweaned calves, lower average arrival weight, calves purchased from auctions, and calves purchased in the fall had less accurate predictions compared with their lower-risk counterparts [11,14,15,16]. This is also likely due to shrinkage towards the sample mean and consistent with under-prediction of mortality risk.
Predictions were less accurate as the weeks progressed, likely because the predictions were farther and farther away from the data used to generate them, and there were intervening events and treatments. For example, transport, processing, co-mingling with other purchase groups, weather after arrival, and treatments all affected mortality risk and were not included in the prediction models [5,11,14]. Cattle processing is of concern because the perceived risk on arrival can influence how cattle are processed. High-risk cattle were more likely to receive metaphylaxis and potentially other preventative measures, which affects mortality risk.
In the groups that had 0% mortality, random forest models were highly accurate (100%) with low RMSE values (range of 0.001 to 0.003) for Week 1 through Week 6. Although models were less accurate at identifying groups with greater than 0% mortality, there could be significant utility in models that accurately identify groups at low risk of mortality (0%). For example, fewer personnel can be allocated to monitoring low-risk groups. However, this will lead to an increased number of false negatives and decreased sensitivity (i.e., high-risk groups identified as low-risk). The models with the best balance of sensitivity and specificity for detecting low mortality purchase groups (0–0.50% mortality) were models for Weeks 3–5.
Although the models performed relatively poorly on high-mortality groups, the results still have utility in that models performed well on low mortality purchase groups (0% to 0.25% mortality or 0% to 0.50% mortality), and they demonstrated the relative predictive ability of data at different time points. The strongest predictors can inform future studies aimed to build predictive models for mortality. It is interesting to note that the strongest predictors varied by week. In earlier weeks, the maximum temperature was a strong predictor, but in later weeks purchase weight became more important. Higher temperatures contribute to heat stress, which increases the risk of morbidity and mortality [11,17]. The predictive ability of the maximum temperature on the day of purchase may decrease over time since the weather conditions have likely changed after arrival at the feeding location. Purchase weight, which was consistently a strong predictor but increased in importance as weeks progressed, was also consistently associated with morbidity and mortality in multiple studies [7,11,14,16]. Age (calf and mixed purchase groups versus yearling only purchase groups) was a strong predictor in later weeks. Age is a rough proxy for body weight, but calves also may have a higher risk of morbidity and mortality after arrival at a feeding location compared to yearlings because they have not been exposed to as many pathogens over time, making them more susceptible [18,19]. The region was a strong predictor in Weeks 3–5. Studies found a significant association between the region of origin and morbidity and mortality in beef cattle [15,16,20]. This could be due to increased shipping distances from farther regions, which may increase transportation stress [21]. Of note, in the weeks with the highest sensitivity and specificity for detecting low mortality purchase groups (Weeks 3–5), the strongest predictors were purchase weight, number of cattle in the purchase group, region, and weaned status. Future studies should incorporate these predictors in models with consideration of their varying predictive ability over time.
To the best of our knowledge, the present study is the first to attempt to build predictive models for all-cause mortality. Other studies that have built models for predicting BRD morbidity and the probability of not finishing the production cycle also had difficulty producing accurate predictions [9,22] or had very inconsistent accuracy based on factors such as DOF and arrival weight [7]. However, Kayser et al. 2019 [23] built models with high predictive accuracy for BRD incidence using feeding behaviors, so it is possible that feeding behaviors could be a useful predictor for mortality since BRD is a major cause of death in feedlot cattle [24]. Several potential predictors for BRD were discussed in a narrative review by Wisnieski et al. 2021b [25]. The review highlighted complete blood counts, acute phase protein concentrations, and data from precision livestock farming technologies as predictors that should be evaluated further for their predictive ability [25]. In addition, predictions also can be improved by building models that include more information after arrival at a feeding location and update daily or weekly with new estimates. Babcock et al. 2013 [7] found that including daily morbidity data after arrival improved morbidity predictions for cohorts of cattle that were determined to be high-risk at arrival. Similar to our findings, the authors found that the models generally performed better for low-risk cohorts.
There are several limitations of this study. One limitation is that purchase-group level data were used throughout the analysis, but purchase groups were combined into different lots at the feeding location. However, at the time of purchase, it was unknown which purchase groups would be combined into lots, so the analysis reflected real-life management practices. This could have lowered the predictive ability of the models because lot-level factors, such as the degree of co-mingling of purchase groups into lots, can affect disease risk [26]. In addition, only one feeding operation was utilized for the analysis. The results cannot be generalized outside of the study population without further validation. However, there were several feeding locations in different states. The results of this study can be used to guide future predictive models that incorporate multiple feeding operations.
In summary, records, weather, and transport data available at the time of purchase produced predictions for weekly mortality among purchase groups with wide error margins. The models performed poorly among high-risk purchase groups, underestimating their mortality risk, which indicates that the models would not accurately identify high-risk purchase groups for additional health interventions. However, the models performed well for low-mortality purchase groups; therefore, the models could potentially serve as a screening tool to identify low mortality purchase groups. Model accuracy decreased as weeks progressed, however, the sensitivity and specificity of the models to detect low mortality purchase groups were highest in Weeks 3–5. Future studies should consider utilizing some of the strongest predictors that were identified in this study and build daily or weekly risk models that can incorporate more information as days on feed increase. In addition, future studies can investigate the use of different modeling strategies, such as building separate models for low versus high-risk purchase groups or building predictive models to identify categories of mortality risk (quartiles or quintiles of mortality).

4. Materials and Methods

Animal use approval was not needed because the data were ascertained through an existing dataset. All data management and statistical analyses were performed in Stata version 14.2 (StataCorp, College Station, TX, USA) and R version 3.6.0 (R Foundation for Statistical Computing, Vienna, Austria).

4.1. Data Sources

Information on demographics, purchase and backgrounding locations, and health outcomes were obtained through an existing operational data system from a large commercial feeding operation located in the midwestern United States. The unit of analysis was the purchase group, which was defined as animals that are purchased, grouped, and then transported together to the backgrounding or feedlot location. Prior to data cleansing, the dataset included 14,631 purchase groups. The dataset included all purchase groups from November 2015 to January 2019. More details about the feeding system are described in Wisnieski et al. 2021a [11].
Weather information only on the purchase day was used in the models. Weather information on the purchase day was downloaded using the DarkSky application program interface (API) with the “dark sky” package in R [27,28]. Daily weather data included wind bearing, apparent temperature minimum and maximum, temperature minimum and maximum, precipitation accumulation, precipitation type, ultraviolet (UV) index, dew point, humidity, and wind speed. The latitude and longitude of the purchase location and the backgrounding or feedlot location were downloaded for each purchase group through the MapQuest API with the “mqgeocode” command in Stata [29,30].

4.2. Data Partitioning

The full dataset was partitioned into test (30%) and training (70%) datasets and balanced based on quartiles of the outcome (% cumulative mortality for the entire feeding period) using the “caret” package in R [31].

4.3. Data Processing

Pre-processing and exploratory data analysis were completed using the training dataset. Pre-processing included visually assessing variables using histograms and box plots for unusual observations, checking for missingness, bivariable analyses with the outcome, and multicollinearity assessment [13]. The full list of variables considered for inclusion is presented in Supplementary Table S1. Variables with a large amount of missing data or were highly correlated (r > 0.90) with another variable were dropped. In total, 16 variables were dropped. Levels of categorical variables with sparse categories were combined. There were very few purchase days with precipitation; therefore, precipitation was treated as a binary categorical variable (0 = there was no precipitation, 1 = there was precipitation). Observations that were missing any of the final variables listed in Table 3 were dropped. The test data were processed the same as the training data. After processing, the final dataset included 14,217 purchase groups, consisting of 860,545 animals.

4.4. Random Forests

Random forests are a tree-based ensemble method that constructs a multitude of decision trees. The forest’s predictions are calculated by averaging the individual trees’ predictions. The ensemble nature of random forest models makes understanding the relationships between predictors and outcomes impossible; however, random forest models are specifically designed to optimize predictions. Random forests can be used for either classification of categorical data or for regression [32]. For this analysis, regression was used since the outcome was defined as percent mortality. The training dataset was used to build 7 random forest models: 6 models for weekly mortality from arrival at a feeding location (Day 1) to Day 42 and one model for cumulative mortality from Day 43 to slaughter, using the “caret” package in R. For each model, mortality was defined as the number that died during that time period/total number in purchase group at the time of purchase. One tuning parameter for random forests is mtry, which is the number of randomly selected variables to choose from at each split. For each model, five values of mtry between 2 and 17 were tested with approximately equal increments; then, the optimum mtry value was narrowed down to a more precise value using the “bestTune” model output [32]. For each random forest, at least 1000 trees were used. The number of trees increased from 1000 until the performance of the model leveled off [32]. RMSE values, which measure (on average) how far the residuals are from 0, were calculated for each model through five-fold cross-validation (CV) to evaluate model performance [32]. Finally, the 7 random forest models were tested for predictive ability on the test dataset. Accuracy was calculated as the percent of purchase groups that were correctly classified within 0.25% and 0.50% percent of actual weekly mortality. RMSE values and accuracy within 0.25% were reported by purchase group characteristics to assess if the models performed better in certain sub-groups of cattle. Lower RMSE values indicate that predictions have smaller deviations from the observed values. RMSE can be interpreted on the scale of mortality by multiplying RMSE values by 100. Variance importance was also reported in order to determine which variables were the strongest predictors. Variables with high importance are used more by the model to make predictions compared to variables with low importance.
In order to further examine the ability of the models to detect low mortality purchase groups, sensitivity and specificity were calculated to evaluate the ability of the models to detect purchase groups with 0% to 0.25% and 0% to 0.50% weekly mortality. Sensitivity was calculated as the percent of low mortality purchase groups that were accurately predicted to have low mortality. Specificity was calculated as the percent of purchase groups that did not have low mortality that was accurately identified as not having low mortality.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/pathogens11040473/s1, Table S1: Description of predictors considered in random forest models predicting all-cause mortality among purchase groups of feedlot cattle.

Author Contributions

Conceptualization, L.W., D.E.A. and D.G.R.; Methodology, L.W., D.E.A. and D.G.R.; Formal Analysis, L.W.; Resources, D.G.R.; Data Curation, D.G.R.; Writing—Original Draft Preparation, L.W.; Writing—Review and Editing, L.W., D.E.A. and D.G.R., Supervision, D.G.R.; Funding Acquisition, D.G.R. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Agriculture and Food Research Initiative Competitive Grant no. 2015-67015-23079 from the USDA National Institute of Food and Agriculture and the Agriculture Hatch Act of 1887 (Multistate Research Fund No. 1018845).

Institutional Review Board Statement

Ethical review and approval were waived for this study due to the use of existing data.

Informed Consent Statement

Not applicable.

Data Availability Statement

The raw data are not publicly available to ensure confidentiality. However, any data summaries can be made available on request.

Acknowledgments

The authors would like to thank the participating cattle feeding operation.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.

References

  1. National Animal Health Monitoring System (NAHMS). Part III: Trends in Health and Management Practices on U.S. Feedlots, 1994–2011; United States Department of Agriculture-Animal and Plant Health Inspection Service, Veterinary Sciences, Centers for Epidemiology and Animal Health: Fort Collins, CO, USA, 2011.
  2. Brooks, K.R.; Raper, K.C.; Ward, C.E.; Holland, B.P.; Krehbiel, C.R.; Step, D.L. Economic effects of bovine respiratory disease on feedlot cattle during backgrounding and finishing phases. Prof. Anim. Sci. 2011, 27, 195–203. [Google Scholar] [CrossRef]
  3. Irsik, M.; Langemeier, M.; Schroeder, T.; Spire, M.; Order, J.D. Estimating the effects of animal health on the performance of feedlot cattle. Bovine Pract. 2006, 40, 65–74. [Google Scholar]
  4. Taylor, J.D.; Lehenbauer, T.W.; Step, D.L.; Confer, A.W. The epidemiology of bovine respiratory disease: What is the evidence for preventive measures? Can. Vet. J. 2010, 51, 1351–1359. [Google Scholar] [PubMed]
  5. Noffsinger, T.; Lukasiewicz, K.; Hyder, L. Feedlot processing and arrival cattle management. Vet. Clin. Food Anim. 2015, 31, 323–340. [Google Scholar] [CrossRef] [PubMed]
  6. Shmueli, G. To explain or to predict? Stat. Sci. 2010, 25, 289–310. [Google Scholar] [CrossRef]
  7. Babcock, A.H.; White, B.J.; Renter, D.G.; Dubnicka, S.R.; Scott, H.M. Predicting cumulative risk of bovine respiratory disease complex (BRDC) using feedlot arrival data and daily morbidity and mortality counts. Can. J. Vet. Res. 2013, 77, 33–44. [Google Scholar]
  8. Amrine, D.E.; McLellan, J.G.; White, B.J.; Larson, R.L.; Renter, D.G. Evaluation of three classification models to predict risk class of cattle cohorts developing bovine respiratory disease within the first 14 days on feed using on-arrival and/or pre-arrival information. Comput. Electron. Agric. 2019, 156, 436–439. [Google Scholar] [CrossRef]
  9. Burciaga-Robles, L.O.; Holland, B.P.; Step, D.L.; Krehbiel, C.R.; McMillen, G.L.; Richards, C.J.; Sims, L.E.; Jeffers, J.D.; Namjou, K.; McCann, P.J. Evaluation of breath biomarkers and serum haptoglobin concentration for diagnosis of bovine respiratory disease in heifers newly arrived at feedlot. Am. J. Vet. Res. 2009, 270, 1291–1298. [Google Scholar] [CrossRef]
  10. Amrine, D.E.; White, B.J.; Larson, R.L. Comparison of classification algorithms to predict outcomes of feedlot cattle identified and treated for bovine respiratory disease. Comput. Electron. Agric. 2014, 105, 9–19. [Google Scholar] [CrossRef] [Green Version]
  11. Wisnieski, L.C.; Amrine, D.E.; Cernicchiaro, N.; Sanderson, M.W.; Renter, D.G. Weather conditions associated with death attributed to bovine respiratory disease complex in high risk auction mark-sourced male beef calves. Am. J. Vet. Res. 2021, 82, 644–652. [Google Scholar] [CrossRef]
  12. Vogel, G.J.; Bokenkroger, C.D.; Rutten-Ramos, S.C.; Bargen, J.L. A retrospective evaluation of animal mortality in US feedlots: Rate, timing, and cause of death. Bov. Pract. 2015, 49, 113–123. [Google Scholar]
  13. Kuhn, M.; Johnson, K. Feature Engineering and Selection: A Practical Approach for Predictive Models; CRC Press: Boca Raton, FL, USA, 2019; ISBN 9781138079229. [Google Scholar]
  14. Sanderson, M.W.; Dargatz, D.A.; Wagner, B.A. Risk factors for initial respiratory disease in United States’ feedlots based on producer-collected daily morbidity counts. Can. Vet. J. 2008, 49, 373–378. [Google Scholar] [PubMed]
  15. Cernicchiaro, N.; White, B.J.; Renter, D.G.; Babcock, A.H.; Kelly, L.; Slattery, R. Associations between the distance traveled from sale barns to commercial feedlots in the United States and overall performance, risk of respiratory disease, and cumulative mortality n feeder cattle during 1997 to 2009. J. Anim. Sci. 2012, 90, 1929–1939. [Google Scholar] [CrossRef] [PubMed]
  16. Hay, K.E.; Morton, J.M.; Mahony, T.J.; Clements, A.C.A.; Barnes, T.S. Associations between animal characteristic and environmental risk factors and bovine respiratory disease in Australian feedlot cattle. Prev. Vet. Med. 2016, 125, 66–74. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  17. Theurer, M.E.; White, B.J.; Larson, R.L.; Holstein, K.K.; Amrine, D.E. Relationship between rectal temperature at first treatment for bovine respiratory disease complex in feedlot calves and the probability of not finishing the production cycle. J. Am. Vet. Med. Assoc. 2014, 245, 1279–1285. [Google Scholar] [CrossRef]
  18. Kayser, W.C.; Carstens, G.E.; Jackson, K.S.; Pinchak, W.E.; Banerjee, A.; Fu, Y. Evaluation of statistical process control procedures to monitor feeding behavior patterns and detect onset of bovine respiratory disease in growing bulls. J. Anim. Sci. 2009, 77, 1158. [Google Scholar] [CrossRef]
  19. White, B.J.; Larson, B.L. Impact of bovine respiratory disease in U.S. beef cattle. Anim. Health Res. Rev. 2020, 21, 132–134. [Google Scholar] [CrossRef]
  20. Wisnieski, L.; Amrine, D.E.; Renter, D.G. Predictive modeling of bovine respiratory disease outcomes in feedlot cattle: A narrative review. Livestock Sci. 2021, 241, e104666. [Google Scholar] [CrossRef]
  21. Brown-Brandl, T.M. Understanding heat stress in beef cattle. R. Bras. Zootec. 2018, 47, e20160414. [Google Scholar] [CrossRef] [Green Version]
  22. Cortese, V.S. Neonatal Immunology. Vet. Clin. Food. Anim. 2009, 25, 221–227. [Google Scholar] [CrossRef]
  23. Smith, D.R. Risk factors for bovine respiratory disease in beef cattle. Anim. Health Res. Rev. 2020, 21, 149–152. [Google Scholar] [CrossRef] [PubMed]
  24. Rutton-Ramos, S.C.; Simjee, S.; Calvo-Lorenzo, M.S.; Bargen, J.L. Population-level analysis of antibiotic use and death rates in beef feedlots over ten years in three cattle feeding regions of the United States. J. Am. Vet. Med. Assoc. 2021, 259, 1344–1350. [Google Scholar] [CrossRef] [PubMed]
  25. Damtew, A.; Erega, Y.; Ebrahim, H.; Tsegaye, S.; Msigie, D. The effect of long-distance transportation stress on cattle: A review. Biomed. J. Sci. Technol. Res. 2018, 2, 1–5. [Google Scholar]
  26. Step, D.L.; Krehbiel, C.R.; DePra, H.A.; Cranston, J.J.; Fulton, R.W.; Kirkpatrick, J.G.; Gill, D.R.; Payton, M.E.; Monteglono, M.A.; Confer, A.W. Effects of commingling beef calves from different sources and weaning protocols during a forty-two-day receiving period on performance and bovine respiratory disease. J. Anim. Sci. 2008, 86, 3146–3158. [Google Scholar] [CrossRef]
  27. Rudis, B. Darksky: Tools to Work with the ‘Dark Sky’ ‘API.’ R Package Version 1.3.0. 2017. Available online: https://CRAN.R-project.org/package=darksky (accessed on 1 August 2019).
  28. Dark Sky. Dark Sky API. The Dark Sky Company, LLC. Available online: https://darksky.net/dev (accessed on 1 August 2019).
  29. MapQuest. Geocoding API. Available online: https://developer.mapquest.com/documentation/geocoding-api (accessed on 1 August 2019).
  30. Voorheis, J. mqtime: A Stata Tool for calculating travel time and distance using MapQuest Webservices. Stata J. 2015, 15, 845–853. [Google Scholar] [CrossRef] [Green Version]
  31. Kuhn, M. Building Predictive Models in R Using the caret Package. J. Stat. Soft. 2008, 28, 1–26. [Google Scholar] [CrossRef] [Green Version]
  32. Kuhn, M.; Johnson, K. Applied Predictive Modeling; Springer Science and Business: New York, NY, USA, 2016; ISBN 978-1461468486. [Google Scholar]
Table 1. Descriptive statistics of categorical variables in purchase-group level data from a large commercial beef feeding operation in the Midwestern United States (n = 14,217 purchase groups).
Table 1. Descriptive statistics of categorical variables in purchase-group level data from a large commercial beef feeding operation in the Midwestern United States (n = 14,217 purchase groups).
VariableCategoryn(%)
SexFemale285 (2.0)
Male13,932 (98.0)
AgeCalf and mixed11,173 (78.6)
Yearling3044 (21.4)
Weaned statusUnweaned and mixed2082 (14.6)
Weaned12,135 (85.4)
Purchase seasonSpring4476 (31.5)
Summer2091 (14.7)
Fall2973 (20.9)
Winter4677 (32.9)
Geographic originCanada or North US227 (1.6)
South US or mixed13,990 (98.4)
SourceAuction and mixed13,588 (95.6)
Contracted629 (4.4)
Purchase year2015992 (7.0)
20164441 (31.2)
20175273 (37.1)
20183508 (24.7)
20193 (0.02)
Day of week purchasedMonday1777 (12.5)
Tuesday2936 (20.7)
Wednesday3486 (24.5)
Thursday3176 (22.3)
Friday1962 (13.8)
Saturday/Sunday880 (6.2)
Week of month purchasedDays 1–73376 (23.8)
Days 8–143624 (25.5)
Days 15–213503 (24.6)
Day 22 until EOM3714 (26.1)
PrecipitationYes573 (4.0)
No13,644 (96.0)
Mortality0%4131 (29.1)
0% to ≤2%3007 (21.2)
2% to <5%3532 (24.8)
≥5%3547 (25.0)
Table 2. Descriptive statistics of numerical variables in purchase-group level data from a large commercial beef feeding operation in the Midwestern United States (n = 14,217 purchase groups).
Table 2. Descriptive statistics of numerical variables in purchase-group level data from a large commercial beef feeding operation in the Midwestern United States (n = 14,217 purchase groups).
VariableMean (SD)
Average purchase weight (kg)307.8 (57.2)
Head (Number of cattle per purchase group)60.6 (49.2)
Shipping distance (km)510.5 (302.6)
Relative humidity0.69 (0.14)
Windspeed (km/h)10.6 (6.1)
Maximum temperature (Celsius)15.1 (12.5)
Table 3. Description of predictors used in random forest models predicting all-cause mortality among purchase groups of feedlot cattle.
Table 3. Description of predictors used in random forest models predicting all-cause mortality among purchase groups of feedlot cattle.
PredictorDescriptionCoding 1
AgeAge of purchase group1 = Calf and mixed, 2 = Yearling
Weaned statusWeaning status of purchase group1 = Mixed and unweaned, 2 = Weaned
SexSex of purchase group1 = Heifer, 2 = Steer
OriginGeographic region where purchased1 = Canada or north US, 2 = South or mixed
Average purchase weightAverage purchase weight of purchase groupContinuous
MonthMonth purchased1 = January, 2 = February, … 12 = December
Day of weekDay of week purchased1 = Monday, 2 = Tuesday, … 6 = Saturday/Sunday
Week of monthWeek of month purchased1 = Days 1–7, 2 = Days 8–14, 3 = 15–21, 4 = Day 22 until end of month
SeasonSeason when purchased1 = Spring (March, April, May), 2 = Summer (June, July, August), 3 = Fall (September, October, November), 4 = Winter (December, January, February)
StateState where purchased, states with small sample size grouped together based on location and form regions1 = CO, 2 = FL, GA, AL, MS, SC, 3 = ID, NV, OR, UT, CA, WY, MT, 4 = IL, OH, WI, 5 = IA, 6= KS, 7 = KY, 8 = MN, 9 = MS, AR, 10 = NE, 11 = ND, 12 = OK, TX, 13 = SD, 14 = TN, 15 = N/A (Canada)
SourcePurchase source1 = Auction and mixed, 2 = contracted
Shipping distanceDistance from purchase location to backgrounding/feedlot locationContinuous
HeadNumber of cattle in purchase groupContinuous
Relative humidityHumidity on purchase dayContinuous
WindspeedWindspeed on purchase dayContinuous
Maximum temperatureMaximum temperature on purchase dayContinuous
PrecipitationPrecipitation on purchase day0 = There was no precipitation, 1 = There was precipitation
1 Variables inputted in the final models in the format presented in this table.
Table 4. Description of root mean squared error (RMSE), accuracy, and variable importance of random forest models predicting all-cause mortality among purchase groups of feedlot cattle in weeks after arrival at a feeding location (n = 4250 purchase groups).
Table 4. Description of root mean squared error (RMSE), accuracy, and variable importance of random forest models predicting all-cause mortality among purchase groups of feedlot cattle in weeks after arrival at a feeding location (n = 4250 purchase groups).
ModelRMSE 1Accuracy within 0.25%Accuracy within 0.5%Most Important Variable2nd Most Important Variable3rd Most Important Variable
Week 10.00996.16%96.38%Max temperatureWindspeedShipping distance
Week 20.01193.86%94.26%Max temperaturePurchase weightSeason (Winter)
Week 30.01392.66%93.06%Purchase weightRegion (Southern US)Region (NW US)
Week 40.0192.00%92.56%Number of cattlePurchase weightWindspeed
Week 50.00991.46%92.21%Weaned statusPurchase weightRegion (Midwestern US)
Week 60.0191.76%92.14%Purchase weightAgeMonth (September)
Day 43 to slaughter0.04163.51%66.96%Purchase weightMax temperatureAge
1 RMSE reported in decimals. To convert to % mortality, multiply by 100. Max = Maximum; NW = Northwestern.
Table 5. Mean RMSE 1 of random forest models predicting all-cause mortality among purchase groups of feedlot cattle in weeks after arrival at a feeding location in test dataset stratified by purchase group characteristics (n = 4250 purchase groups).
Table 5. Mean RMSE 1 of random forest models predicting all-cause mortality among purchase groups of feedlot cattle in weeks after arrival at a feeding location in test dataset stratified by purchase group characteristics (n = 4250 purchase groups).
VariableCategoryWeek 1Week 2Week 3Week 4Week 5Week 6Day 43 to SlaughterAverage
SexFemale (n = 82)0.0270.0300.0090.0070.0070.0050.0410.018
Male (n = 4, 168)0.0080.0100.0130.0100.0090.0100.0410.014
AgeCalf and mixed (n = 3338)0.0100.0110.0130.0100.0100.0110.0440.016
Yearling (n = 912)0.0070.0080.0130.0060.0080.0070.0270.011
Weaned statusUnweaned 2 (n = 627)0.0130.0180.0230.0180.0150.0190.0680.025
Weaned (n = 3623)0.0080.0090.0100.0070.0080.0080.0350.012
Purchase weight 3≤255.8 (n = 1067)0.0130.0150.0150.0130.0100.0090.0500.018
255.8 to ≤302.1 (n = 1063)0.0040.0100.0110.0080.0080.0130.0430.014
302.1 to ≤346.5 (n = 1061)0.0080.0080.0120.0100.0100.0100.0400.014
>346.5 (n = 1059)0.0090.0090.0130.0060.0090.0070.0290.012
Purchase seasonSpring (n = 1299)0.1000.0130.0120.0090.0070.0060.0390.027
Summer (n = 608)0.0070.0040.0090.0090.0100.0080.0320.011
Fall (n = 909)0.0120.0130.0150.0120.0130.0160.0440.018
Winter (n = 1434)0.0060.0090.0130.0090.0080.0080.0550.015
Geographic originCanada or North US (n = 58)0.0020.0020.0060.0030.0050.0030.0270.007
South US or mixed (n = 4192)0.0090.0110.0130.0100.0090.0100.0410.015
SourceAuction and mixed (n = 4061)0.0090.0110.0130.010.0100.0100.0420.015
Contracted (n = 189)0.0020.0020.0040.0050.0040.0030.0160.005
Head≤27 (n = 1078)0.0150.0170.0210.0140.0150.0170.0700.024
27 to ≤53 (n = 1058)0.0070.0110.0100.0090.0080.0080.0300.012
53 to ≤77 (n = 1058)0.0040.0050.0070.0090.0050.0050.0230.008
>77 (n = 1056)0.0040.0040.0070.0050.0040.0050.0210.007
Purchase year 42015 (n = 318)0.0140.0110.0140.0140.0160.0140.0540.020
2016 (n = 1369)0.0090.0110.0130.0110.0100.0120.0470.016
2017 (n = 1030)0.0100.0110.0110.0070.0080.0080.0340.013
2018 (n = 1030)0.0040.0100.0140.0090.0080.0080.0370.013
Weekly mortality0%0.0010.0030.0030.0030.0030.0020.0260.006
0% to ≤2%0.0120.0110.0110.0100.0100.0110.0130.011
2% to <5%0.0310.0270.0300.0280.0270.0290.0120.026
≥5%0.1220.1090.1190.0830.0810.0980.0860.100
1 RMSE reported in decimals. To convert to % mortality, multiply by 100. 2 Unweaned and mixed purchase groups. 3 Average purchase weight (kg). 4 The year 2019 was omitted from table due to small sample size (n = 2 purchase groups).
Table 6. Accuracy within 0.25% of weekly mortality from random forest models predicting all-cause mortality among purchase groups of feedlot cattle in weeks after arrival at a feeding location in a test dataset stratified by purchase group characteristics (n = 4250 purchase groups).
Table 6. Accuracy within 0.25% of weekly mortality from random forest models predicting all-cause mortality among purchase groups of feedlot cattle in weeks after arrival at a feeding location in a test dataset stratified by purchase group characteristics (n = 4250 purchase groups).
VariableCategoryWeek 1Week 2Week 3Week 4Week 5Week 6Day 43 to SlaughterAverage
SexFemale (n = 82)92.7%89.0%92.7%89.0%96.3%93.9%78.1%90.2%
Male (n = 4168)96.2%94.0%92.7%92.1%91.4%91.7%63.2%88.8%
AgeCalf and mixed (n = 3338)96.3%93.8%92.0%91.3%90.5%90.6%61.9%88.1%
Yearling (n = 912)95.6%94.0%95.1%94.7%94.9%95.9%69.5%91.4%
Weaned statusUnweaned 1 (n = 627)94.3%91.2%86.8%86.3%83.9%85.5%57.3%83.6%
Weaned (n = 3623)96.5%94.3%93.7%93.0%92.8%92.9%64.5%89.7%
Purchase weight 2≤255.8 (n = 1067)95.2%93.1%90.1%89.5%88.6%89.6%56.8%86.1%
255.8 to ≤302.1 (n = 1063)96.6%93.7%92.9%92.6%91.1%91.1%61.4%88.5%
302.1 to ≤346.5 (n = 1061)96.9%94.2%92.7%91.0%91.2%90.8%65.5%88.9%
>346.5 (n = 1059)95.9%94.5%95.0%95.0%95.0%95.7%70.4%91.6%
Purchase seasonSpring (n = 1299)96.9%95.2%94.7%94.8%93.8%93.6%62.1%90.2%
Summer (n = 608)96.9%95.6%92.3%91.1%87.2%90.5%63.5%88.2%
Fall (n = 909)94.7%91.0%89.6%88.7%88.9%87.8%65.6%86.6%
Winter (n = 1434)96.1%93.7%93.0%91.9%92.8%93.2%63.5%89.2%
Geographic originCanada or North US (n = 58)98.3%96.6%89.7%93.1%93.1%96.6%70.7%91.2%
South US or mixed (n = 4192)96.1%93.8%92.7%92.0%91.4%91.7%63.4%88.7%
SourceAuction and mixed (n = 4061)96.2%93.8%92.5%91.9%91.3%91.7%62.9%88.6%
Contracted (n = 189)95.2%95.8%96.8%94.7%94.7%94.2%77.3%92.7%
Head 3≤27 (n = 1078)98.3%97.3%96.0%96.9%95.7%96.3%65.8%92.3%
27 to ≤53 (n = 1058)96.2%94.8%93.2%92.2%91.9%92.2%61.0%88.8%
53 to ≤77 (n = 1058)96.0%92.6%92.1%91.3%91.5%91.9%64.5%88.6%
>77 (n = 1056)94.0%90.6%89.3%87.5%87.7%86.7%62.8%85.5%
Purchase year 42015 (n = 318)94.0%90.6%89.9%85.9%86.2%87.4%60.1%84.9%
2016 (n = 1369)95.8%93.4%92.6%91.8%91.8%92.3%68.0%89.4%
2017 (n = 1030)96.9%94.4%92.5%93.1%91.8%91.0%64.4%89.2%
2018 (n = 1030)96.2%94.8%93.9%92.6%92.0%93.6%57.4%88.6%
Weekly mortality0%100.0%100.0%100.0%100.0%100.0%100.0%100.0%100.0%
0% to ≤2%2.9%0.6%7.4%5.2%5.8%2.7%95.0%17.1%
2% to <5%0.0%0.0%0.0%0.0%0.0%0.0%38.8%5.5%
≥5%0.0%0.0%0.0%0.0%0.0%0.0%0.1%0.0%
1 Unweaned and mixed purchase groups. 2 Average purchase weight (kg). 3 Head is the number of cattle in the purchase group at time of purchase. 4 The year 2019 was omitted from table due to small sample size (n = 2 purchase groups).
Table 7. Sensitivity and specificity 1 of detecting low mortality (0% to 0.25% mortality or 0% to 0.50% mortality) purchase groups of feedlot cattle in weeks after arrival at a feeding location (n = 4250 purchase groups) 2.
Table 7. Sensitivity and specificity 1 of detecting low mortality (0% to 0.25% mortality or 0% to 0.50% mortality) purchase groups of feedlot cattle in weeks after arrival at a feeding location (n = 4250 purchase groups) 2.
Ability to Detect Purchase Groups with 0–0.25% Mortality
Week 1Week 2Week 3Week 4Week 5Week 6Day 43 to slaughter
Sensitivity98.3%83.2%69.1%69.6%71.6%82.8%0.0%
Specificity9.8%33.6%51.7%51.2%56.8%30.1%100.0%
Ability to Detect Purchase Groups with 0–0.50% Mortality
Week 1Week 2Week 3Week 4Week 5Week 6Day 43 to slaughter
Sensitivity98.3%83.2%69.2%69.7%71.7%82.8%0.0%
Specificity10.1%34.5%53.6%53.0%58.7%30.6%100.0%
1 Sensitivity was calculated as the percent of low mortality (0% to 0.25% mortality or 0% to 0.50% mortality) in purchase groups that were accurately predicted to have low mortality. Specificity was calculated as the percent of purchase groups that did not have low mortality that was accurately identified as not having low mortality. 2 Additional details of the models are presented in Table 4, Table 5 and Table 6.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Wisnieski, L.; Amrine, D.E.; Renter, D.G. Predictive Models for Weekly Cattle Mortality after Arrival at a Feeding Location Using Records, Weather, and Transport Data at Time of Purchase. Pathogens 2022, 11, 473. https://doi.org/10.3390/pathogens11040473

AMA Style

Wisnieski L, Amrine DE, Renter DG. Predictive Models for Weekly Cattle Mortality after Arrival at a Feeding Location Using Records, Weather, and Transport Data at Time of Purchase. Pathogens. 2022; 11(4):473. https://doi.org/10.3390/pathogens11040473

Chicago/Turabian Style

Wisnieski, Lauren, David E. Amrine, and David G. Renter. 2022. "Predictive Models for Weekly Cattle Mortality after Arrival at a Feeding Location Using Records, Weather, and Transport Data at Time of Purchase" Pathogens 11, no. 4: 473. https://doi.org/10.3390/pathogens11040473

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop