Prediction of Cattle Fever Tick Outbreaks in United States Quarantine Zone

<p><br></p> <p>[NOTE - 11/24/2021: this dataset supersedes an earlier version <a href="https://doi.org/10.15482/USDA.ADC/1518654" target="_blank">https://doi.org/10.15482/USDA.ADC/1518654</a> ]</p> <div><strong>Data sources</strong>. Time series data on cattle fever tick incidence, 1959-2020, and climate variables January 1950 through December 2020, form the core information in this analysis. All variables are monthly averages or sums over the fiscal year, October 01 (of the prior calendar year, <em>y</em>-1) through September 30 of the current calendar year (<em>y</em>). Annual records on monthly new detections of <em>Rhipicephalus microplus</em> and <em>R. annulatus</em> (cattle fever tick, CFT) on premises within the Permanent Quarantine Zone (PQZ) were obtained from the Cattle Fever Tick Eradication Program (CFTEP) maintained jointly by the United States Department of Agriculture (USDA), Animal Plant Health Inspection Service and the USDA Animal Research Service in Laredo, Texas. Details of tick survey procedures, CFTEP program goals and history, and the geographic extent of the PQZ are in the main text, and in the Supporting Information (SI) of the associated paper. Data sources on oceanic indicators, on local meteorology, and their pretreatment are detailed in SI.</div> <div><strong>Data pretreatment</strong>. To address the low signal-to-noise ratio and non-independence of observations common in time series, we transformed all explanatory and response variables by using a series of six consecutive steps: (i) First differences (year <em>y</em> minus year <em>y</em>-1) were calculated, (ii) these were then converted to <em>z</em> scores (<em>z</em> = (<em>x</em>- <em>μ</em>) / <em>σ</em>, where <em>x</em> is the raw value, <em>μ</em> is the population mean, <em>σ</em> is the standard deviation of the population), (iii) linear regression was applied to remove any directional trends, (iv) moving averages (typically 11-year point-centered moving averages) were calculated for each variable, (v) a lag was applied if/when deemed necessary, and (vi) statistics calculated (<em>r, n, df, P<, p<</em>).</div> <div><strong>Principal component analysis (PCA)</strong>. A matrix of <em>z</em>-score first differences of the 13 climate variables, and CFT (1960-2020), was entered into XLSTAT principal components analysis routine; we used Pearson correlation of the 14 x 60 matrix, and Varimax rotation of the first two components.</div> <div><strong>Autoregressive Integrated Moving Average (ARIMA)</strong>. An ARIMA (2,0,0) model was selected among 7 test models in which the <em>p</em>, <em>d</em>, and <em>q</em> terms were varied, and selection made on the basis of lowest RMSE and AIC statistics, and reduction of partial autocorrelation outcomes. A best model linear regression of CFT values on ARIMA-predicted CFT was developed using XLSTAT linear regression software with the objective of examining statistical properties (<em>r, n, df, P<, p<</em>), including the Durbin-Watson index of order-1 autocorrelation, and Cook’s Di distance index. Cross-validation of the model was made by withholding the last 30, and then the first 30 observations in a pair of regressions.</div> <div><strong>Forecast of the next major CFT outbreak</strong>. It is generally recognized that the onset year of the first major CFT outbreak was not 1959, but may have occurred earlier in the decade. We postulated the actual underlying pattern is fully 44 years from the start to the end of a CFT cycle linked to external climatic drivers. (SI Appendix, Hypothesis on CFT cycles). The hypothetical reconstruction was projected one full CFT cycle into the future. To substantiate the projected trend, we generated a power spectrum analysis based on 1-year values of the 1959-2020 CFT dataset using SYSTAT AutoSignal software. The outcome included a forecast to 2100; this was compared to the hypothetical reconstruction and projection. Any differences were noted, and the start and end dates of the next major CFT outbreak identified.</div> <p><br> Resources in this dataset:</p> <ul> <li>Resource Title: CFT and climate data. File Name: climate-cft-data2.csv Resource Description: Main dataset; see data dictionary for information on each column</li> <li>Resource Title: Data dictionary (metadata). File Name: climate-cft-metadata2.csv Resource Description: Information on variables and their origin</li> <li>Resource Title: fitted models. File Name: climate-cft-models2.xlsx Resource Software Recommended: Microsoft Excel,url: <a href="https://www.microsoft.com/en-us/microsoft-365/excel" target="_blank">https://www.microsoft.com/en-us/microsoft-365/excel; </a>XLSTAT,url: <a href="https://www.xlstat.com/en/" target="_blank">https://www.xlstat.com/en/; </a>SYStat Autosignal,url: <a href="https://www.systat.com/products/AutoSignal/" target="_blank">https://www.systat.com/products/AutoSignal/</a></li> </ul><p></p>

Saved in:
Bibliographic Details
Main Authors: Allan N. Auclair (17482044), Adalberto Pérez de León (17865914), Pete D. Teel (9412880), Nicholas Manoukis (17362237), Matthew T. Messenger (17482050), Denise L. Bonilla (13837979)
Format: Dataset biblioteca
Published: 2021
Subjects:Animal production, Animal welfare, Host-parasite interactions, Climatology, Environmental sciences, Cattle Fever Tick Prediction, cattle tick, disease, NP104, data.gov, ARS,
Online Access:https://figshare.com/articles/dataset/Novel_Hurricane_Hypothesis_Predicts_US_Cattle_Fever_Tick_Outbreaks/25012448
Tags: Add Tag
No Tags, Be the first to tag this record!