Journal of Data Science logo


Login Register

  1. Home
  2. To appear
  3. Use of Farm Equipment Machine-Logged Dat ...

Journal of Data Science

Submit your article Information
  • Article info
  • Related articles
  • More
    Article info Related articles

Use of Farm Equipment Machine-Logged Data to Inform Crop Production Statistics✩
Sean Rhodes   David M. Johnson   Luca Sartore ORCID icon link to view author Luca Sartore details     All authors (6)

Authors

 
Placeholder
https://doi.org/10.6339/26-JDS1235
Pub. online: 8 June 2026      Type: Data Science In Action      Open accessOpen Access

✩ The findings and conclusions in this presentation are those of the authors and should not be construed to represent any official USDA or US Government determination or policy.

Received
29 September 2025
Accepted
29 May 2026
Published
8 June 2026

Abstract

The United States Department of Agriculture’s (USDA’s) National Agricultural Statistics Service (NASS) conducted a pilot study in 2024 to obtain data collected onboard farm machinery and explore their uses for statistical purposes. NASS has recognized high value in these machine-logged data (MLD) systems as they can potentially augment, or even replace, traditional survey efforts while providing additional benefits of reducing respondent burden and improving crop-related estimates. This pilot study ultimately addressed four topics: 1) understanding the obstacles in obtaining MLD from farmers; 2) creating geographic workflows to manage inherent geospatial MLD; 3) developing the linkages to NASS’s tabular list frame information; and 4) assessing the use of MLD to replace survey data for time-sensitive estimates. To study each topic, field-level information was gathered from the MLD systems of dozens of producers over hundreds of fields across the central United States (US) for the 2023 growing season. Results showed that 90% of the fields could be linked to a producer on the NASS list frame. Of those producers, the consistency of MLD versus traditional survey reporting was highly variable for those who were selected for a survey in 2023. Comparisons showed median MLD values were larger than historical NASS survey values. Approximately 48% of survey comparisons showed a difference of 25% or less between MLD and historical NASS survey values. MLD shows promise for use in official statistics; however, further analyses with additional producers’ data and enhancements to MLD collection processes are needed before supplementing traditional survey methods.

Supplementary material

 Supplementary Material
The supplementary material contains an R program that is made available in a zip file attached to this article.

References

 
Bradley R (1997). Potential Benefits from the Use of Scanner Data in the Construction of the CPI. In: 1996 ASA Proceedings.
 
Brimble P, McSharry P, Bachofer F, Bower J, Braun A (2020). Using machine learning and remote sensing to value property in Rwanda.
 
Collender RN, Chalfant JA (1986). An alternative approach to decisions under uncertainty using the empirical moment-generating function. American Journal of Agricultural Economics, 68(3): 727–731. https://doi.org/10.2307/1241557
 
De Finetti B (2017). Theory of Probability: A Critical Introductory Treatment. John Wiley & Sons.
 
Deines JM, Patel R, Liang SZ, Dado W, Lobell DB (2021). A million kernels of truth: Insights into scalable satellite maize yield mapping and yield gap analysis from an extensive ground dataset in the US corn belt. Remote Sensing of Environment, 253: 112174. https://doi.org/10.1016/j.rse.2020.112174
 
Dillman DA, Smyth JD, Christian LM (2014). Internet, Phone, Mail, and Mixed-Mode Surveys: The Tailored Design Method. John Wiley & Sons, Indianapolis, Indiana, 17.
 
Dumbacher B, Capps C (2016). Big data methods for scraping government tax revenue from the web. In: Proceedings of the Joint Statistical Meetings, Section on Statistical Learning and Data Science.
 
Groves RM, Fowler FJ, Couper MP, Lepkowski JM, Singer E, Tourangeau R (2004). Survey Methodology: Wiley Series in Survey Methodology. 43–198. John Wiley & Sons, New York, USA.
 
Heald J (2002). USDA establishes a common land unit. ArcUser Online.
 
Heberlein T, Baumgratner R (1978). Factors affecting response rates to mailed questionnaires: A quantitative analysis of the published literature. Am Sociological Review, 43: 447–462. https://doi.org/10.2307/2094771
 
Hollander M, Wolfe DA (1999). Nonparametric Statistical Methods. John Wiley & Sons, New York, NY, 2nd edition.
 
Hutchinson R, Scheleur S (2017). Using Big Data to Enhance US Census Bureau Economic Data Products. American Statistical Association, Alexandria, VA.
 
Hutchinson R, Scheleur S, Weidenhamer D (2023). Alternative data sources in the Census Bureau’s monthly state retail sales data product. Advances in Business Statistics, Methods and Data Collection, 593–611. https://doi.org/10.1002/9781119672333.ch26
 
Leeuw ED, Callegaro M, Hox J, Korendijk E, Lensvelt-Mulders G (2007). The influence of advance letters on response in telephone surveys: A meta-analysis. Public Opinion Quarterly, 71(3): 413–443. https://doi.org/10.1093/poq/nfm014
 
Longva S, Thomsen I, Severeide PI (1998). Reducing costs of censuses in Norway through use of administrative registers. International Statistical Review, 66(2): 223–234. https://doi.org/10.1111/j.1751-5823.1998.tb00415.x
 
McFadden J, Njuki E, Griffin T (2023). Precision agriculture in the digital era: Recent adoption on US farms.
 
Murphy T, Rosales A, Abreu D, Young L (2022). Automatic imputation for an area survey. In: Proceedings of the Joint Statistical Meetings, Survey Research Methods Section. American Statistical Association, Alexandria, VA.
 
Rhodes S, Rosales A, Sartore L, Murphy T (2023). Uncertainty assessment for the imputation of an area survey. In: Proceedings of the 2023 Joint Statistical Meetings. American Statistical Association.
 
Snijkers G, de Jong T, Lam C, van Meurs C (2024). System-to-System Data Collection in business surveys applied to an agricultural survey: Small-scale pilot results.
 
Snijkers G, Punt T, De Broe S, Pérez JG (2021). Exploring sensor data for agricultural statistics: The fruit is not hanging as low as we thought. Statistical Journal of the IAOS, 37(4): 1301–1314. https://doi.org/10.3233/SJI-200728
 
Thompson A VHS (2025). Assessing Engagement and Outreach Initiatives: A Pilot Study for the Grain Stocks Report.
 
Tran HN, Gerling MW, McCarthy JS, O’Connor TP (2012). The Effectiveness of Automated Reminder Messages Used on the 2007 Census of Agriculture and on the National Animal Health Monitoring System 2011 Small Producer Study.

Related articles PDF XML
Related articles PDF XML

Copyright
2026 The Author(s). Published by the School of Statistics and the Center for Applied Statistics, Renmin University of China.
by logo by logo
Open access article under the CC BY license.

Keywords
data linkage geospatial imputation MyAgDataⓇ precision agriculture unstructured data

Metrics
since February 2021
27

Article info
views

12

PDF
downloads

Export citation

Copy and paste formatted citation
Placeholder

Download citation in file


Share


RSS

Journal of data science

  • Online ISSN: 1683-8602
  • Print ISSN: 1680-743X

About

  • About journal
  • Renmin University of China homepage
  • Academic Journal Management
    and Development Center homepage

For contributors

  • Submit
  • OA Policy
  • Become a Peer-reviewer

Contact us

  • JDS@ruc.edu.cn
  • Contact person: Jing Zhou
  • Phone: +86-10-62511318
  • No. 59 Zhongguancun Street, Haidian District Beijing, 100872, P.R. China
Powered by PubliMill  •  Privacy policy