Schulthess, URodrigues Jr, FTaymans, MBellemans, NBontemps, SOrtiz-Monasterio, IGérard, BDefourny, P2023-05-112023-01-192023-02-012023-01-178W0AZ (isidoc)https://hdl.handle.net/10182/16107Sen2-Agri is a software system that was developed to facilitate the use of multi-temporal satellite data for crop classification with a random forest (RF) classifier in an operational setting. It automatically ingests and processes Sentinel-2 and LandSat 8 images. Our goal was to provide practitioners with recommendations for the best sample size and composition. The study area was located in the Yaqui Valley in Mexico. Using polygons of more than 6000 labeled crop fields, we prepared data sets for training, in which the nine crops had an equal or proportional representation, called Equal or Ratio, respectively. Increasing the size of the training set improved the overall accuracy (OA). Gains became marginal once the total number of fields approximated 500 or 40 to 45 fields per crop type. Equal achieved slightly higher OAs than Ratio for a given number of fields. However, recall and F-scores of the individual crops tended to be higher for Ratio than for Equal. The high number of wheat fields in the Ratio scenarios, ranging from 275 to 2128, produced a more accurate classification of wheat than the maximal 80 fields of Equal. This resulted in a higher recall for wheat in the Ratio than in the Equal scenarios, which in turn limited the errors of commission of the non-wheat crops. Thus, a proportional representation of the crops in the training data is preferable and yields better accuracies, even for the minority crops.18 pagesen© 2023 by the authors. Licensee MDPI, Basel, Switzerland.agriculturecrop classificationmachine learningrandom forestremote sensingsample sizeOptimal sample size and composition for crop classification with Sen2-Agri’s random forest classifierJournal Article10.3390/rs150306082072-42922023-03-30ANZSRC::300206 Agricultural spatial analysis and modellingANZSRC::300202 Agricultural land managementANZSRC::401304 Photogrammetry and remote sensingANZSRC::3701 Atmospheric sciencesANZSRC::3709 Physical geography and environmental geoscienceANZSRC::4013 Geomatic engineeringhttps://creativecommons.org/licenses/by/4.0/Attribution