Evaluating Methods for Imputing Missing Data from Longitudinal Monitoring of Athlete Workload

J Sports Sci Med. 2021 Mar 5;20(2):188-196. doi: 10.52082/jssm.2021.188. eCollection 2021 Jun.

Abstract

Missing data can influence calculations of accumulated athlete workload. The objectives were to identify the best single imputation methods and examine workload trends using multiple imputation. External (jumps per hour) and internal (rating of perceived exertion; RPE) workload were recorded for 93 (45 females, 48 males) high school basketball players throughout a season. Recorded data were simulated as missing and imputed using ten imputation methods based on the context of the individual, team and session. Both single imputation and machine learning methods were used to impute the simulated missing data. The difference between the imputed data and the actual workload values was computed as root mean squared error (RMSE). A generalized estimating equation determined the effect of imputation method on RMSE. Multiple imputation of the original dataset, with all known and actual missing workload data, was used to examine trends in longitudinal workload data. Following multiple imputation, a Pearson correlation evaluated the longitudinal association between jump count and sRPE over the season. A single imputation method based on the specific context of the session for which data are missing (team mean) was only outperformed by methods that combine information about the session and the individual (machine learning models). There was a significant and strong association between jump count and sRPE in the original data and imputed datasets using multiple imputation. The amount and nature of the missing data should be considered when choosing a method for single imputation of workload data in youth basketball. Multiple imputation using several predictor variables in a regression model can be used for analyses where workload is accumulated across an entire season.

Keywords: Jump count; basketball; imputation; machine learning; training load.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adolescent
  • Basketball / physiology*
  • Data Interpretation, Statistical*
  • Female
  • Humans
  • Longitudinal Studies
  • Machine Learning
  • Male
  • Perception / physiology
  • Physical Conditioning, Human / physiology*
  • Physical Exertion / physiology
  • Workload