Majumdar, A., Bakirov, R., Hodges, D., McCullough, S. and Rees, T., 2024. A multi-season machine learning approach to examine the training load and injury relationship in professional soccer. Journal of Sports Analytics, 10 (1), 47-65.
Full text available as:
PDF (OPEN ACCESS ARTICLE): jsa_2024_10-1_jsa-10-1-jsa240718_jsa-10-jsa240718.pdf - Published Version. Available under License Creative Commons Attribution Non-commercial. 1MB
PDF: Manuscript (MS718) - Final Version.pdf - Accepted Version. Restricted to Repository staff only. Available under License Creative Commons Attribution Non-commercial. 2MB
Copyright to original material in this document is with the original owner(s). Access to this content through BURO is granted on condition that you use it only for research, scholarly or other non-commercial purposes. If you wish to use it for any other purposes, you must contact BU via BURO@bournemouth.ac.uk. Any third party copyright material in this document remains the property of its respective owner(s). BU grants no licence for further use of that third party material.
DOI: 10.3233/JSA-240718
Abstract
OBJECTIVES: The purpose of this study was to use machine learning to examine the relationship between training load and soccer injury with a multi-season dataset from one English Premier League club.
METHODS: Participants were 35 male professional soccer players (aged 25.79±3.75 years, range 18–37 years; height 1.80±0.07 m, range 1.63–1.95 m; weight 80.70±6.78 kg, range 66.03–93.70 kg), with data collected from the 2014–2015 season until the 2018–2019 season. A total of 106 training load variables (40 GPS data, 6 personal information, 14 physical data, 4 psychological data and 14 ACWR, 14 MSWR and 14 EWMA data) were examined in relation to 133 non-contact injuries, with a high imbalance ratio of 0.013.
RESULTS: XGBoost and Artificial Neural Network were implemented to train the machine learning models using four and a half seasons’ data, with the developed models subsequently tested on the following half season’s data. During the first four and a half seasons, there were 341 injuries; during the next half season there were 37 injuries. To interpret and visualize the output of each model and the contribution of each feature (i.e., training load) towards the model, we used the Shapley Additive Explanations (SHAP) approach. Of 37 injuries, XGBoost correctly predicted 26 injuries, with recall and precision of 73% and 10% respectively. Artificial Neural Network correctly predicted 28 injuries, with recall and precision of 77% and 13% respectively. In the model using Artificial Neural Network (the relatively more accurate model), last injury area and weight appeared to be the most important features contributing to the prediction of injury.
CONCLUSIONS: This was the first study of its kind to use Artificial Neural Network and a multi-season dataset for injury prediction. Our results demonstrate the potential to predict injuries with high recall, thereby identifying most of the injury cases, albeit, due to high class imbalance, precision suffered. This approach to using machine learning provides potentially valuable insights for soccer organizations and practitioners when monitoring load injuries.
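The trade-off the abstract describes, high recall alongside low precision under class imbalance, can be illustrated with a minimal sketch. The function name `precision_recall` and the counts below are hypothetical and chosen only for illustration; they are not taken from the paper, and this is not the authors' evaluation code.

```python
def precision_recall(true_positives, false_positives, false_negatives):
    """Return (precision, recall) computed from confusion-matrix counts."""
    predicted_positive = true_positives + false_positives
    actual_positive = true_positives + false_negatives
    # Guard against division by zero when a class is never predicted or never occurs.
    precision = true_positives / predicted_positive if predicted_positive else 0.0
    recall = true_positives / actual_positive if actual_positive else 0.0
    return precision, recall


# Hypothetical counts (NOT from the paper): injuries are rare, so even a
# modest false-alarm rate on the many non-injury cases swamps the true positives.
tp, fn, fp = 30, 10, 270
precision, recall = precision_recall(tp, fp, fn)
print(f"precision={precision:.2f}, recall={recall:.2f}")
```

With these illustrative counts the model finds most of the positive cases (recall 0.75) while only one in ten alerts is correct (precision 0.10), which is the general pattern reported in the abstract.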
Item Type: | Article |
---|---|
ISSN: | 2215-020X |
Uncontrolled Keywords: | Soccer injury; predictive analytics; machine learning; English premier league; artificial neural network |
Group: | Faculty of Health & Social Sciences |
ID Code: | 39595 |
Deposited By: | Symplectic RT2 |
Deposited On: | 14 Mar 2024 09:30 |
Last Modified: | 24 Apr 2024 13:10 |