A multi-season machine learning approach to examine the training load and injury relationship in professional soccer.

Majumdar, A., Bakirov, R., Hodges, D., McCullough, S. and Rees, T., 2024. A multi-season machine learning approach to examine the training load and injury relationship in professional soccer. Journal of Sports Analytics, 10 (1), 47-65.

Full text available as:

PDF (OPEN ACCESS ARTICLE)
jsa_2024_10-1_jsa-10-1-jsa240718_jsa-10-jsa240718.pdf - Published Version
Available under License Creative Commons Attribution Non-commercial.

1MB
PDF
Manuscript (MS718) - Final Version.pdf - Accepted Version
Restricted to Repository staff only
Available under License Creative Commons Attribution Non-commercial.

2MB

DOI: 10.3233/JSA-240718

Abstract

OBJECTIVES: The purpose of this study was to use machine learning to examine the relationship between training load and soccer injury with a multi-season dataset from one English Premier League club.

METHODS: Participants were 35 male professional soccer players (aged 25.79±3.75 years, range 18–37 years; height 1.80±0.07 m, range 1.63–1.95 m; weight 80.70±6.78 kg, range 66.03–93.70 kg), with data collected from the 2014–2015 season until the 2018–2019 season. A total of 106 training load variables (40 GPS, 6 personal information, 14 physical, 4 psychological, 14 ACWR, 14 MSWR and 14 EWMA variables) were examined in relation to 133 non-contact injuries, with a high imbalance ratio of 0.013.

RESULTS: XGBoost and Artificial Neural Network models were trained on four and a half seasons' data and subsequently tested on the following half season's data. During the first four and a half seasons, there were 341 injuries; during the next half season there were 37 injuries. To interpret and visualize the output of each model and the contribution of each feature (i.e., training load) towards the model, we used the Shapley Additive Explanations (SHAP) approach. Of 37 injuries, XGBoost correctly predicted 26 injuries, with recall and precision of 73% and 10% respectively. Artificial Neural Network correctly predicted 28 injuries, with recall and precision of 77% and 13% respectively. In the Artificial Neural Network model (the relatively more accurate of the two), last injury area and weight appeared to be the most important features contributing to the prediction of injury.

CONCLUSIONS: This was the first study of its kind to use Artificial Neural Network and a multi-season dataset for injury prediction. Our results demonstrate the potential to predict injuries with high recall, thereby identifying most of the injury cases, although precision suffered because of the high class imbalance. This approach to using machine learning provides potentially valuable insights for soccer organizations and practitioners when monitoring load injuries.
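The trade-off between recall and precision reported above can be illustrated with a minimal sketch. It is not the authors' code: the function name `precision_recall` and the counts used are hypothetical, chosen only to show how a rare positive class can yield high recall alongside low precision.

```python
def precision_recall(tp, fp, fn):
    """Return (precision, recall) from confusion-matrix counts.

    tp: injuries correctly flagged
    fp: non-injury cases wrongly flagged as injuries
    fn: injuries the model missed
    """
    predicted_positive = tp + fp
    actual_positive = tp + fn
    # Guard against division by zero when nothing is flagged or nothing occurred.
    precision = tp / predicted_positive if predicted_positive else 0.0
    recall = tp / actual_positive if actual_positive else 0.0
    return precision, recall


# Hypothetical counts (assumption, not taken from the study):
# 40 true injuries, 30 of them flagged, plus 270 false alarms.
precision, recall = precision_recall(tp=30, fp=270, fn=10)
print(f"precision={precision:.2f}, recall={recall:.2f}")
# prints "precision=0.10, recall=0.75"
```

With so few injury cases relative to non-injury cases, even a modest false-alarm rate produces many more false positives than true positives, which lowers precision while recall stays high.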

Item Type: Article
ISSN: 2215-020X
Uncontrolled Keywords: Soccer injury; predictive analytics; machine learning; English premier league; artificial neural network
Group: Faculty of Health & Social Sciences
ID Code: 39595
Deposited By: Symplectic RT2
Deposited On: 14 Mar 2024 09:30
Last Modified: 24 Apr 2024 13:10
