Commenced in January 2007
Frequency: Monthly
Edition: International
Paper Count: 32727
Injury Prediction for Soccer Players Using Machine Learning

Authors: Amiel Satvedi, Richard Pyne


Injuries in professional sports occur on a regular basis. Some may be minor while others can cause huge impact on a player’s career and earning potential. In soccer, there is a high risk of players picking up injuries during game time. This research work seeks to help soccer players reduce the risk of getting injured by predicting the likelihood of injury while playing in the near future and then providing recommendations for intervention. The injury prediction tool will use a soccer player’s number of minutes played on the field, number of appearances, distance covered and performance data for the current and previous seasons as variables to conduct statistical analysis and provide injury predictive results using a machine learning linear regression model.

Keywords: Injury predictor, soccer injury prevention, machine learning in soccer, big data in soccer.

Procedia APA BibTeX Chicago EndNote Harvard JSON MLA RIS XML ISO 690 PDF Downloads 1627


[1] B. Alamar, Sports analytics: a guide for coaches, managers, and other decision makers. New York: Columbia University Press, 2013.
[2] E. Alpaydın, Introduction to machine learning second edition. Cambridge, MA: The MIT Press, 2014.
[3] R. J. Bar and J. F. X. DeSouza, “Tracking Plasticity: Effects of Long-Term Rehearsal in Expert Dancers Encoding Music to Movement,” PlosOne, 29-Jan-2016.
[4] F. Wunderlich and D. Memmert, “The Betting Odds Rating System: Using soccer forecasts to forecast soccer,” PlosOne, 05-June-2018.
[5] K. Pelechrinis and E. Papalexakis, “The Anatomy of American Football: Evidence from 7 Years of NFL Game Data,” PlosOne, 22-Dec-2016.
[6] M. Bush, C. Barnes, D. T. Archer, B. Hogg, and P. S. Bradley, “Evolution of match performance parameters for various playing positions in the English Premier League,” ScienceDirect, Feb-2015.
[7] B. Pang, L. Lee, and S. Vithyanathan, “Thumbs up? Sentiment Classification using Machine Learning” Association for Computational Linguistics, July-2002.
[8] K. Ozcan, A. Mahabalagiri, and S. Velipasalar, “Autonomous tracking and counting of footsteps by mobile phone cameras,” An introduction to biometric recognition - IEEE Journals & Magazine, Nov-2015.
[9] J. Castellano, D. Casamichana, and C. Lago, “The Use of Match Statistics that Discriminate Between Successful and Unsuccessful Soccer Teams,” Economics and Culture, 23-Oct-2018.
[10] H. Chen, P. B. Rinde, L. She, S. Sutjahjo, C. Sommer, and D. Neely, “Expert Prediction, symbolic learning, and neural networks,” IEE-Expert Volume 9 Issue 6, Dec-1994.
[11] V. Di Salvo, W. Gregson, G. Atkinson, P. Tordoff, and B. Drust, “Analysis of high intensity activity in Premier League soccer,” Current neurology and neuroscience reports., Mar-2009.
[12] Roiger R. (2016). Basic Data Mining Techniques. In Cohen R. (Ed.), Data Mining a Tutorial-Based Primer (pp. 64-95). Boca Raton, Florida: CRC Press.
[13] A. Heuer and O. Rubner, “Optimizing the Prediction Process: From Statistical Concepts to the Case Study of Soccer,” PlosOne, 08-Sept-2014.
[14] T. Nishimoto, K. Mukaigawa, S. Tominaga, N. Lubbe, T. Kiuchi, T. Motomura, and H. Matsumoto, “Serious injury prediction algorithm based on large-scale data and under-triage control,” ScienceDirect, Jan 2017.
[15] Nationwide Children's Hospital. “Injuries to High School Baseball Players Becoming More Serious,” ScienceDaily, 03-Jun-2008.
[16] J. Kang and D.-S. E. Joonbeom Lee, “Smartphone-Based Traveled Distance Estimation Using Individual Walking Patterns for Indoor Localization,” American Journal of Drug and Alcohol Abuse, 18-Sep-2018.
[17] O.-S. Kwon and D. C. Knill, “The brain uses adaptive internal models of scene statistics for sensorimotor estimation and planning,” PNAS, 12-Mar-2013.
[18] C. Lago-Penas, J. Lago-Ballesteros, A. Dellal, and M. Gomez, “Game-Related Statistics that Discriminated Winning, Drawing and Losing Teams from the Spanish Soccer League,” Journal of Sports Science and Medicine, 01-June-2010.
[19] G. D. Myer, K. R. Ford, J. Khoury, P. Succop, and T. E. Hewett, “Biomechanics laboratory-based prediction algorithm to identify female athletes with high knee loads that increase risk of ACL injury,” British Journal of Sports Medicine, 17-June-2010.
[20] D. Memmert and R. Rein, “Match Analysis, Big Data and Tactics: Current Trends in Elite Soccer,” Research Gate, March-2018.
[21] D. Memmert and J. Perl, “Game creativity analysis using neural networks,” Journal for Sports Sciences, 19-Jan-2009.
[22] AFS Enterprises. (2020). English Premier League Statistics of Player’s Age filtered by club (Data File). Retrieved from
[23] Transfermarkt GmbH & Co. KG. (2020). English Premier League Statistics of Player’s Injury History (Data File). Retrieved from
[24] Premier League. (2020). English Premier League Statistics of Player Attributes (Data File). Retrieved from
[25] Fenn, A. (2017, September 22). Premier League v amateur fitness. FourFourTwo.