Football Datasets

If you want to do your own number crunching on football results you need data. There are a lot of websites out there with results and statistics, but to get the information into a convenient format can be a hassle. Luckily there are some places where you can get good datasets available for download. Here I have collected some of them. I will update this list when I find more good sites. Also, if you know of some, feel free to mention them in the comments.

This list was last updated January 8th 2017.

football-data.co.uk
Match results and statistics from many European leagues and tournaments, England, Germany, Italy, Spain etc. With data all the way back to the 1990’s plus odds data from different bookmakers. The datasets are updated each week.
Format: csv, excel, zip
Link: http://www.football-data.co.uk/data.php

James Curley’s engsoccerdata
All English league matches from the very beginning in 1888 to the present, in the convenient form of a R package.
Format: csv, rdata
Link: engsoccerdata @ github

Stryktipsetisistastund.se
Match results from Swedish, Norwegian and Finish leagues in a format similar to football-data.co.uk.
Format: csv
Link: stryktipsetisistastund.se/data

Ouseful.info: Sports data and R
A list of R packages for sports and football analytics, including some packages that consists mostly of data sets.
Format: R packages
Link

Football stadium coordinates
Small data set compiled by me, with GPS coordinates for the home stadiums for about 130 European teams.
Format: csv
Link

Kaggle
A data set with details on 25k eurpean matches and 11k players. Registration required.
Format: csv
Link: European Soccer Database
Link: Open Data Spotlight: The Ultimate European Soccer Database | Hugo Mathien

WoSo Stats
WoSo Stats collects data for women’s soccer. Including detailed match event data.
Format: csv
Link

The Guardian
The Guardian sometimes post football related datasets, especially in connection with large football events. Player statistics from Opta, historical match results and more.
Format: Google Docs (can be downloaded as csv or excel)
Link: World Cup 2010
Link: EURO 2012

The Guardian – Top 100 players
Each year The Guardian asks 100+ experts to rank the best players in the world. The complete details of the voting is available as a spreadsheet.
Format: Google Docs (can be downloaded as csv or excel)
Link: 2012 (The full voting data is not available, but the 2012 ranking is available in the 2013 spreadsheet)
Link: 2013
Link: 2014
Link: 2015
Link: 2016

openfootball football.db
A large database of football results.
Format: database
Link

Openfooty API
An API which makes available results, player stats, lineups, tables et cetera. Registration required.
Format: XML, php, json
Link: Openfooty

MCFC Analytics
Manchester City and Opta released a ton of data on every player from the 2011-12 Premier League season. Registration required.
Format: Excel, xml
Link: MCFC Analytics

Kaggle
Some historical World Cup data. Registration required.
Format: csv
Link: World Cup 2010 – Take on the Quants
Link: World Cup 2010 – Confidence Challenge

16 thoughts on “Football Datasets

  1. Pingback: Getting started: Where do I get my data from? | stats and snakeoil

  2. Am looking for footaball data that contains the usual teams involved, match result plus stadium where match was played.

    • You can use the football stadium coordinates dataset i compiled. It contains stadium name and and coordiantes and should be easy to merge with a table of results. It miht be a bit outdated, but its a start.

  3. Dear Jonas,
    Thank you for your excellent blog that helps me a lot.
    I am interested in the effect of a new trainer on a teams performance. So, I would like to ask if there exists a historical dataset (as far back as possible) containing all trainers of each team that played in the premier league at some point in this period. I would just need 4 columns: Team, Name of the Coach, Date of hire, end of contract. However, if you have something similar but for another country I would be interested too.
    Thank you very much in advance
    Best Regards,
    Stefan

  4. Every time I go looking for something to do with football/sports modelling, I end up back here.

    Amazing work – thanks for all your efforts!

  5. Hi. Love your blog! I’m looking for a dataset of English premier league results, with the time of all goals scored shown. I’m a bit of a novice so ideally in csv so I can stick it into excel for cleaning and analysis (I’m basically looking at which teams have lost most points from late goals). Thanks if you can help!

  6. Hi, I am looking for a data set which has a collection of transfers, including details about the player (Position, Nationality, Age, Basic performance data), and details about the transer (The fee in particular)

    I’m also interested in data about player wages, aswell as their details again. (Wages, Position, Club, Age) etc etc

    Thanks in advance, great job on the site!

  7. Hey Jonas,

    first of all, thanks for the information offered in your blog!

    Im writing my bachelor thesis about Elo Ratings at the moment which contains statistical analysis I have to do, whether specific hypothesis about Elo Ratings are consistent with real world data or not. For my research, I will need the end results (tables) of the top five European leagues for the last 25-50 years. So far I haven’t found any useful datasets to work with. Do you maybe have a clew?

    Best Regards, Marlon

  8. I would like to obtain Team Data from about 40 leagues across the world. The applications I have looked at so far require me identifying the league number for each of the 40 leagues and carrying out 40 API calls. Are you aware of anywhere that would provide this data and allow me to download it it a simpler fashion?

    • Not that I am aware of. Identifying 40 league numbers from an API, and manually plotting them into your script does not sound like much work, actually.

Leave a Reply

Your email address will not be published. Required fields are marked *