Soccer Match Event Data
- Joe G
- Mar 28, 2021
- 1 min read
Updated: Mar 30, 2021
The data was collected by:
and it contains all the Spatio-temporal events (passes, shots, fouls, etc.) that occur during all matches of an entire season of seven competitions (La Liga, Serie A, Bundesliga, Premier League, Ligue 1, FIFA World Cup 2018, UEFA Euro Cup 2016) as published on figshare. A match event contains information about its position, time, outcome, player, and characteristics.
In total, there are 10 data sets presented on figshare. Different variables were recorded in each data set, however, some common attributes can be found in several data sets. These common attributes make it easier to merge several data sets together. In this project, only 4 data sets (Players, Competitions, Events, and Matches data sets) will be used. These will be merged together through common attributes. The data sets contain continuous, nominal, and or categorical variables.
The players, matches, events, and competitions datasets contain 14, 23, 12, and 5 attributes, respectively. The data contains information of all matches played in 7 competitions. They have records of various events which happened in all the matches. For each player, there are at least 50 attributes with match-related events related to a particular player.
In our project, the matches data set will be the core data around which more elaborate information can be provided by the events, competition, and player data sets. Seven match events are categorized for which it is registered which player took place in the event as well as temporal and position-specific information of the event. Detailed information about the players is provided and all this data is linked to the competition data as well.

Comments