Football analytics beginner guide

Football analytics beginner guide

Football analytics beginner guide


Unlocking the Beautiful Game: A Beginner’s Guide to Football Analytics

Football, often dubbed "The Beautiful Game," has captivated billions for centuries. Traditionally, understanding its nuances relied heavily on the "eye test" – the subjective observations of coaches, scouts, and fans. While invaluable, this approach is increasingly complemented, and sometimes challenged, by the objective insights of football analytics.

Welcome to the exciting world of football analytics, where data transforms into actionable intelligence, revealing the hidden patterns and deeper truths behind every pass, shot, and tackle. This guide is your starting point, designed for anyone passionate about football who wants to move beyond the scoreline and understand the game on a deeper, data-driven level.

Why Football Analytics? Beyond the Score

At its core, football analytics is the process of collecting, processing, and interpreting data related to football matches, players, and teams to gain a competitive advantage or a deeper understanding. Why is it becoming indispensable?

  1. Objective Decision Making: Human perception is prone to bias. Analytics provides an objective lens, helping to confirm or challenge subjective opinions. A player might "look good," but data can reveal if their actions are truly impactful.
  2. Competitive Edge: Professional clubs use analytics for everything from identifying undervalued talent in the transfer market to fine-tuning tactical strategies against specific opponents.
  3. Performance Optimization: Coaches can pinpoint areas for player improvement, track fitness levels, and understand team dynamics.
  4. Deeper Fan Engagement: For fans, analytics offers a richer viewing experience. Understanding metrics like Expected Goals (xG) or PPDA (Passes Per Defensive Action) allows for more informed discussions and appreciation of tactical battles.
  5. Risk Mitigation: Data can help assess the risk of injuries, identify players prone to certain types of fouls, or even evaluate the potential success of different formations.

In essence, analytics allows us to dissect the game, understand why things happen, and predict what might happen next, moving beyond just what happened.

The Raw Material: Types of Football Data

Before we can analyze, we need data. Football data comes in various forms, each offering different insights:

  1. Event Data (Eventing Data): This is the most common and often most accessible type for beginners. It records every significant action that occurs during a match – passes, shots, tackles, fouls, interceptions, dribbles, saves, goals, etc. Each event is logged with its timestamp, location on the pitch (x, y coordinates), the players involved, and the outcome.

    • Example: A pass from Player A to Player B at 34:12 from position (10, 20) to (50, 60), successful. A shot by Player C at 67:05 from position (80, 45), blocked.
    • Accessibility: Many public data providers (like Opta, StatsBomb, Wyscout, although commercial, often have free samples or derived data available on sites like Fbref.com, Understat.com) primarily offer event data.
  2. Tracking Data (Positional Data): This is more advanced and less publicly available due to its complexity and cost. It records the precise location (x, y coordinates) of every player and the ball on the pitch multiple times per second (e.g., 25 frames per second).

    • Example: At 34:12:01, Player A is at (10.1, 20.3), Player B is at (50.0, 60.1), Ball is at (30.0, 40.0).
    • Insights: Crucial for understanding spatial relationships, team shape, defensive lines, off-the-ball movement, pressing intensity, and player fatigue (distance covered, sprint speed).
  3. Physical Data (Wearable Data): Collected from GPS trackers, heart rate monitors, and other wearable devices worn by players during training and matches.

    • Insights: Provides information on player load, fatigue, sprint distances, top speeds, and recovery rates, vital for sports science and injury prevention.
  4. Qualitative Data: This isn’t numerical but equally important. It includes scout reports, coach observations, player interviews, and medical histories.

    • Insights: Provides context, explains motivations, and helps understand player personality, leadership qualities, and mental resilience – aspects that numbers alone can’t fully capture.

For beginners, focusing on event data is the most practical starting point.

Key Metrics and Concepts for Beginners

Once you have data, what do you look for? Here are some fundamental metrics that have revolutionized football analysis:

  1. Expected Goals (xG): This is arguably the most famous and widely used metric. xG measures the probability that a shot will result in a goal, based on historical data of thousands of similar shots. Factors considered include:

    • Distance to goal
    • Angle to goal
    • Type of assist (through ball, cross, cutback)
    • Body part used (head, foot)
    • Presence of defenders
    • Whether it was a rebound, a one-on-one, etc.
    • Why it’s useful: It tells you about the quality of chances created/conceded, not just the quantity. A team might score 2 goals from 0.5 xG (lucky), or 0 goals from 3.0 xG (unlucky/poor finishing). It helps identify sustainable performance.
  2. Expected Assists (xA): Similar to xG, xA measures the probability that a pass will become an assist, given the quality of the chance it creates for the receiving player.

  3. Passes Per Defensive Action (PPDA): A key metric for understanding pressing intensity. It measures the number of passes an opponent is allowed to make in their defensive and middle third before a defensive action (tackle, interception, foul, block) is made by the pressing team.

    • Lower PPDA: Indicates a more aggressive, high-pressing team. (e.g., Liverpool under Klopp)
    • Higher PPDA: Indicates a more passive, sitting-back team. (e.g., many counter-attacking teams)
  4. Progressive Passes/Carries: These metrics measure how often a player or team moves the ball significantly closer to the opponent’s goal.

    • Progressive Pass: A pass that moves the ball at least 10 yards closer to the opponent’s goal, or into the penalty box, excluding passes from the defensive 40% of the pitch.
    • Progressive Carry: A carry (dribble) that moves the ball at least 10 yards closer to the opponent’s goal, or into the penalty box, excluding carries from the defensive 40% of the pitch.
    • Why useful: Identifies players who are good at breaking lines and advancing play.
  5. Shot-Creating Actions (SCA) & Goal-Creating Actions (GCA): These metrics count the two offensive actions directly leading to a shot (SCA) or a goal (GCA), such as passes, dribbles, or drawing a foul. They help credit players who contribute to attacks even if they don’t get the final assist or goal.

  6. Possession Percentage: While basic, understanding how a team uses its possession (e.g., high possession with low xG vs. low possession with high xG) is crucial for tactical analysis.

Tools for the Beginner Analyst

You don’t need expensive software to start. Your most powerful tools are:

  1. Spreadsheets (Microsoft Excel / Google Sheets): This is your best friend.

    • Data Organization: Clean and structure your raw data.
    • Calculations: Perform basic arithmetic (averages, sums, percentages) and more complex formulas.
    • Filtering & Sorting: Quickly find specific events or players.
    • Basic Visualizations: Create simple bar charts, line graphs, and scatter plots to present your findings.
    • Learn: Familiarize yourself with functions like SUMIF, COUNTIF, AVERAGEIF, VLOOKUP, PIVOT TABLES.
  2. Public Data Sources:

    • Fbref.com: An incredible free resource for historical and current player/team statistics, including xG, xA, progressive passes, and much more, derived from Opta data.
    • Understat.com: Primarily focused on xG data for top European leagues.
    • Sofascore / WhoScored: Provide detailed match statistics, player ratings, and sometimes event maps.
  3. Visualization Tools (Optional but Recommended):

    • Tableau Public / Power BI Desktop (Free Versions): If you want to move beyond basic spreadsheets, these tools offer much more sophisticated and interactive data visualization capabilities. They have a learning curve but are highly rewarding.
  4. Python / R (Advanced): These programming languages are the backbone of professional analytics, offering immense power for data manipulation, statistical modeling, and advanced visualization. Not for beginners, but something to aspire to.

The Analytical Process: A Step-by-Step Approach

So, how do you actually do football analytics?

  1. Define Your Question: Start with a clear question.

    • Examples: "Which player in League X is most effective at progressing the ball forward?" "Is my team creating high-quality chances?" "How does our pressing intensity compare to top teams?" "Who should replace Player A based on their defensive contributions?"
  2. Acquire Data: Find the data relevant to your question. For beginners, this means extracting data from Fbref.com or Understat.com and importing it into your spreadsheet. You might need to manually copy-paste or use simple web scraping techniques (for personal use).

  3. Clean and Prepare Data: Raw data is rarely perfect. You might need to:

    • Remove duplicate entries.
    • Correct inconsistencies (e.g., player names spelled differently).
    • Format columns correctly (numbers, text, dates).
    • Handle missing values.
  4. Analyze and Calculate: This is where you apply your metrics.

    • Calculate xG per 90 minutes for forwards.
    • Sum up progressive passes for midfielders.
    • Calculate PPDA for your team and opponents.
    • Use conditional formatting to highlight highs and lows.
  5. Visualize Insights: Numbers alone can be dry. Visualizations make your findings understandable and impactful.

    • Bar charts for comparing player metrics.
    • Line graphs to show trends over time.
    • Scatter plots to compare two metrics (e.g., xG vs. Actual Goals).
    • Heatmaps for player activity or shot locations (more advanced, often requires programming).
  6. Interpret and Communicate: The most crucial step. What do your findings mean in the context of the game?

    • Don’t just present numbers; tell a story.
    • Consider the "why" behind the data. Is a player’s low xG due to poor finishing or poor chance creation?
    • Remember the human element: Data is a tool, not the sole answer. Combine it with your eye test and understanding of the game.
    • Present your findings clearly and concisely.

Getting Started: Your First Steps

  1. Watch Games Actively: Don’t just watch for entertainment. Try to identify patterns. Where do teams win the ball? How do they build up play? What kind of chances are they creating?

  2. Master Spreadsheets: Seriously, dedicate time to learning Excel or Google Sheets. There are countless free tutorials online.

  3. Explore Free Data Sources: Spend time on Fbref.com and Understat.com. Play around with their filters and tables. Download data if possible and try to replicate some of their numbers.

  4. Read and Learn:

    • Blogs: Follow leading football analytics blogs (e.g., StatsBomb, The Athletic’s analytics articles, Wyscout’s blog).
    • Books: Look for introductory books on sports analytics.
    • Online Courses: Many platforms (Coursera, Udemy, YouTube) offer free or affordable courses on data analysis, statistics, and even specific football analytics topics.
  5. Start Small, Be Curious: Don’t try to solve everything at once. Pick a single player or a single match and try to analyze one specific aspect. What was the xG for a particular game? How many progressive passes did your favorite midfielder make?

  6. Join the Community: Follow football analysts on Twitter/X, Reddit, or other forums. Engage in discussions, ask questions, and learn from others.

Challenges and Limitations

While powerful, analytics isn’t a silver bullet:

  • Data Quality and Availability: Not all data is perfect or easily accessible. Public data often lacks the granularity of professional datasets.
  • Context is King: Numbers without context can be misleading. A defender’s low tackle count might mean they’re excellent at positioning, not poor at defending.
  • Correlation vs. Causation: Just because two things happen together doesn’t mean one causes the other.
  • The "Eye Test" Still Matters: Analytics complements, but doesn’t fully replace, human observation, intuition, and understanding of the emotional and psychological aspects of the game.
  • Over-reliance: Blindly following data without critical thinking can lead to bad decisions.

Conclusion

Football analytics is an incredibly exciting and rapidly evolving field that blends the passion of sport with the precision of data science. It empowers us to appreciate the Beautiful Game on a deeper level, moving beyond surface-level observations to uncover the strategic brilliance, individual contributions, and underlying dynamics that shape every match.

As a beginner, you have a wealth of resources at your fingertips. Start with curiosity, embrace the numbers, and combine them with your love for football. The journey into football analytics will not only enhance your understanding of the sport but also equip you with valuable analytical skills applicable far beyond the pitch. So, grab your spreadsheet, dive into the data, and start unlocking the secrets of the beautiful game!

Football set-piece tactics 2025

Leave a Reply

Your email address will not be published. Required fields are marked *