__STYLES__
In this project, I wanted to tell the story of France's history with the World Cup, along with their expectations in this year's tournament.
This dataset contains 6 tables, in CSV format:
source: Maven World Cup Challenge
I considered what might be important to a team's success in the World Cup including:
From there, I utilized BigQuery to analyze the data. Below are a few sample queries.
WITH games AS (
SELECT
COUNT(DISTINCT CASE WHEN home_team LIKE 'France%' AND home_goals > away_goals THEN ID ELSE NULL END) AS home_wins
, COUNT(DISTINCT CASE WHEN home_team LIKE 'France%' THEN ID ELSE NULL END) AS home_games
, SUM(CASE WHEN home_team LIKE 'France%' AND home_goals > away_goals THEN home_goals ELSE NULL END) AS home_goals
, COUNT(DISTINCT CASE WHEN away_team LIKE 'France%' AND away_goals > home_goals THEN ID ELSE NULL END) AS away_wins
, COUNT(DISTINCT CASE WHEN away_team LIKE 'France%' THEN ID ELSE NULL END) AS away_games
, SUM(CASE WHEN away_team LIKE 'France%' AND away_goals > home_goals THEN away_goals ELSE NULL END) AS away_goals
, COUNT(DISTINCT CASE WHEN home_team LIKE 'France%' OR away_team LIKE 'France%' THEN ID ELSE NULL END) AS total_games
FROM `bright-zodiac-346921.world_cup.international_matches`
)
SELECT
*
, home_wins / home_games AS home_win_percentage
, away_wins / away_games AS away_win_percentage
FROM games
WITH ranking AS (
SELECT
COUNT(DISTINCT CASE WHEN winner LIKE 'France%' THEN year ELSE NULL END) AS winner
, COUNT(DISTINCT CASE WHEN runners_up LIKE 'France%' THEN year ELSE NULL END) AS runner_up
, COUNT(DISTINCT CASE WHEN third LIKE 'France%' THEN year ELSE NULL END) AS third_place
, COUNT(DISTINCT CASE WHEN fourth LIKE 'France%' THEN year ELSE NULL END) AS fourth
, COUNT(DISTINCT CASE WHEN host_country LIKE 'France%' THEN year ELSE NULL END) AS host_country
FROM `bright-zodiac-346921.world_cup.world_cups`
)
SELECT
*
FROM
ranking
In this challenge, I wanted to represent France's past and future in the World Cup. Rather than reporting on KPIs and business metrics, I used information such as how many World Cups France has qualified for and how many games they have played in. This challenge was unique in that it encouraged data outside of the given dataset.
I found that France has been a World Cup winner twice and has hosted the World Cup twice. The team is currently ranked number 4 but they have won both games in the group stage of the tournament and will be moving on. I researched players' statistics to gain a better understanding of the likelihood of France advancing further in the tournament.