__STYLES__
Analyzing real-life data from the New York Stock Exchange (NYSE). A subset of a large dataset provided by Kaggle was used in this analysis. The dataset contained historical financial data from S&P 500 companies listed on the NYSE between 2013 and 2015, and sometimes 2016 if available.
Background: Two (2) datasets Fundamentals.csv and Securities.csv files were downloaded from Kaggle. The Fundamentals file provides the fundamental financial data gathered from SEC 10K annual filings from 448 companies listed on the S&P 500 index. The Securities file provided the industry or sector information for the companies under their respective categories in the S&P 500 index. Both datasets were cleaned and combined to one dataset for the purpose of this project.
Tasks to be performed included: calculating summary statistics, drawing inference from the statistics, measuring business metrics, and using financial models to forecast future growth projects for the 448 companies listed in the NYSE.
The cleaned dataset contains the following information:
Skills to be Demonstrated: As a seasoned Data Analyst, I wanted to showcase my ability to:
Below is the link to the Excel File. Please download and view the dynamic dashboards locally on your computer using Microsoft Excel (Note: Excel dashboards do not open properly in google sheets).
Excel Financial Forecast Model & Dashboard
Data cleaning using Excel functions (NULL functions, IF statements, INDEX, MATCH, VLOOKUPs) was necessary to fix data quality and tidiness issues. I conducted exploratory data analysis to understand trends, patterns and correlations between variables in the dataset. And then I conducted an explanatory data anlysis using research questions derived from EDA of the dataset. These research questions include:
On average, PepsiCo (PEP) earned the highest revenue of $65.38 billion between 2013 and 2015 , followed by Coca-Cola (KO) in 2nd position with $45.71 billion. Dr Pepper (DPS) ranks third with $6.13 billion average revenue between 2013 & 2015, while Monster Beverages (MNST) comes in lastly with $2.48 billion. The mean of the average revenue earned by all 4 companies between 2013 and 2015 was $29.93 billion with a standard deviation of $30.69 billion. This is because the gap or range between PEP (the highest earner) and MNST (the lowest earner) was $62.9 billion. The chart indicates that PEP earned approximately 26.4 times more than MNST; and when compared to the others, PEP earned approximately 10.7 times more than DPS, and 1.4 times more than KO.
On average between 2013 and 2015, PepsiCo (PEP) generated the highest operating profit of $9.67 billion, followed by Coca-Cola (KO) with $9.55 billion, then Monster Beverages (MNST) with $1.38 billion, and finally Dr Pepper (DPS) with $1.17 billion. The mean of the average operating profit by all 4 companies between 2013 and 2015 was $5.44 billion with a standard deviation of $4.81 billion. This is because the gap or range between PEP (the highest) and DPS (the lowest) was $8.49 billion. The chart indicates that PEP made approx. 8 times more operating profit than DPS, 7 times more than MNST, and almost the same amount with KO. This suggests that although PEP earned significantly more revenues between 2013 and 2015 (as seen from the previous slide), there may be a lot of contributing factors such as costs and expenditures taking away from their profits.
On average between 2013 and 2015, PepsiCo (PEP) had the highest cost of revenue of approx. $30.40 billion, followed by Coca-Cola (KO) with $17.93 billion, then Dr Pepper (DPS) with $2.52 billion, and finally Monster Beverages (MNST) with $1.10 billion. The mean of the average cost of revenue by all 4 companies between 2013 and 2015 was $12.99 billion with a standard deviation of $13.89 billion. This is because the gap or range between PEP (the highest) and MNST (the lowest) was $29.31 billion. The chart indicates that PEP spent approx. 28 times more on cost of revenue than MNST, 12 times more than DPS, and almost 2 times more than KO.
Between 2013 and 2015, PepsiCo earned the highest average revenue of $65.38 billion, followed by Coca-Cola in 2nd position with $45.71 billion. Dr Pepper ranks third with $6.13 billion, while Monster Beverages comes in lastly with $2.48 billion. PepsiCo also generated the highest operating profit of $9.67 billion on average, followed closely by Coca-Cola with $9.55 billion, while Dr Pepper and Monster Beverages made significantly less operating profit. However, PepsiCo had the highest cost of revenue of approximately $30.40 billion, followed by Coca-Cola, Dr Pepper, and Monster Beverages. These figures suggest that while PepsiCo earned significantly more revenue than its competitors, it also incurred significantly higher costs.
The NYSE-Data dataset only contained data about the total revenue generated, the amounts spent on cost to generate the revenue, and the operating profit before deduction of income taxes and interest payments. However, to better understand how these soft-drinks companies performed between 2013 and 2015, data enrichment is necessary to gather data on other financial metrics such as the profit margin, operating margin, debt-to-asset ratio, and net profit margin, to better understand performance and measure profitability for these soft-drinks companies.
As an extension to this project, I plan on gathering data on the other financial metrics listed above from the financial statements of Pepsi, Coca-Cola, Dr Pepper and Monster Beverages. The assumption is that these data points are readily available through their respective financial statements for the years 2013 - 2015 from their respective websites. If not, I will have to use websites that provide publicly available datasets such as:
... just to name a few.
Once again you can download the Excel Dashboard using the link below. Please download and view the dynamic dashboards locally on your computer using Microsoft Excel (Note: Excel dashboards do not open properly in google sheets).