Project Brief: New York Leading Causes of Death - Data Analysis and Visualization
As a data analyst, I undertook a comprehensive analysis of New York City's leading causes of death using Power BI. The dataset was sourced from data.gov and encompassed data spanning from 2007 to 2019. The primary objective of this project was to identify and visualize the most significant patterns and trends related to mortality in the city. Through a series of analytical steps and data visualizations, I sought to answer specific questions and gain insights into the demographic patterns influencing mortality rates.
Analytical Steps:
- Data Preparation: The raw dataset was imported into Power Query, where I ensured appropriate data types were applied to facilitate accurate analysis.
- Data Cleansing: I utilized the Replace Value function to simplify the sex, race ethnicity, and leading cause columns. Additionally, missing values were addressed to ensure data integrity.
- Creating Measures: Key measures were generated to calculate the total number of deaths, total diseases, and race-related statistics.
Data Visualization
To gain a comprehensive understanding of the data, I visualized the information in the form of graphs and charts, and deduced valuable insights that shed light on various demographic patterns influencing mortality in New York City.
Questions:
- Top Leading Causes of Death: What are the top leading causes of death in New York overall?
- Gender-Based Mortality: What is the percentage of death for males and females over the period of years?
- Mortality Rate Over Time: What is the overall mortality rate in New York for each year?
- Ethnic Group-Based Mortality: Which ethnic group or race recorded the highest and the least number of deaths during the period?
Insights Obtained:
- The analysis revealed that heart diseases accounted for the highest number of deaths (231,423) between 2007 and 2019 in New York City. Nephritis and Nephrotic Syndrome had the lowest number of deaths among the top 10 causes of death analyzed.
- Over the examined period, the data indicated that females experienced a slightly higher number of deaths compared to males, indicating a noteworthy gender-related pattern in mortality rates.
- The records showed that the highest number of deaths occurred in 2018 (54,700), while the fewest number of deaths was recorded in 2015 (51,000). This temporal variation offers insights into the changing mortality rates over the years.
- Among different ethnic groups, White Non-Hispanic individuals had the highest number of deaths (325,000). Analyzing variations in mortality rates among different ethnic groups can provide a deeper understanding of health disparities.
Overall, the project sheds light on the leading causes of death, mortality rates, and demographic patterns in New York City, offering valuable insights for further analysis and informed decision-making.