
Netflix Beyond the Screen: A Data-Driven Exploration of the Streaming Universe

About this project

This data analysis project explores Netflix's streaming catalog. Using data visualization techniques and analytical tools, it looks at the entertainment trends, user behavior, and content preferences of Netflix's global audience, from the top genres and titles to the age ratings and durations of movies.

Here are the questions I tried to answer:

What are the most common genres of content on Netflix?

Are certain genres more popular in certain countries?

How has the length of movies or TV shows on Netflix changed over time?

How many movies on Netflix were released in each year?

What is the distribution of content by age rating?

Project Steps:

The dataset's columns were: show_id, type (Movie/TV Show), title, director, cast, country, date_added, release_year, rating (PG/R/TV), duration, listed_in (genre), description.

  • It had about 8,800 rows, of which about 3,500 had missing values.
  • I cleaned the data in Power Query:
  1. Filtered out all rows with null values. (Is filtering the correct thing to do?)
  2. Changed the dates to a uniform Date type.
  3. Split the cast and country columns by a delimiter so that each column holds only one value, which makes visualizations easier. I deleted the remaining values in the new columns.
  4. Split the duration column by a delimiter to make it numeric (for example, "120 mins" becomes 120).
  • Then I added a new custom column in Data view with a nested IF:
    Duration_type =
    IF(netflix_titles[duration.1] >= 1 && netflix_titles[duration.1] <= 15, "Season",
    IF(netflix_titles[duration.1] > 15 && netflix_titles[duration.1] <= 30, "Less than 30 minutes",
    IF(netflix_titles[duration.1] > 30 && netflix_titles[duration.1] <= 60, "30 to 60 mins",
    IF(netflix_titles[duration.1] > 60 && netflix_titles[duration.1] <= 120, "1-2hour",
    IF(netflix_titles[duration.1] > 120, ">2 hours")))))
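
One thing to watch in the nested IF above: because it treats any value from 1 to 15 as a season count, a movie that runs 15 minutes or less would also be labelled "Season". A possible alternative, shown here only as a sketch, is to check the type column first and then bucket the minutes with SWITCH(TRUE()). It assumes the table is still called netflix_titles, that duration.1 is numeric, and that type holds the values "Movie" and "TV Show" as used in the measures in this project. Under this sketch every TV show is labelled "Season", regardless of how many seasons it has.

```dax
// Sketch only: an alternative calculated column, not the formula used for the dashboard.
// Assumes netflix_titles[type] is "Movie" or "TV Show" and netflix_titles[duration.1] is numeric.
Duration_type =
SWITCH(
    TRUE(),
    ISBLANK(netflix_titles[duration.1]), BLANK(),          // leave missing durations unlabelled
    netflix_titles[type] = "TV Show", "Season",            // TV shows are measured in seasons
    netflix_titles[duration.1] <= 30, "Less than 30 minutes",
    netflix_titles[duration.1] <= 60, "30 to 60 mins",
    netflix_titles[duration.1] <= 120, "1-2hour",
    ">2 hours"                                             // everything longer than 120 minutes
)
```

SWITCH(TRUE()) returns the result for the first condition that is true, so each branch only needs an upper bound, and the category labels stay the same as in the original column.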

  • I added a new table for measures to count the movies and TV shows from the type column:

  1. Movies = CALCULATE(COUNT(netflix_titles[Title]), FILTER(netflix_titles, netflix_titles[type] = "Movie"))
  2. TV Shows = CALCULATE(COUNT(netflix_titles[Title]), FILTER(netflix_titles, netflix_titles[type] = "TV Show"))

  • Then I created this dashboard.
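
The two counting measures above can also be written more compactly. The sketch below is one possible variant, not the version used for the dashboard. It assumes the same netflix_titles table and type values, and it counts rows rather than non-blank titles, which gives the same result only if no title is blank (as should be the case once null values are filtered out).

```dax
// Sketch only: compact variants of the two measures, using a Boolean filter argument.
// KEEPFILTERS makes the condition intersect with any existing filter on the type column.
Movies = CALCULATE(COUNTROWS(netflix_titles), KEEPFILTERS(netflix_titles[type] = "Movie"))

TV Shows = CALCULATE(COUNTROWS(netflix_titles), KEEPFILTERS(netflix_titles[type] = "TV Show"))
```

Without KEEPFILTERS, a Boolean filter argument replaces any filter already applied to the type column, so a slicer set to "TV Show" would no longer affect the Movies measure. With it, the measures respond to slicers in the same way as the FILTER versions.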