Flipkart Exploratory Data Analysis 📊🔍

Project Description 📝

This project performs an exploratory data analysis (EDA) of the Flipkart dataset. The goal is to identify patterns, trends, and relationships in the data that reveal useful information about the products and brands available on Flipkart. 🚀

Importing Dependencies 📚

The first step is to import the required Python libraries: Pandas and NumPy for data manipulation, Plotly Express and Plotly Graph Objects for visualization, and Google Colab's drive module for mounting Google Drive to access the dataset. 📈🔢📊

Loading Dataset 📂

The dataset is loaded from a CSV file stored in Google Drive with Pandas' read_csv() function, and the first few rows are displayed to show its structure and contents. 👀
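The loading step might look roughly like the sketch below. The file path and both helper names are hypothetical, since the write-up does not give them; the Drive mount is guarded so the same code also runs outside Colab.

```python
import pandas as pd

# Hypothetical location: the write-up does not state the actual path.
DATA_PATH = "/content/drive/MyDrive/flipkart.csv"


def mount_drive_if_in_colab(mount_point="/content/drive"):
    """Mount Google Drive when running inside Colab; do nothing elsewhere."""
    try:
        from google.colab import drive  # only importable inside Colab
    except ImportError:
        return False
    drive.mount(mount_point)
    return True


def load_dataset(path):
    """Read the CSV into a DataFrame (accepts a path or a file-like object)."""
    return pd.read_csv(path)
```

In a notebook this would be used as mount_drive_if_in_colab(), then df = load_dataset(DATA_PATH), then df.head() to display the first rows.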

Data Preprocessing 🔄

Before the analysis, the data is preprocessed: missing values are handled and a discount percentage is calculated for each product. Missing values are handled in the 'retail_price', 'discounted_price', 'image', 'description', 'brand', and 'product_specifications' columns. The discount percentage is the difference between 'retail_price' and 'discounted_price', divided by 'retail_price' and multiplied by 100, that is, (retail_price - discounted_price) / retail_price × 100. 🛠️
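A minimal sketch of this step follows. The write-up does not say how the missing values were handled, so the strategy here is an assumption: rows without a price are dropped and missing text fields get an "Unknown" placeholder. The preprocess helper and the 'discount_percentage' column name are also illustrative.

```python
import numpy as np
import pandas as pd

TEXT_COLUMNS = ["image", "description", "brand", "product_specifications"]
PRICE_COLUMNS = ["retail_price", "discounted_price"]


def preprocess(df):
    """Return a cleaned copy of df with a discount_percentage column added."""
    # Assumption: a row missing either price cannot yield a discount, so drop it.
    out = df.dropna(subset=PRICE_COLUMNS).copy()
    # Assumption: missing text fields are replaced by a placeholder value.
    out[TEXT_COLUMNS] = out[TEXT_COLUMNS].fillna("Unknown")
    # Treat a retail price of 0 as missing to avoid dividing by zero.
    retail = out["retail_price"].replace(0, np.nan)
    # (retail_price - discounted_price) / retail_price * 100
    out["discount_percentage"] = (retail - out["discounted_price"]) / retail * 100
    return out
```

For example, a product with a retail price of 1000 and a discounted price of 750 gets a discount percentage of 25.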

Data Analysis 📊

The analysis covers the following six aspects of the dataset. 🔍

  1. Top Products and Brands Distribution 🥇🏆 A pie chart visualizes the distribution of the top products and brands purchased on Flipkart. The 10 main products and the 10 brands with the highest purchase counts are identified and shown in the pie chart. 🍰🛍️
  2. Brands with High Discounts 💰💯 Brands that offer high discounts are identified and shown in a bar chart. The mean discount percentage is calculated for each brand, and the brands with the highest averages are presented. 📊🎁
  3. 5-Star Rating ⭐⭐⭐⭐⭐ The number of products with a 5-star rating is calculated and presented in a funnel plot. The funnel narrows from the total number of products, to the number of products with a rating, to the number of products with a 5-star rating. 🏅🎯
  4. Ratingwise Count 📈👍 The count of products at each rating level is visualized in a scatter plot, with the rating on the x-axis and the number of products on the y-axis. 📈📉
  5. Variation in Discount throughout a Year 📅📈 The average retail price and average discounted price are calculated for each date in the dataset, and their variation over the year is visualized in an area chart. 📈📉📅
  6. User Engagement Pattern throughout the Day ⌚📈 The number of clicks on product URLs is plotted against the time of day in a scatter plot, showing how user engagement on Flipkart changes over the day. 🕰️📊
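The aggregations behind items 2 to 5 can be sketched in Pandas as follows. This is an illustration under assumptions rather than the project's actual code: the function names, the 'product_rating' and 'crawl_timestamp' column names, the 'discount_percentage' column, and the default of 10 brands are not stated in the write-up.

```python
import pandas as pd


def top_brands_by_discount(df, n=10):
    """Mean discount percentage per brand, highest first (item 2)."""
    return (
        df.groupby("brand")["discount_percentage"]
        .mean()
        .sort_values(ascending=False)
        .head(n)
        .reset_index()
    )


def rating_funnel(df, rating_col="product_rating"):
    """Counts for the funnel stages: all, rated, 5-star (item 3)."""
    # Non-numeric entries (for example a "no rating" label) become NaN.
    ratings = pd.to_numeric(df[rating_col], errors="coerce")
    return pd.DataFrame({
        "stage": ["All products", "Products with a rating", "5-star products"],
        "count": [len(df), int(ratings.notna().sum()), int((ratings == 5).sum())],
    })


def rating_counts(df, rating_col="product_rating"):
    """Number of products at each rating level (item 4)."""
    ratings = pd.to_numeric(df[rating_col], errors="coerce").dropna()
    return (
        ratings.value_counts()
        .sort_index()
        .rename_axis("rating")
        .reset_index(name="count")
    )


def daily_mean_prices(df, timestamp_col="crawl_timestamp"):
    """Average retail and discounted price per calendar date (item 5)."""
    dates = pd.to_datetime(df[timestamp_col], utc=True, errors="coerce").dt.date
    return (
        df.groupby(dates)[["retail_price", "discounted_price"]]
        .mean()
        .rename_axis("date")
        .reset_index()
    )
```

Each function returns a small DataFrame that could be handed to the matching Plotly Express call, for example px.bar(top, x="brand", y="discount_percentage") or px.funnel(funnel, x="count", y="stage").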

Conclusion 📝🎉

The exploratory data analysis of the Flipkart dataset provides insights into product and brand trends, user engagement, and pricing patterns. These findings can be used to optimize marketing strategies, identify popular products and brands, and understand user behavior on the platform. 📈💼🧠