Sales Analysis

Using Python, Pandas & Matplotlib to analyze and answer business questions about 12 months' worth of sales data. The data contains hundreds of thousands of electronics store purchases broken down by month, product type, cost, purchase address, etc.

First, we assess our data for:

  • Quality: issues with content. Low-quality data is also known as dirty data.
  • Tidiness: issues with structure that prevent easy analysis. Untidy data is also known as messy data.
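
A minimal sketch of what such an assessment might look like in pandas. The sample rows and column names (`Order ID`, `Product`, `Quantity Ordered`, `Price Each`) are assumptions made for illustration, and `assess` is a hypothetical helper, not a function from the project.

```python
import pandas as pd

# Hypothetical sample; column names and values are assumptions for illustration.
raw = pd.DataFrame({
    "Order ID": ["1001", "1002", None, "Order ID", "1002"],
    "Product": ["USB-C Cable", "Monitor", None, "Product", "Monitor"],
    "Quantity Ordered": ["2", "1", None, "Quantity Ordered", "1"],
    "Price Each": ["11.95", "149.99", None, "Price Each", "149.99"],
})


def assess(df: pd.DataFrame) -> dict:
    """Summarize common quality and tidiness issues without modifying df."""
    return {
        "rows": len(df),
        # Quality: missing values per column and rows that are entirely empty.
        "missing_per_column": df.isna().sum().to_dict(),
        "fully_empty_rows": int(df.isna().all(axis=1).sum()),
        # Quality: exact duplicate rows (every occurrence after the first).
        "duplicate_rows": int(df.duplicated().sum()),
        # Tidiness: numeric-looking columns stored as text show up here.
        "dtypes": df.dtypes.astype(str).to_dict(),
    }


report = assess(raw)
print(report)
```

In this sample, the repeated header row and the numbers stored as text are the kind of issues the cleaning step would then address.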

Then we clean our data:

  • Drop NaN values from the DataFrame
  • Remove rows based on a condition
  • Change the data types (to_datetime, astype)
  • Extract data from values
  • Merge all datasets into one dataset
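
The cleaning steps above can be sketched roughly as follows. The column names, the stray header row, and the date format `%m/%d/%y %H:%M` are assumptions for illustration. `clean` is a hypothetical helper rather than the project's actual code.

```python
import pandas as pd

# Hypothetical raw data: one empty row and one repeated header row.
raw = pd.DataFrame({
    "Order ID": ["1001", "1002", None, "Order ID", "1003"],
    "Product": ["USB-C Cable", "Monitor", None, "Product", "Headphones"],
    "Quantity Ordered": ["2", "1", None, "Quantity Ordered", "1"],
    "Price Each": ["11.95", "149.99", None, "Price Each", "99.99"],
    "Order Date": ["04/19/19 08:46", "04/07/19 22:30", None,
                   "Order Date", "12/30/19 11:07"],
})


def clean(df: pd.DataFrame) -> pd.DataFrame:
    # Drop rows in which every value is NaN.
    out = df.dropna(how="all")
    # Remove rows based on a condition: here, header rows repeated in the data.
    out = out[out["Order Date"] != "Order Date"].copy()
    # Change the data types from text to numbers and timestamps.
    out["Quantity Ordered"] = out["Quantity Ordered"].astype("int64")
    out["Price Each"] = out["Price Each"].astype("float64")
    out["Order Date"] = pd.to_datetime(out["Order Date"], format="%m/%d/%y %H:%M")
    # Extract data from values: derive the month from the timestamp.
    out["Month"] = out["Order Date"].dt.month
    return out.reset_index(drop=True)


cleaned = clean(raw)
print(cleaned.dtypes)
```

Filtering out bad rows before calling `astype` matters here, since converting a column that still contains header text would raise an error.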

Once we have cleaned up our data a bit, we move on to the data exploration section.

In this section, we explore five high-level business questions related to our data:

  • What was the best month for sales? How much was earned that month?
  • What city sold the most product?
  • What time should we display advertisements to maximize the likelihood of customers buying a product?
  • What products are most often sold together?
  • What product sold the most? Why do you think it sold the most?
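
As a rough illustration of how several of these questions might be approached, the sketch below uses a small made-up table. The data, the column names, and the derived `Sales` column are assumptions, and counting product pairs with `itertools.combinations` is one possible approach, not necessarily the one used in the notebook.

```python
from collections import Counter
from itertools import combinations

import pandas as pd

# Hypothetical, already-cleaned orders (one row per product in an order).
orders = pd.DataFrame({
    "Order ID": [1, 1, 2, 3, 3, 4],
    "Product": ["Phone", "Charging Cable", "Monitor",
                "Phone", "Charging Cable", "Headphones"],
    "Quantity Ordered": [1, 2, 1, 1, 1, 2],
    "Price Each": [600.0, 15.0, 150.0, 600.0, 15.0, 100.0],
    "Order Date": pd.to_datetime([
        "2019-04-19 08:46", "2019-04-19 08:46", "2019-04-07 22:30",
        "2019-12-30 11:07", "2019-12-30 11:07", "2019-12-12 11:30",
    ]),
})
orders["Sales"] = orders["Quantity Ordered"] * orders["Price Each"]

# Best month for sales and how much was earned that month.
monthly_sales = orders.groupby(orders["Order Date"].dt.month)["Sales"].sum()
best_month = int(monthly_sales.idxmax())

# Hour of day with the most distinct orders (a proxy for when to advertise).
orders_per_hour = orders.groupby(orders["Order Date"].dt.hour)["Order ID"].nunique()
busiest_hour = int(orders_per_hour.idxmax())

# Products most often sold together: count product pairs within each order.
pair_counts = Counter()
for _, products in orders.groupby("Order ID")["Product"]:
    pair_counts.update(combinations(sorted(products), 2))

# Product with the most units sold.
top_product = orders.groupby("Product")["Quantity Ordered"].sum().idxmax()

print(best_month, monthly_sales.loc[best_month])
print(busiest_hour, pair_counts.most_common(1), top_product)
```

Sorting the products before building pairs keeps `("Phone", "Charging Cable")` and `("Charging Cable", "Phone")` from being counted as different combinations.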

To answer these questions we walk through many different pandas & matplotlib methods. They include:

  • Concatenating multiple CSV files to create a new DataFrame (pd.concat)
  • Adding columns
  • Parsing cells as strings to make new columns (.str)
  • Using the .apply() method
  • Using groupby to perform aggregate analysis
  • Plotting bar charts and line graphs to visualize our results
  • Labeling our graphs
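
The methods listed above might be combined along these lines. This is a sketch under assumptions: the two in-memory CSVs stand in for monthly files on disk, and the column names and the `street, city, state zip` address layout are invented for illustration.

```python
import io

import matplotlib
matplotlib.use("Agg")  # non-interactive backend so the sketch runs without a display
import matplotlib.pyplot as plt
import pandas as pd

# Two in-memory "monthly files" stand in for CSVs on disk (hypothetical data).
april = io.StringIO(
    "Product,Quantity Ordered,Price Each,Purchase Address\n"
    'Phone,1,600.0,"917 1st St, Dallas, TX 75001"\n'
    'Charging Cable,2,15.0,"682 Chestnut St, Boston, MA 02215"\n'
)
may = io.StringIO(
    "Product,Quantity Ordered,Price Each,Purchase Address\n"
    'Monitor,1,150.0,"333 8th St, Dallas, TX 75001"\n'
)

# Concatenate the per-month frames into one DataFrame.
all_data = pd.concat([pd.read_csv(f) for f in (april, may)], ignore_index=True)

# Add a column computed from existing columns.
all_data["Sales"] = all_data["Quantity Ordered"] * all_data["Price Each"]

# Parse cells as strings to make a new column (.str accessor).
all_data["City"] = all_data["Purchase Address"].str.split(",").str[1].str.strip()

# The same kind of parsing with .apply() and a plain Python function.
all_data["State"] = all_data["Purchase Address"].apply(
    lambda addr: addr.split(",")[2].split()[0]
)

# Aggregate with groupby, then plot and label a bar chart.
city_sales = all_data.groupby("City")["Sales"].sum()
fig, ax = plt.subplots()
ax.bar(city_sales.index.tolist(), city_sales.to_numpy())
ax.set_xlabel("City")
ax.set_ylabel("Sales (USD)")
ax.set_title("Sales by city")
```

In a notebook, the chart would normally render inline or via `plt.show()`; the `Agg` backend is only chosen here so that the sketch also runs in a headless environment.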

Author

This project was completed by Muhammad Elkfrawy, following Keith Galli's video.
