0% found this document useful (0 votes)
222 views3 pages

Retail Sales Data Analysis Insights

The document outlines an end-to-end data analytics project aimed at analyzing retail sales data to derive insights on revenue, customer behavior, and product performance using SQL, Excel, Power BI, and Python. It includes tasks for SQL queries, Excel pivot tables, Power BI dashboards, and Python data analysis, along with a dataset of 5,000 transaction records. Key business insights indicate that electronics generate the most revenue, while clothing has higher transaction frequency, and that UPI and credit cards are the preferred payment methods.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
222 views3 pages

Retail Sales Data Analysis Insights

The document outlines an end-to-end data analytics project aimed at analyzing retail sales data to derive insights on revenue, customer behavior, and product performance using SQL, Excel, Power BI, and Python. It includes tasks for SQL queries, Excel pivot tables, Power BI dashboards, and Python data analysis, along with a dataset of 5,000 transaction records. Key business insights indicate that electronics generate the most revenue, while clothing has higher transaction frequency, and that UPI and credit cards are the preferred payment methods.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd

■ Retail Sales & Customer Insights Analytics

End-to-End Data Analytics Project using SQL, Excel, Power BI, and Python
■ Project Objective
The goal of this project is to analyze retail sales data to gain insights into revenue,
customer behavior, and product performance. The project demonstrates data cleaning,
querying, visualization, and predictive analytics using SQL, Excel, Power BI, and Python.

■ Dataset Description
The dataset contains **5,000 rows** of transaction-level data with the following columns: -
TransactionID: Unique ID for each transaction - Date: Date of purchase - CustomerID,
CustomerName, Gender, Age, City - ProductCategory, ProductName, Quantity, UnitPrice,
TotalAmount - PaymentMethod (Cash, UPI, Credit Card, Wallet)

■■ SQL Tasks
1. **Top 5 Best-Selling Products** ```sql SELECT ProductName, SUM(Quantity) AS
TotalSold FROM retail_sales GROUP BY ProductName ORDER BY TotalSold DESC
LIMIT 5; ``` 2. **Monthly Revenue Trends** ```sql SELECT DATE_TRUNC('month', Date)
AS Month, SUM(TotalAmount) AS Revenue FROM retail_sales GROUP BY Month
ORDER BY Month; ``` 3. **Customer Lifetime Value** ```sql SELECT CustomerID,
CustomerName, SUM(TotalAmount) AS CLV FROM retail_sales GROUP BY CustomerID,
CustomerName ORDER BY CLV DESC LIMIT 10; ``` 4. **Popular Payment Methods**
```sql SELECT PaymentMethod, COUNT(*) AS UsageCount FROM retail_sales GROUP
BY PaymentMethod ORDER BY UsageCount DESC; ```

■ Excel Tasks
1. Create a **PivotTable** to show sales by ProductCategory. 2. Insert a **line chart** for
Monthly Sales trends. 3. Add **slicers** for filtering by Gender, City, and PaymentMethod.
4. Create a **Customer Segmentation Report** using Age Groups.

■ Power BI Tasks
1. Create a dashboard with the following visuals: - Total Revenue by Month (Line Chart) -
Revenue by City (Bar Chart) - Best-Selling Products (Stacked Bar) - Customer
Demographics (Pie/Donut Chart) - Payment Method Share (Pie Chart) 2. Add filters for
Date Range, City, and Category.

■ Python Tasks
```python import pandas as pd import [Link] as plt import seaborn as sns # Load
dataset df = pd.read_csv("retail_sales_dataset.csv") # Data Cleaning [Link]().sum() #
EDA: Sales by Category category_sales =
[Link]("ProductCategory")["TotalAmount"].sum() category_sales.plot(kind="bar",
title="Revenue by Category") # Correlation between Age & Spending
[Link](x="Age", y="TotalAmount", data=df) # Monthly Revenue Trend df["Date"] =
pd.to_datetime(df["Date"]) monthly_revenue =
[Link](df["Date"].dt.to_period("M"))["TotalAmount"].sum()
monthly_revenue.plot(title="Monthly Revenue Trend") [Link]() ``` (Optional) Use
`Prophet` or `ARIMA` to forecast next month’s sales.

■ Business Insights & Recommendations


- Electronics generate the highest revenue, but Clothing has higher transaction frequency.
- Customers aged 25–40 spend the most on average. - UPI and Credit Cards are the most
popular payment methods. - Revenue peaks during festive months (Oct–Dec).

You might also like