This project explores an e-commerce shipment dataset to extract insights about product shipments, delivery performance, and customer interactions using MySQL. It demonstrates SQL fundamentals such as data filtering, aggregation, subqueries, views, and optimization techniques.
- Database: MySQL
- Interface: MySQL Workbench or Command Line
- Data Source: UK-based e-commerce CSV dataset (2010β2011)
- Database Name:
Databasetask - Table Name:
Tableofdatabase - Key Columns:
ID,Warehouse_block,Mode_of_Shipment,Customer_care_calls,Customer_rating,Cost_of_the_Product,Prior_purchases,Product_importance,Gender,Discount_offered,Weight_in_gms,Reached_on_Time_Y_N
- β Total shipments & sample records
- π¦ Delivery performance: On-time vs delayed shipments
- π Shipment cost analysis by mode of transport
- π’ Revenue breakdown by warehouse (after discount)
β οΈ Delay analysis by product importance- π° Top 5 expensive delayed shipments
- π Above-average cost products using subqueries
- π Reusable view for high-value delayed shipments
- ποΈ Indexes on
Warehouse_blockandReached_on_Time_Y_N - ποΈ Views to simplify analysis
- π EXPLAIN used to inspect query performance
This project demonstrates core SQL skills:
- Table creation and CSV data import
- Filtering and grouping data with
WHERE,GROUP BY,ORDER BY - Performing subqueries and calculations (
AVG,SUM) - Creating reusable views and optimizing performance with indexes
A great example of applying SQL to analyze real-world structured data effectively.