E-Commerce
Product
Shipping
Data Analysis

E-Commerce Shipping Data Analysis

Tags

Data cleaning
Pivot tables
Visualization
Interactive dashboard

Tools

Excel

Source

Get the raw data from Kaggle

Context

An international e-commerce based company wants to discover key insights from their customer database. They want to use some of the most advanced machine learning techniques to study their customers. The company sells electronic products.

Data Cleaning:

Since this dataset was pulled from Kaggle to much cleaning was not necessary. I trimmed few unnecessary data & added a few columns by using the existing data for analysis. I've also converted string data to boolean for the column called Reached on time. This column recorded if on time delivery was a success or not.

Exploratory Data Analysis (EDA):

EDA was performed in two separate tables to figure out various stuff regarding shipments, effects of shipments & modes used. Both the tables are reformed by selecting sets of columns from the cleaned table depending on the analysis. The tables created for EDA are:

1) Weight range & modes of shipment: This table mostly focuses on the relationship between weight and the modes selected for shipping that weight. This was done to see if any pattern was visible on that context and because of that reason weight ranges (in kgs) was added as a new column. This also came in handy while filtering data.


  • Here Cost of products are colorized in a descending fashion, the range is darker shades of blue (meaning larger values) to darker shades of red (meaning smaller values).
  • Weight in grams have bar charts showing visual difference also a range column is calculated from it.
  • Click on the dropdown icon to filter data live.

2) Modes of shipment related to customer satisfaction: This table mostly focuses on the relationship between the modes selected for shipping and customer satisfaction after getting shipment. This was done to see if any pattern was visible on that context.


  • Here Customer ratings are colorized in a descending fashion, the range is red < orange < yellow < light green < green.
  • Click on the dropdown icon to filter data live.

3) Customer satisfaction to repeating customers ratio: This table mostly focuses on the ratio between the number of returning customers and satisfaction level of that customer. This is also related to mode of transport which effects the customer satisfaction that determines the return rate.


  • Here Customer ratings are colorized in a descending fashion, the range is red < yellow < darker shade of yellow < light green < green.
  • Click on the dropdown icon to filter data live.

Modes:

There are 3 types of modes transporting the loads. A visual representation is shown based on the popularity of each individual mode.

It is clearly shown that mostly ships are used while transporting. Even the combined number of shipments by flights and roads are less than half while comparing with ship transports.


  • Because of this large number, the filtered (filtered customer rating - ascending to descending) version of second EDA table shows that even if ships are mostly used they generate the larger portion of low ratings.
  • For a better understanding visualization are represented as dashboards.

Dashboard:

1) This interactive Visualization/Dashboard explains the relationship between product weight & shipment modes.


Filtering is possible based on level of importance received from customers.