Skip to content

chirathlv/Project-Winelist

Repository files navigation

Winelist Corporation

Table of Contents

  1. Introduction- Project Win(e)list
  2. Data, Technology and Coding Standards
    1. Data Sources
    2. Technology Stack
    3. Coding and Release Standards
  3. Data Cleanse and Visualisation
    1. Data Cleanse
    2. Analysis- Customer Segmentation
    3. Analysis- Channel Effectiveness
    4. Analysis- Campaign Effectivenes
    5. Key Findings
  4. Recommendations
  5. References

Introduction- Project Win(e)list

Winelist Corporation, a subsidiary of a boutique grocery chain based in US, has requested an analysis of its wine customers and sales. This analysis will serve as input into their wine sales maximisation and marketing spend strategy. Data visualisation tools have been used to present the findings and recommendations.

Data, Technology and Coding Standards

Data Sources

Winelist Corporation has provided the following data: Winelist Corporation

Additionally, the American Census data has been sourced using API American Census Data

Technology Stack

The following technologies have been deployed:

Coding and Release Standards

Following rules have been applied during code development and testing:

  1. All variables must reflect their purpose. Underscore to be used as and when required.
  2. Each step of the code must contain comments to explain the purpose of the code.
  3. A git hub repository called Project1 must be set up with branches for each developer.
  4. Each developer must use their own git hub branch to code and unit test developed code.
  5. Lead developer must review code prior to merge.
  6. Lead developer is responsible for merging all code.
  7. Each developer must download the most recent code from main branch before commencing code changes.
  8. Each release must provide a brief message on changes made prior to committing the code.

Data Cleanse and Visualisation

Data Cleanse

The following data cleanse rules have been applied to the source marketing data set:

  1. Identified data set must be formatted correctly prior to use- Date format yyyy/mm/dd, amount must be integer.
  2. Calculate age from birth year.
  3. Redundant data, if any, must be dropped- drop columns Z_CostContact', 'Z_Revenue', 'Response', 'MntGoldProds'.
  4. ‘Absurd’ and 'YOLO' marital status must be changed to the most frequent value for consistency.
  5. Rename 2n Cycle to 'Undergraduate'.
  6. Duplicates, if any must be dropped.
  7. Ensure all columns have appropriate and correct headers.
  8. Group records into appropriate sub categories- Age in steps of 10 years, Income in steps of $10k.
  9. Ensure records are sorted on the required fields.
  10. Validate that entire data set has been correctly loaded.

Additionally, the following data transformation rules have been applied to the American census data downloaded from the American Govt's Census data site.

  1. Collect only data for the target customer segment and their location.
  2. Identify the state and county correctly.
  3. Rename headers with correct labels.
  4. Convert numeric values from strings to numbers.

Analysis- Customer Segmentation

SCP1- Which customer segment provides the maximum sales? Data set analysed for age, education, marital status and income values by grouping the data set on the required category. Analysis:

  • Wine sales are significantly higher amongst the educated, particularly graduates.
  • Wine sales to "married" and "together" customers are the largest segment of wine customers.
  • Customers who buy wines are generally in the income range of $60-$80k.
  • It also seems that Winelist does not have customers in the higher income range of $100k and above.

Below is the visual representation of the analysis: Total Wine Sales per Income Bracket Wine Customers based on Age Groups Wine Sales based on Marital Status Total Sales based on Education per Marital Status Wine consumption based on Marital Status and Age Group Wine Sales based on Education and Income Bracket

SCP2- Which product in the food basket best aligns with wine consumption? This section analyses the customer Food Basket to understand the products associated with wine purchases.

Analysis: Based on below, customer who buys Wine generally purchases meat. Thsi association maybe used to develop product tie in campaigns.

Food Basket Comparison

Analysis- Channel Effectiveness

SCP3- Which channel provides the maximum sales? This section analyses stores sales versus web site sales based on the proportion of purchase transactions- at store and at website.

Analysis:

  • It can be said that instore sales are higher than sales through company website. Note that the data set is pre Covid 19.

Website versus Instore sales

Analysis- Campaign Effectivenes

SCP4- Do deals lead to an increase in sales? Which promotion campaigns are successful? To determine which campaign was the most effective, the successful acceptance of the campaign offer has been mapped. In the absence of specific channel sales data, corresponding customer sales values are used to overlay a sales trend line. Additionally catalog versus deals sales are analysed based on number of successful transactions.

Analysis:

  • Campaign 5 was the most effective in successfully concluding with a sale on either channel.
  • Catalogs seem to be more effective in bringing in customers to stores

Effectiveness- Marketing campaigns

Comparison- Deals versus Catalogs

Key Findings

  1. Older customers, particularly in the age group of 40-50 years followed by 50-60 years are the largest customer segment.
  2. There is direct correlation between education and wine consumption- people who have completed basic education are not wine consumers.
  3. Couples, either married or together, are the largest customer segment.
  4. There is a direct correlation between income and education which also leads to the conclusion that consumers who have higher incomes are wine consumers.Though in the data set provided, there are no customers with incomes of above $100k. Potentially a missing market segment.
  5. Store sales via catalogs are the highest. Note that the data set is pre covid 19. Since catalogs do better, either deals are not capturing interest or the cxustomer is price insensitive.
  6. Campaign 5 was the most effective as it had the highest acceptance. It appears that campaign 3 onwards, the sales trend is in the correct direction i.e. upwards.

Recommendations

Based on the latest US census data and the identified target customer segment of couples who are educated, it is recommended that Winelist Corporation direct their marketing dollars to the target counties depicted to maximise their sales.

Since campaign 5 appears to be the most effective, the company should use this to increase sales by directing some of gthe deals marketing dollars to this campaign.

SCP5- Based on customer and sales analysis overlaid with Census data, identify target markets

Top Wine States

Top Wine Counties

References

https://www.kaggle.com/

https://api.census.gov/data/

About

Winelist Corporation, a subsidiary of a boutique grocery chain based in US, has requested an analysis of its wine customers and sales. This analysis will serve as input into their wine sales maximisation and marketing spend strategy. Data visualisation tools have been used to present the findings and recommendations.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors