Customer retention is top priority for many companies, since the cost of acquiring new customers are several times more expensive than retaining existing ones. Therefore, Customer Churn Prediction is one of the most common applications in business. In fact, telecommunications and finance businesses were some of the earliest and widest adopters of customer retention applications.
In business, churn can be characterized as either subscription or non-subscription:
For subscription-based business:
For non-subscription-based business:
In this exercise, we develop machine learning models for a subscription-based businesses using Neural Network, Random Forest and 10 other algorithms. Our technology toolbox for this exercise are Tensorflow, Keras, and scikit-learn.
Monetize’s Machine Learning Accelerator Framework <Link to Blog: Machine Learning Accelerator Framework> shortens the life cycle of your data science project, through our learned best practices and automation of machine learning. The five major steps of this framework are:
The first and most important step is to clearly understand your business use case and goals.
A well-constructed model can inform a wide range of decisions (not just make a churn prediction) and and that information can flow into numerous internal tools and applications for further action.
By deploying a predictive machine learning model over large set of data points, we can gain new insights into user behavior, fuel new engagement strategies and impact customer retention.
A high quality Customer Churn Prediction Model will be able to provide these capabilities:
Use Case: Your Marketing department has designed a special retention offer to avoid customer attrition. How do we decide which customers should be offered the special retention deal prior to the expiration of their contracts?
In our use case, these will be Marketing Department, IT, Statisticians, Data Scientists. ***
A common situation in Customer Churn is a class imbalance in dataset. For example, 80% of the data are non-churning customers and 20% of the data are churning customers.
In Customer Churn situation, False Negatives are worse than False Positives. False Negatives are will-be-churned customers who will not be included in the marketing promotion. False Positives are customers who will not churned but receive the marketing promotion. A model with lower number of False Negatives is usually better.
For simplicity, we will use Accuracy Score as a metric for our dataset in this blog.
One approach to data collection is to take the data on prior churn and extract patterns, for example patterns of behavior that are useful and can help us to predict those customers who are more likely to leave in the future. Each customer might be described by a large number of attributes, such as age, usage, customer service history and many other factors.
Label Encoding (aka Integer Encoding)
Categorical data are variables that contain label values e.g. State, Country, size) rather than numeric values. Many machine learning algorithms such as neural networks are not able to support categorical values and require all input variables and output variables to be numeric.
In our dataset, we have categorical columns such as State, Subscribing to Internal Plan (Yes/No values), and Subscribing to Voicemail Plan (Yes/No values). These data are encoded to integers.
One-Hot Encoding
In addition, for categorical variables where no ordinal relationship exists, such as State, One-Hot Encoding is applied. Otherwise, it may result in poor performance or unexpected results in your prediction models because of a natural ordering between these integers.
Feature Scaling (aka Data Normalization)
We also normalize the data to values between [-1,1].
Machine Learning Accelerator Framework automatically builds models using various machine learning algorithms using the following algorithms:
The best results are obtained using tree-based algorithms and neural networks. Here they are:
In this blog, we covered different types of churn and illustrates a typical workflow to build your own customer churn prediction model. We also illustrate how Machine Learning Accelerator Framework enable your organization to utilize the industry’s best practices to build and evaluate machine learning models. Summary results of different models are illustrated at the end.
If you are interested in bringing your organization to ****** please email *******