Financial sectors, particularly banking has undergone a steady transformation over the decades. New technology, change in demands and regulation have caused the bank to face challenge in securing dominant positions in society. Many banks today are with good customer size with a hope that they will not be tapped with their competitors. But existing intensified competition can pose threat in customer turnover. Customer turnover is also referred as customer churn which means that customer disconnects the association with company over a period of time. Retaining the existing customers and increasing their lifetime gains primary importance in banks. The cost of attracting new customers can be expensive than holding the existing customers. Long term customers provide higher profits and create new referrals. Availability of latest technology, customer friendly bank officials, location and services may be the reasons for customer switching. With the advent of internet banking, clear customer feedback is lacking and hence predicting customer churn at the right time becomes complex. Customer relationship can be endowed with early and accurate churn predictions. This makes the bank officials to be proactive in engaging the customers at right time and prevent the exit.
The practical issue is how to find signs for churn. Collecting feedback on customer experience may be time consuming task. Moreover surveys are infrequent and the response of customers to feedbacks cannot be guaranteed. Extracting early warnings from already existing data may provide a solution for this problem. Short prediction prospect in accuracy but it would be late if the customer has decided to quit. Generally churn can be defined at the product level like discontinuing particular product or at the relationship level like disengaging the bank itself. At relationship level customer’s view can be understood clearly and engage them with complementary products that can strengthen the relationship. There is a growing demand in studying the set of characteristics to analyse and predict customer churn. Machine learning and data analysis algorithms have the capability to learn from past customer data and can generate triggers on churn data. They try to capture all aspects of customer relationship with bank. Apart from churn prediction, they can help in recommendations of new or allied products and perform customer life time calculation. Prediction models built with machine learning techniques can identify customers who are likely to churn. The model analyses the historical data of past churners and finds similarity with existing customer data. If they match, current customers are labelled as potential churners.
Setting up a churn prediction model involves defining a churner, prediction attributes of the model and techniques used to build the model. Customer who closes the account or inactive for long time or decrease the number of transactions over a period of time are considered as churner. Normally churn is expressed as a degree of customer inactivity or disengagement, observed over a given time. This expresses within the account in various forms such as the frequency of account actions or change in the account balance. It is also useful to define churn based on the rate of decline of assets over a specified period. Four types of prediction attributes are commonly used in churn analysis: demographic, perception, behavioural and environment. Demographic variables include age, profession, gender, family and geographical data. Perception attributes reveal the customer appreciation to services. They include quality of service, locational convenience and pricing. Behavioural attributes exhibits how often services are used and which services are availed most. Environment variables refer to changes in world that could affect the customer.
Machine learning algorithms work on features and feature engineering is the process of creating features using domain knowledge of the data. Features play a pivotal role in capturing customer behaviour. Basic indicators like net balance outflow in the last few months, to more nuanced indicators like rate of change of average gap between bill payments made, can prove effective in providing early warning signals of impending churn.
Churn prediction belongs to classification problem category. Various techniques from simple logistic regression to complex tree-based techniques like XG Boost exist and it is important to identify the technique that provides the right balance of interpretability and performance. Complex algorithms such as random forest and XGBoost capture non-linear patterns in data and can handle null values comfortably. Logistic regression provides a more apparent and innate explanation of the impact of each variable on the predicted outcome.
Adoption of prediction models starts with demonstrating the model’s predictive power on the past data and running several simulations to measure the efficacy of the model and associated scheme. Testing involves selecting customers likely to churn and observe the model based predictions.
Banks acknowledge that customer churn is a critical problem, but there is no systematic and proactive methods to address it. Even after building robust churn prediction models, the challenge prevails in creating enablers in every phase. It involves setting up the churn likelihood scores periodically to offer right retention strategies.