To prevent churn, customers who are likely to leave must be identified as early as possible so that their expectations can be met. Thus, a newly developed RNN-based customer CP model with an improved EHO module is suggested for the telecommunications sector. This paper's major goal is to predict whether or not a consumer will leave. Extending packages or providing additional incentives or services to consumers who are inclined to migrate from the present service provider to another is an essential measure to stop them from doing so, because the businesses that provide such services are the ones that can generate the most revenue. In this paper, we use a client churn classification model with an optimization algorithm to find the at-risk customers, as depicted in Fig. 1 below.
The telecom CP dataset is collected as input. Initial pre-processing is then applied to the gathered data. The consumers are then screened based on their respective states and regions, and the clustering algorithm groups comparable clients together by state and area. The clustered data are then put through a second round of pre-processing, in which they are numeralized and normalised; this also keeps the complexity down. The most important and necessary features are then extracted from the pre-processed data, and the improved EHO technique selects the most appropriate of these characteristics and tunes the model's parameters. The classifier receives the chosen features as input, and the R-RNN effectively predicts whether or not a consumer will leave. If the client is predicted to be a CC, the customer's network usage history is reviewed and a threshold is fixed based on how much the client uses the network.
If the client uses the network extensively, the retention process is carried out to keep them on the same network; a consumer who uses the network sparingly receives no further attention.
4.1 Data collection Process
The data are primarily collected from the telecom CP dataset. This dataset contains information about the customers' demographics, network usage history, accounts, and related attributes.
4.2 Initial preprocessing Process
The preliminary pre-processing stage is carried out after data gathering. The obtained data are preprocessed by removing the duplicate customer entries from the dataset, and then they are transformed into a usable format.
4.3 Filtering and grouping Process
The data from the prior result are filtered once. The distinctive characteristics of the clients can be determined from this, so further analysis can be completed in less time and at lower cost.
The telecom customer records contain information about consumers from different states and countries, which can be used to identify these distinctive properties. Analyzing global consumer records as a whole is a genuinely complex endeavour; therefore, the customer records of the individual states and areas are aggregated and arranged into clusters (Cl) to ease this burden. Each record is connected to its nearest medoid via Euclidean distance computations.
Cl∗ = {Cl1, Cl2, Cl3, …, Cln} (1)
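The nearest-medoid assignment above can be sketched as follows. This is an illustrative Python fragment: the two-dimensional usage vectors and the medoids themselves are hypothetical stand-ins for the state- and area-level customer records.

```python
import numpy as np

def assign_to_medoids(points, medoids):
    """Assign each customer record (a numeric vector) to its nearest
    medoid using Euclidean distance, yielding the clusters of Eq. (1)."""
    points = np.asarray(points, dtype=float)
    medoids = np.asarray(medoids, dtype=float)
    # Distance matrix: rows are customers, columns are medoids.
    dists = np.linalg.norm(points[:, None, :] - medoids[None, :, :], axis=2)
    return dists.argmin(axis=1)

# Toy example: 2-D usage vectors for customers from two regions.
customers = [[1.0, 2.0], [1.2, 1.9], [8.0, 8.5], [7.9, 8.2]]
medoids = [[1.1, 2.0], [8.0, 8.3]]
labels = assign_to_medoids(customers, medoids)  # cluster index per customer
```

Each row of the distance matrix holds one customer's distances to all medoids, so `argmin` along the medoid axis returns the cluster index.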
4.4 Data Feature extraction Process
Two operations, numeralization and normalisation, are carried out before the feature extraction task. The pre-processing function is mathematically represented by,
ρre = Qρ[Cln] (2)
Where ρre represents the result of the preprocessing function, Cln implies the clustered data input, and Qρ signifies the preprocessing functions, symbolized by,
Qρ = [QNu, QNo] (3)
Where QNu infers the numeralization mode and QNo infers the normalization mode.
Numeralization transforms the string values or characters of the pre-processed data into a numerical representation; in other words, numerical data are created from the clustered data. The numeralization function is developed as,
ŃNu = QNu[Cln] (4)
Where ŃNu describes the results of numeralization function.
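As a sketch, numeralization of a categorical column (Eq. (4)) can be implemented by assigning integer codes in order of first appearance; the column values below are hypothetical.

```python
def numeralize(values):
    """Map each distinct string value to an integer code (Eq. (4)).
    Codes are assigned in order of first appearance."""
    codes, mapping = [], {}
    for v in values:
        if v not in mapping:
            mapping[v] = len(mapping)
        codes.append(mapping[v])
    return codes, mapping

# Hypothetical categorical column, e.g. the international-plan field.
plans = ["yes", "no", "no", "yes"]
codes, mapping = numeralize(plans)  # codes → [0, 1, 1, 0]
```

Returning the mapping alongside the codes lets the same encoding be reapplied to unseen records at prediction time.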
The normalisation technique improves the model's performance and training stability. The normalisation of the data uses log scaling. It calculates the log of the numbers and condenses a wide range into a small range. The log scale normalization is stated by,
ŇNo = log(Cln) (5)
Where Cln signifies the original value and ŇNo implies the normalised value.
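Eq. (5) amounts to taking the natural log of each positive value, which compresses a wide numeric range into a narrow one; a minimal sketch:

```python
import math

def log_scale(values):
    """Log-scale normalisation (Eq. (5)) for positive values."""
    return [math.log(v) for v in values]

# A wide range of usage values collapses into a narrow one.
usage_minutes = [1.0, 10.0, 100.0, 1000.0]
scaled = log_scale(usage_minutes)  # spans 0 .. log(1000) ≈ 6.9
```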
Feature Extraction (FE) aims to lessen the total number of features so that the resources required to process such huge data can be reduced. The mathematical expression for FE(ρre) is rendered by,
FE(ρre) = {FIp, FTd, FNv, FVm, FTdc, FTe, FTeh, FTdh, FTec} (6)
Where FIp is the international plan, FTd the total day minutes, FNv the number of voicemail messages, FVm the voicemail plan, FTdc the total day calls, FTe the total eve minutes, FTdh the total day charge, FTec the total eve calls, and FTeh the total eve charge; these are the essential and appropriate characteristics.
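Selecting the nine fields of Eq. (6) from a raw customer record can be sketched as below; the column names are hypothetical stand-ins for the dataset's actual headers.

```python
# Hypothetical column names for the nine features of Eq. (6).
FEATURE_COLUMNS = [
    "international_plan", "total_day_minutes", "number_vmail_messages",
    "voice_mail_plan", "total_day_calls", "total_eve_minutes",
    "total_day_charge", "total_eve_calls", "total_eve_charge",
]

def extract_features(record):
    """Keep only the churn-relevant fields of a customer record."""
    return {col: record[col] for col in FEATURE_COLUMNS}

record = {col: 1.0 for col in FEATURE_COLUMNS}
record["state"] = "KS"  # an extra field that should be dropped
features = extract_features(record)
```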
4.5 Feature selection Process
Feature selection can increase the model's accuracy, computing speed, and memory efficiency, so the FS method considerably improves the CP performance. The improved EHO is created for better FS; additionally, it is utilised to fine-tune the neural network model's predetermined parameters.
4.6 Enhanced EHO Algorithm
1: Initialize the population.
2: Repeat:
3: Calculate each elephant's fitness value with the fitness function and sort the elephants accordingly.
4: Apply the clan-updating operator. The matriarch, the elephant in clan i with the greatest fitness score, directs the movement of every elephant in the clan. The updated position is found to be
$${E}_{new,{c}_{i,j}}={E}_{{c}_{i,j}}+\vartheta *\left({E}_{best,{c}_{i}}- {E}_{{c}_{i,j}}\right)*u$$ (7)
Where \({E}_{new,{c}_{i,j}}\) denotes the new position of elephant j in clan i, \({E}_{{c}_{i,j}}\) symbolizes the former position of elephant j in clan i, \({E}_{best,{c}_{i}}\) represents the best-fitting position (the matriarch), \(\vartheta \in [0,1]\) is a scaling parameter, and \(u \in [0,1]\) is a random number used in the later phases of the algorithm to increase population variety. The position update for the clan's best candidate \({E}_{best,{c}_{i}}\) is estimated as:
$${E}_{new,{c}_{i}}={\mu *E}_{{center, c}_{i}}$$ (8)
$${E}_{{center, c}_{i}}= \frac{1}{{n}_{{c}_{i}}} {\sum }_{j=1}^{{n}_{{c}_{i}}}{E}_{{c}_{i,j,d}}$$ (9)
Where \(\mu \in [0,1]\) is the algorithm's second parameter, \({E}_{{center, c}_{i}}\) represents the clan's centre, and \({n}_{{c}_{i}}\) is the number of elephants in clan \({c}_{i}\).
5: Apply the separating operator. According to the following equation, a certain number of the elephants with the worst fitness value in each clan i are relocated to a new position,
$${E}_{worst,{c}_{i}}={E}_{min}+\left({E}_{max}- {E}_{min}+1\right)*rand$$ (10)
Where \(rand \in [0,1]\), \({E}_{min}\) denotes the search space lower bound, and \({E}_{max}\) denotes the search space upper bound.
6: Assess the population using the most recent position.
7: Repeat until the stopping criterion is met.
8: Return the population's best solution.
The update for the fittest elephant in the clan can also be described as:
Enew,ci,t = γ ∗ Ecenter,ci (11)
Enew,ci,t is obtained from the information of all the elephants present in clan ci. γ ∈ [0, 1] determines how much Enew,ci,t is affected by Ecenter,ci. Ecenter,ci is the centre of clan ci and can be obtained from Eq. (9) for the dth dimension, where 1 ≤ d ≤ D and D is the total number of dimensions.
FF(E) = (σ ∗ (FTd + FTdc + FTdh) + β ∗ (FIp + FNv + FVm) + ∂ ∗ (FTe + FTeh + FTec)) / (σ + β + ∂) (12)
Where (σ, β, ∂) ∈ [0, 1].
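Steps 1–8 can be condensed into the following sketch. The clan sizes, the parameter values, the single worst-elephant replacement per clan, and the omission of the "+1" term of Eq. (10) (so that new elephants stay inside the search bounds) are all illustrative choices, not the paper's exact settings.

```python
import random

def eho(fitness, dim, n_clans=2, clan_size=5, iters=50,
        lo=0.0, hi=1.0, theta=0.5, mu=0.1, seed=0):
    """Minimal sketch of the EHO loop (Eqs. (7)-(10)), maximising
    `fitness` over `dim`-dimensional positions in [lo, hi]."""
    rng = random.Random(seed)

    def rand_pos():
        return [rng.uniform(lo, hi) for _ in range(dim)]

    clans = [[rand_pos() for _ in range(clan_size)] for _ in range(n_clans)]
    gbest = max((e for c in clans for e in c), key=fitness)  # elitism
    for _ in range(iters):
        for clan in clans:
            clan.sort(key=fitness, reverse=True)  # matriarch first
            best, n = clan[0], len(clan)
            # Eq. (9): clan centre, dimension by dimension.
            center = [sum(e[d] for e in clan) / n for d in range(dim)]
            for j in range(n):
                if j == 0:
                    # Eq. (8): the matriarch moves toward the clan centre.
                    clan[j] = [mu * c for c in center]
                else:
                    # Eq. (7): followers move toward the matriarch.
                    u = rng.random()
                    clan[j] = [clan[j][d] + theta * (best[d] - clan[j][d]) * u
                               for d in range(dim)]
            # Separating operator (Eq. (10)): replace the worst elephant.
            clan.sort(key=fitness, reverse=True)
            clan[-1] = rand_pos()
        cand = max((e for c in clans for e in c), key=fitness)
        if fitness(cand) > fitness(gbest):
            gbest = cand
    return gbest

# Toy objective: positions close to 0.7 in every dimension score highest.
best = eho(lambda e: -sum((x - 0.7) ** 2 for x in e), dim=2)
```

Here the toy objective simply rewards positions near 0.7; in the paper, the fitness would be FF(E) of Eq. (12) evaluated on candidate feature subsets.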
4.7 Reformed RNN (R-RNN)
In essence, an RNN is a type of neural network in which the output of the previous step is fed as input to the current step. The primary and most important characteristic of RNNs is their hidden state, which retains some information about a sequence. In conventional feed-forward neural networks, each Hidden Layer (HL) includes its own set of weights and biases, whereas an RNN shares them across time steps.
Higher exploitation ability is therefore achieved thanks to the updating method of the enhanced elephant herding optimization. The search halts once the termination criterion, the number of completed iterations, is met, and the algorithm produces the optimal result based on the fitness values. The features chosen by the EHO module, S(FF), are therefore mathematically demonstrated as,
S(FF) = {α1, α2, α3,…, αt} (13)
This section defines the constituent parts of the problem statement, first with relation to the churn indicator and then with regard to the loss function. Churn, as used in business, is the regular loss of clients who cease all activity for a sufficient amount of time; depending on the industry, this time frame can be chosen arbitrarily. The procedure is the same as for an RNN [29], except that the model takes the aforementioned set of well-chosen features as its input in place of the original feature set.
The R-RNN is employed on the input data Ip = {α1, α2, α3, …, αt}, which encompasses a hidden vector sequence ĥhid = {ĥ1, ĥ2, ĥ3, …, ĥt} and an output vector sequence oop = {o1, o2, o3, …, ot}, obtained by iterating over the sequence from t = 1 to T in the following manner. The HL can be gauged as,
ĥt = ∂act[Wαĥαt + Wĥĥĥt−1 + Ba] (14)
Where the Wi terms signify the weight matrices (for instance, Wαĥ is the input-to-hidden weight matrix), the Ba terms imply bias vectors, and ∂act indicates the hidden layer AF, which is calculated using the stated swish function,
f(X) = X ∗ Sigmoid(λX) (15)
Here, λ is the model's trainable parameter.
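A minimal sketch of the swish activation of Eq. (15), with λ treated as an ordinary argument rather than a learned parameter:

```python
import math

def swish(x, lam=1.0):
    """Swish activation, Eq. (15): f(x) = x * sigmoid(lam * x)."""
    return x / (1.0 + math.exp(-lam * x))

# swish(0) = 0, and swish(x) ≈ x for large positive x.
```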
The Output Layer (OL) is then used to render the outcome. The OL is activated by the sigmoid AF, and its output can be determined by,
ot = σS[Wαoĥt + Ba] (16)
α = Wαoĥt + Ba (17)
Next, the sigmoid AF is gauged utilising the following relation,
σS(α) = 1 / (1 + e−α) (18)
To calculate the loss value, the squared difference between the actual value (α) and the predicted value (ά) is computed: Err = (α − ά)². If Err = 0, the model provides the exact answer; if Err ≠ 0, back-propagation is carried out by adjusting the weight values. Finally, the classification technique effectively predicts the CC with minimal misclassification error.
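One forward step of the recurrence (Eqs. (14)–(18)) together with the squared-error check can be sketched as follows. The dimensions and the random weights are illustrative, and λ is fixed at 1.

```python
import numpy as np

rng = np.random.default_rng(0)

def sigmoid(a):
    """Sigmoid output activation, Eq. (18)."""
    return 1.0 / (1.0 + np.exp(-a))

def rnn_step(x_t, h_prev, W_xh, W_hh, W_ho, b_h, b_o):
    """One recurrent step: hidden update (Eq. (14)) with a swish
    activation, then a sigmoid churn score (Eqs. (16)-(18))."""
    a = W_xh @ x_t + W_hh @ h_prev + b_h
    h_t = a * sigmoid(a)             # swish with lambda = 1
    o_t = sigmoid(W_ho @ h_t + b_o)  # churn probability in (0, 1)
    return h_t, o_t

# Toy dimensions: 3 selected features, 4 hidden units, 1 output.
n_in, n_hid = 3, 4
W_xh = rng.normal(size=(n_hid, n_in))
W_hh = rng.normal(size=(n_hid, n_hid))
W_ho = rng.normal(size=(1, n_hid))
b_h, b_o = np.zeros(n_hid), np.zeros(1)

h = np.zeros(n_hid)
h, o = rnn_step(rng.normal(size=n_in), h, W_xh, W_hh, W_ho, b_h, b_o)
err = float((1.0 - o[0]) ** 2)  # squared error against a churn label of 1
```

In training, a nonzero `err` would be back-propagated through the weight matrices; only the forward pass is shown here.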
4.8 Churn prediction
According to the R-RNN classifier, the final outcome takes one of two forms: CC and non-CC.
- Non-churn customer: a customer who is willing to stay with the same telecommunications network.
- Churn customer: a customer who is willing to switch to another telecommunications network.
4.9 Retention process
If the result is a CC, the specific customer's network usage history is checked, and the corresponding threshold values are set with their network utilisation as the focal point. If the customer's network usage remains high, i.e. above the threshold value, the customer retention process is carried out. Customer retention refers to the method of keeping existing customers on the same network by making a few alluring offers and discouraging them from switching to any other telecommunication network. In contrast, if a consumer only uses a small portion of the network, i.e. their network utilisation is below the threshold, they are ignored.
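The retention decision above reduces to a simple threshold rule; the usage threshold value below is hypothetical.

```python
def retention_action(is_churn, usage, threshold=500.0):
    """Decide the follow-up for a customer: predicted churners with
    heavy network usage enter the retention process, light users are
    ignored, and non-churners need no action."""
    if not is_churn:
        return "none"
    return "retain" if usage >= threshold else "ignore"

action = retention_action(is_churn=True, usage=800.0)  # → "retain"
```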