<?xml version="1.0" encoding="UTF-8"?>
<TEI xml:space="preserve" xmlns="http://www.tei-c.org/ns/1.0" 
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" 
xsi:schemaLocation="http://www.tei-c.org/ns/1.0 https://raw.githubusercontent.com/kermitt2/grobid/master/grobid-home/schemas/xsd/Grobid.xsd"
 xmlns:xlink="http://www.w3.org/1999/xlink">
	<teiHeader xml:lang="en">
		<fileDesc>
			<titleStmt>
				<title level="a" type="main">Applying Machine Learning Methods to Forecasting Customer Churn for a Telecommunications Company *</title>
			</titleStmt>
			<publicationStmt>
				<publisher/>
				<availability status="unknown"><licence/></availability>
			</publicationStmt>
			<sourceDesc>
				<biblStruct>
					<analytic>
						<author>
							<affiliation key="aff0">
								<orgName type="institution">Sholom-Aleichem Priamursky State University</orgName>
								<address>
									<addrLine>70A, Shirokaya street</addrLine>
									<postCode>679015</postCode>
									<settlement>Birobidzhan</settlement>
									<country key="RU">Russian Federation</country>
								</address>
							</affiliation>
						</author>
						<author>
							<affiliation key="aff1">
								<orgName type="department">G.I.Nevelskoy</orgName>
								<orgName type="institution">Maritime State University named after</orgName>
								<address>
									<addrLine>50A, Verkhneportovaya street</addrLine>
									<postCode>690059</postCode>
									<settlement>Vladivostok</settlement>
									<country key="RU">Russian Federation</country>
								</address>
							</affiliation>
						</author>
						<author>
							<affiliation key="aff2">
								<orgName type="department">G.I.Nevelskoy</orgName>
								<orgName type="institution">Maritime State University named after</orgName>
								<address>
									<addrLine>50A, Verkhneportovaya street</addrLine>
									<postCode>690059</postCode>
									<settlement>Vladivostok</settlement>
									<country key="RU">Russian Federation</country>
								</address>
							</affiliation>
						</author>
						<author>
							<affiliation key="aff3">
								<orgName type="institution">Sumy State University</orgName>
								<address>
									<addrLine>2, Rymskogo-Korsakova street</addrLine>
									<postCode>40007</postCode>
									<settlement>Sumy</settlement>
									<country key="UA">Ukraine</country>
								</address>
							</affiliation>
						</author>
						<author>
							<affiliation key="aff4">
								<orgName type="institution">Irkutsk National Research Technical University</orgName>
								<address>
									<addrLine>83, Lermontova street</addrLine>
									<postCode>664074</postCode>
									<settlement>Irkutsk</settlement>
									<country key="RU">Russian Federation</country>
								</address>
							</affiliation>
						</author>
						<title level="a" type="main">Applying Machine Learning Methods to Forecasting Customer Churn for a Telecommunications Company *</title>
					</analytic>
					<monogr>
						<imprint>
							<date/>
						</imprint>
					</monogr>
					<idno type="MD5">860E9605AA5F0FE695DECF95B9BC1677</idno>
				</biblStruct>
			</sourceDesc>
		</fileDesc>
		<encodingDesc>
			<appInfo>
				<application version="0.7.2" ident="GROBID" when="2023-03-25T00:45+0000">
					<desc>GROBID - A machine learning software for extracting information from scholarly documents</desc>
					<ref target="https://github.com/kermitt2/grobid"/>
				</application>
			</appInfo>
		</encodingDesc>
		<profileDesc>
			<textClass>
				<keywords>
					<term>Machine Learning</term>
					<term>Customer Churn</term>
					<term>Telecommunications Company</term>
					<term>Area Under The Curve</term>
					<term>An Ensemble of Techniques</term>
				</keywords>
			</textClass>
			<abstract>
<div xmlns="http://www.tei-c.org/ns/1.0"><p>The paper presents a brief overview of existing approaches to predicting customer churn using the example of a telecommunications company. The authors provide research on churn forecasting using 11 different machine learning methods. For training, data containing 20 different information parameters about clients were used. The quality of education was assessed using the traditional Area Under the Curve characteristic. The paper also provides research results confirming that the use of ensembles of machine learning methods increases the quality of predicting customer churn.</p></div>
			</abstract>
		</profileDesc>
	</teiHeader>
	<text xml:lang="en">
		<body>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="1">Introduction</head><p>The number of clients for any company is undoubtedly an important parameter, and the more clients, the higher the company's profit. There is no monopoly on the provision of mobile services to customers, high-speed Internet access or cable television, therefore, the so-called outflow of customers is possible for one reason or another, most often associated with the transition to another telecommunications company offering more favorable conditions from the point of view of customers ... To ensure the stable operation of a telecommunications company, an analysis of the customer base is necessary, for example, to ensure targeted promotions, as well as an analysis of the reasons for customer churn, including with the ability to predict the number of customers who have left and the reasons for switching to the services of another telecommunications company. According to experts, attracting one new client costs companies on average seven times more (in some cases, from 5 to 20 times more) than retaining an existing one <ref type="bibr" target="#b0">[1]</ref><ref type="bibr" target="#b1">[2]</ref><ref type="bibr" target="#b2">[3]</ref>. Reducing customer churn by 5% increases the company's profits from 25% to 85% <ref type="bibr" target="#b3">[4]</ref>. Therefore, understanding how exactly to maintain customer engagement is a natural foundation for developing strategies for customer retention. In this regard, there is a number of scientific studies aimed both at predicting customer churn and predicting their preferences in any specific services. For example, in work <ref type="bibr" target="#b4">[5]</ref>, the authors try to understand the behavior of customers when paying bills, using such machine learning methods as logistic regression, one rule and support vector machines, they proposed an analytical approach to studying and predicting the payment behavior of customers. As input for the analytical system, they use: customer ID (a hashed unique number indicating each customer), action type (such as SMS, IVR, phone calls, service cut related and legal actions), the date of the action's execution, stage changer fl ag (such as payment through the banking system, unpaid invoice occurrence, or action timer fl ag). The feasibility of using machine learning methods is ensured by their ability to detect patterns in data of various natures and universes <ref type="bibr" target="#b5">[6]</ref>.</p><p>In <ref type="bibr" target="#b6">[7]</ref>, on the basis of Call Data Records data, various types of customers are determined, which subsequently affects the analysis of the possibility of their outflow and the impact on other customers of the company. Authors include these types of customers: follower, standard, leader, core and important customer. The authors note that customers who interact with the Leader and Important categories are more likely to churn after an influential member who also left the company. For clustering, the authors used neural networks such as linear perceptron, multilayer perceptron and networks with radial basis activation function. In the work of the authors <ref type="bibr" target="#b7">[8]</ref>, using the random forest classifier, decision tree classifier, gradient-boosted tree classifier and multiplexer perceptron, clients were segmented according to the "time-frequencymonetary" approach, while "Time" characterizes the total of calls duration and Internet sessions in a certain period of time, the "Frequency" parameter characterizes the frequency of using services frequently within a certain period, and the "Monetary" parameter is determined by the amount of money spent during a certain period.</p><p>It should be noted that there is often a situation where the assessment of the forecast of customer churn is carried out according to completely different criteria. For example, in <ref type="bibr" target="#b8">[9]</ref>, forecasting is based on the following information: region where the customer lives; time since the customer joined the operator (in months); average revenue; how long did the customer not pay the bills; amount that the customer is overdue; number of times the service was disconnected, etc. In the study <ref type="bibr" target="#b9">[10]</ref>, the following parameters are used to predict customer churn: state the US state, in which, the customer resides, the remaining seven-digit phone number, total number of calling minutes used during the day, the billed cost of daytime calls, total number of calling minutes used during the nighttime, and a few others. Moreover, some of these criteria may not be available to telcos, making it difficult to determine how, in practice, a telco will forecast these models. It is also worth mentioning several studies <ref type="bibr" target="#b10">[11]</ref><ref type="bibr" target="#b11">[12]</ref> on the use of customer analysis technologies in the banking sector, for example, to predict the likelihood of outflow based on information related to the sociodemographic parameters of customers, their activity in obtaining banking information, information about their salaries, and etc.</p><p>There is no unambiguous option for the machine learning methods used, there are a lot of them, it is almost impossible to choose an empirically suitable method, it is necessary to conduct training, in a practical way, selecting the optimal architecture and parameters of the machine learning method.</p><p>This paper presents the results of predicting the churn of customers of a telecommunications company using machine learning methods, since the latter are able to use statistical data, to determine dependencies between data of different nature.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="2">Preparation of training sample</head><p>The problem of predicting customer churn can be formulated as a binary classification problem <ref type="bibr" target="#b8">[9]</ref>. The solution to the problem is to classify customers with the corresponding characteristics (parameters) x ∈ X to one of the two classes Y = {no churn, churn}.</p><p>To solve such a problem, in this work, we used a dataset consisting of 21 columns and 7043 rows made publicly available by IBM and containing information about customers of a telecommunications company (available at https://www.kaggle.com/blastchar/telco-customer-churn). Information about each customer includes the following characteristics, which are identifying signs for predicting churn (they are presented in Tables <ref type="table" target="#tab_2">1-3</ref>), and, of course, each customer is assigned a unique customer identifier (customerID).</p><p>Also, data on the period of use of the services of a telecommunications company were used as input information, the parameter is calculated in months (tenure); the size of the client's monthly fee (MonthlyCharges) and the final amount of payments for the entire period of work with the client (TotalCharges). For each set of input information, consisting of 20 indicators, there is a predetermined output -Churncharacterizing whether the client left the telecommunications company with these parameters or not. Also, data on the period of use of the services of a telecommunications company were used as input information, the parameter is calculated in months (tenure); the size of the client's monthly fee (MonthlyCharges) and the final amount of payments for the entire period of work with the client (TotalCharges). For each set of input information, consisting of 20 indicators, there is a predetermined output -Churncharacterizing whether the client left the telecommunications company with these parameters or not.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="3">Modeling machine learning methods</head><p>The following machine learning methods have been selected for training, which have proven themselves well in solving various classification problems: AdaBoostadaptive boosting; Decision Tree -decisive trees; Extra Tree Classifier -random trees; Gradient Boosting -gradient boosting; KNeighbors -k-nearest neighbors method; Logistic Regression -logistic regression; Naive Bayes -naive Bayesian classifier; Neural Network -neural networks; Random Forest -random forest method; SVM -support vector machine; XGB -Gradient boosting on trees.</p><p>To assess the effectiveness of the models, the Area Under the Curve (AUC) characteristic was chosen, a statistical indicator that is often used in machine learning methods that determines the area bounded by a certain curve and an abscissa axis, called the ROC curve (from receiver operating characteristic). The use of AUC for binary classification problems is popular because of its simplicity, intuitive interpretation <ref type="bibr" target="#b8">[9]</ref>, and also its use in the case of unbalanced datasets <ref type="bibr" target="#b12">[13]</ref>. For a random classifier, the AUC is 0.5, and for an ideal classifier, the AUC is 1 <ref type="bibr" target="#b13">[14]</ref>. Since there are only two options for a conclusion in the problem being solved (whether the client leaves or not), it is the ROC analysis, which is a graphical method for assessing the quality of the work of a binary classifier, that is most promising for assessing the proposed method for predicting the churn of customers of a telecommunications company. To construct the ROC curve, a pair of the following values is used: sensitivity and specificity. The value "sensitivity" characterizes the share of truepositive classifications in the total number of positive observations and is marked along the vertical axis of the ROC-curve graph, and the value "specificity" characterizes the proportion of true-negative classifications in the total number of negative observations and is marked along the horizontal axis of the ROC-curve graph. It should be noted that the higher the "sensitivity" value, the more reliably the classifier recognizes positive examples, and the higher the "specificity" value, the more reliable the classifier recognizes negative observations. Thus, the ROC curve reflects the relationship between the probability of false alarms (proportion of falsepositive classifications) and the probability of "correct detection" (proportion of truepositive classifications). With an increase in sensitivity, the reliability of recognition of positive observations increases (the probability of "missing a target" decreases), but at the same time the probability of a false alarm increases. Table <ref type="table" target="#tab_3">4</ref> shows the learning outcomes of individual machine learning methods and their characteristics according to the AUC metric.  <ref type="table" target="#tab_3">4</ref> shows that three machine learning methods give the best results: LogisticRegression, AdaBoostClassifier, and XGB. For example, logistic regression gives correct results in 80.38% of examples. The authors decided to carry out computer modeling to identify ensembles of methods that optimally solve the problem of predicting the outflow of customers of a telecommunications company. For this, various combinations of the above machine learning methods were considered (Table <ref type="table" target="#tab_4">5</ref> shows a fragment of the results obtained).</p><p>Optimization of method ensembles according to the criterion of the maximum AUC value allowed us to identify the best ensemble of methods containing such machine learning methods as Logistic Regression, Gradient Boosting and XGB, while the number of correct conclusions such an ensemble produces in 81.37% of examples.</p></div>
<div xmlns="http://www.tei-c.org/ns/1.0"><head n="4">Conclusion</head><p>Thus, as a result of the work carried out, the authors investigated 11 different machine learning methods to solve the problem of predicting the churn of customers of a telecommunications company. The best machine learning method was LogisticRegression, which showed an AUC of 80.38%. The use of Logistic Regression in conjunction with other machine learning methods within the ensemble of machine learning methods allowed us to increase the forecasting quality to 81.37. Further research by the authors will be devoted to identifying those identification parameters that significantly affect the process of predicting customer churn in order to use a smaller number of identification features with constant values of the forecast quality. </p></div><figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_0"><head>Table 1 .</head><label>1</label><figDesc>Description of identification signs that have two answer options.</figDesc><table><row><cell>No.</cell><cell>Identification sign</cell><cell>Churn customers</cell><cell>Non Churn customers</cell></row><row><cell>1</cell><cell>Gender (gender) of the client (gender)</cell><cell>Male -50.7 % Female -49.3 %</cell><cell>Male -50.2 % Female-49.8 %</cell></row><row><cell>2</cell><cell>Whether the user is a Senior Citizen</cell><cell>0 -87.1 % 1 -12.9 %</cell><cell>0 -74.5 % 1 -25.5 %</cell></row><row><cell>3</cell><cell>The client has a partner (Partner)</cell><cell>Yes -52.8 % No -47.2 %</cell><cell>Yes -35.8 % No -64.2 %</cell></row><row><cell>4</cell><cell>Client has Dependents</cell><cell>Yes -34.3 % No -65.7 %</cell><cell>Yes -17.4 % No -82.6 %</cell></row><row><cell>5</cell><cell>The indicator of the client's phone number (PhoneService)</cell><cell>Yes -90.12 % No -9.88 %</cell><cell>Yes -90.9 % No -9.1 %</cell></row><row><cell>6</cell><cell>Paperless Billing</cell><cell>Yes -53.6 % No -46.4 %</cell><cell>Yes -74.9 % No -25.1 %</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_1"><head>Table 2 .</head><label>2</label><figDesc>Description of identification signs that have three answer options.</figDesc><table><row><cell>No.</cell><cell cols="2">Identification sign</cell><cell>Churn customers</cell><cell>Non Churn customers</cell></row><row><cell>1</cell><cell cols="2">The indicator of the presence of several communication lines at the client (MultipleLines)</cell><cell>Yes -41 % No -49.12 % No phone service -9.88 %</cell><cell>Yes -45.5 % No -45.4 % No phone service -9.1 %</cell></row><row><cell></cell><cell cols="2">The type of communication</cell><cell>DSL -37.9 %</cell><cell>DSL -24.6 %</cell></row><row><cell>2</cell><cell cols="2">line wire that the client is</cell><cell>Fiber optic -34.8 %</cell><cell>Fiber optic -69.4 %</cell></row><row><cell></cell><cell cols="2">using (InternetService)</cell><cell>No -27.3 %</cell><cell>No -6 %</cell></row><row><cell></cell><cell cols="2">Customer's Internet Security</cell><cell>Yes -33.3 %</cell><cell>Yes -15.8 %</cell></row><row><cell>3</cell><cell cols="2">Score (OnlineSecurity)</cell><cell>No -39.4 %</cell><cell>No -78.2 %</cell></row><row><cell></cell><cell></cell><cell></cell><cell>No internet service -27.3 %</cell><cell>No internet service -6 %</cell></row><row><cell></cell><cell cols="2">An indicator of whether the</cell><cell>Yes -36.8 %</cell><cell>Yes -28 %</cell></row><row><cell>4</cell><cell cols="2">customer is backing up</cell><cell>No -35.9 %</cell><cell>No -66 %</cell></row><row><cell></cell><cell>(OnlineBackup)</cell><cell></cell><cell>No internet service -27.3 %</cell><cell>No internet service -6 %</cell></row><row><cell></cell><cell cols="2">Indicator, the presence of</cell><cell>Yes -36.3 %</cell><cell>Yes -29.2 %</cell></row><row><cell>5</cell><cell>device</cell><cell>protection</cell><cell>No -36.4 %</cell><cell>No -64.8 %</cell></row><row><cell></cell><cell>(DeviceProtection)</cell><cell></cell><cell>No internet service -27.3 %</cell><cell>No internet service -6 %</cell></row><row><cell></cell><cell cols="2">Indicator whether the user</cell><cell>Yes -33.5 %</cell><cell>Yes -16.6 %</cell></row><row><cell>6</cell><cell cols="2">has technical protection</cell><cell>No -39.2 %</cell><cell>No -77.4 %</cell></row><row><cell></cell><cell>(TechSupport)</cell><cell></cell><cell>No internet service -27.3 %</cell><cell>No internet service -6 %</cell></row><row><cell></cell><cell cols="2">The indicator of whether the</cell><cell>Yes -36.6 %</cell><cell>Yes -43.6 %</cell></row><row><cell>7</cell><cell cols="2">client has live television</cell><cell>No -36.2 %</cell><cell>No -50.4 %</cell></row><row><cell></cell><cell cols="2">broadcasts (StreamingTV)</cell><cell>No internet service -27.3 %</cell><cell>No internet service -6 %</cell></row><row><cell></cell><cell cols="2">The indicator of whether the</cell><cell>Yes -37.1 %</cell><cell>Yes -43.8 %</cell></row><row><cell>8</cell><cell cols="2">client has streaming movies</cell><cell>No -35.6 %</cell><cell>No -50.2 %</cell></row><row><cell></cell><cell cols="2">(StreamingMovies)</cell><cell>No internet service -27.3 %</cell><cell>No internet service -6 %</cell></row><row><cell></cell><cell cols="2">Type of the concluded</cell><cell>Month-to-month -43 %</cell><cell>Month-to-month -88.6 %</cell></row><row><cell>9</cell><cell>payment</cell><cell>agreement</cell><cell>One year -25.3 %</cell><cell>One year -8.88 %</cell></row><row><cell></cell><cell>(Contract)</cell><cell></cell><cell>Two year -31.7 %</cell><cell>Two year -2.52 %</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_2"><head>Table 3 .</head><label>3</label><figDesc>Description of identification signs that have four answer options.</figDesc><table><row><cell>No.</cell><cell>sign Identification</cell><cell>Churn customers</cell><cell>Non Churn customers</cell></row><row><cell></cell><cell>Client payment</cell><cell>Mailed check -25.1 %</cell><cell>Mailed check -16.5 %</cell></row><row><cell>1</cell><cell>type (PaymentMethod)</cell><cell>Electronic check -25.1 % Credit card (automatic) -25 %</cell><cell>Electronic check -57.3 % Credit card (automatic) -12.4 %</cell></row><row><cell></cell><cell></cell><cell>Bank transfer (automatic) -24.8%</cell><cell>Bank transfer (automatic) -13.8%</cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_3"><head>Table 4 .</head><label>4</label><figDesc>Qualitative assessment of predicting customer churn using machine learning methods.</figDesc><table><row><cell>No.</cell><cell>Machine learning method</cell><cell>AUC value</cell></row><row><cell>1</cell><cell>LogisticRegression</cell><cell>0.8038</cell></row><row><cell>2</cell><cell>AdaBoostClassifier</cell><cell>0.8024</cell></row><row><cell>3</cell><cell>XGB</cell><cell>0.8024</cell></row><row><cell>4</cell><cell>GradientBoostingClassifier</cell><cell>0.7962</cell></row><row><cell>5</cell><cell>RandomForest</cell><cell>0.7815</cell></row><row><cell>6</cell><cell>ExtraTreesClassifier</cell><cell>0.7725</cell></row><row><cell>7</cell><cell>SVM</cell><cell>0.7687</cell></row><row><cell>8</cell><cell>KNeighbours</cell><cell>0.7659</cell></row><row><cell>9</cell><cell>Neural Network</cell><cell>0.763</cell></row><row><cell cols="2">10 Naive Bayes</cell><cell>0.7602</cell></row><row><cell cols="2">11 DecisionTree</cell><cell>0.7408</cell></row><row><cell cols="2">Table</cell><cell></cell></row></table></figure>
<figure xmlns="http://www.tei-c.org/ns/1.0" type="table" xml:id="tab_4"><head>Table 5 .</head><label>5</label><figDesc>Results of computer modeling of ensembles of machine learning methods.</figDesc><table><row><cell>No.</cell><cell>1 ensemble machine</cell><cell cols="2">2 ensemble machine</cell><cell cols="2">3 ensemble machine</cell><cell>AUC value</cell></row><row><cell></cell><cell>learning method</cell><cell cols="2">learning method</cell><cell cols="2">learning method</cell><cell>(%)</cell></row><row><cell>1</cell><cell>Logistic Regression</cell><cell cols="2">Gradient Boosting</cell><cell cols="2">gradient boosting on trees (XGB)</cell><cell>81.37</cell></row><row><cell>2</cell><cell>gradient boosting on trees (XGB)</cell><cell>adaptive (AdaBoost)</cell><cell>boosting</cell><cell cols="2">Gradient Boosting</cell><cell>80.71</cell></row><row><cell>3</cell><cell>k-nearest method (KNeighbors) neighbors</cell><cell cols="2">gradient boosting on trees (XGB)</cell><cell cols="2">neural networks</cell><cell>80.62</cell></row><row><cell></cell><cell>Logistic Regression</cell><cell cols="2">Gradient Boosting</cell><cell cols="2">k-nearest neighbors</cell></row><row><cell>4</cell><cell></cell><cell></cell><cell></cell><cell>method</cell><cell></cell><cell>80.23</cell></row><row><cell></cell><cell></cell><cell></cell><cell></cell><cell>(KNeighbors)</cell><cell></cell></row><row><cell>5</cell><cell>Logistic Regression</cell><cell cols="2">Gradient Boosting</cell><cell cols="2">support machine (SVM) vector</cell><cell>80.2</cell></row><row><cell>6</cell><cell>Logistic Regression</cell><cell cols="2">Gradient Boosting</cell><cell>random forest</cell><cell></cell><cell>80</cell></row><row><cell>7</cell><cell>random forest</cell><cell cols="2">support machine (SVM) vector</cell><cell cols="2">neural networks</cell><cell>79.9</cell></row><row><cell>8</cell><cell>k-nearest method (KNeighbors) neighbors</cell><cell cols="3">Logistic Regression naive classifier</cell><cell>bayes</cell><cell>79.43</cell></row><row><cell>9</cell><cell>k-nearest method (KNeighbors) neighbors</cell><cell>naive classifier</cell><cell>bayes</cell><cell cols="2">neural networks</cell><cell>79.43</cell></row><row><cell>10</cell><cell>k-nearest method (KNeighbors) neighbors</cell><cell>naive classifier</cell><cell>bayes</cell><cell cols="2">gradient boosting on trees (XGB)</cell><cell>79.34</cell></row></table></figure>
		</body>
		<back>
			<div type="references">

				<listBibl>

<biblStruct xml:id="b0">
	<analytic>
		<title level="a" type="main">Building comprehensible customer churn prediction models with advanced rule induction techniques</title>
		<author>
			<persName><forename type="first">W</forename><surname>Verbeke</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Expert systems with applications</title>
		<imprint>
			<biblScope unit="volume">38</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="2354" to="2364" />
			<date type="published" when="2011">2011</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b1">
	<analytic>
		<title level="a" type="main">Telecommunication subscribers&apos; churn prediction model using machine learning</title>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">A</forename><surname>Qureshi</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="m">Eighth International Conference on Digital Information Management</title>
				<meeting><address><addrLine>ICDIM; Islamabad</addrLine></address></meeting>
		<imprint>
			<publisher>IEEE</publisher>
			<date type="published" when="2013">2013. 2013</date>
			<biblScope unit="page" from="131" to="136" />
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b2">
	<analytic>
		<title level="a" type="main">The architecture of a churn prediction system based on stream mining</title>
		<author>
			<persName><forename type="first">B</forename><surname>Balle</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Artificial Intelligence Research and Development</title>
		<imprint>
			<biblScope unit="volume">256</biblScope>
			<biblScope unit="page" from="157" to="166" />
			<date type="published" when="2013">2013</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b3">
	<analytic>
		<title level="a" type="main">Churn Prediction in Telecommunication using Logistic Regression and Logit Boost</title>
		<author>
			<persName><forename type="first">H</forename><surname>Jain</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Khunteta</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Srivastava</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Procedia Computer Science</title>
		<imprint>
			<biblScope unit="volume">167</biblScope>
			<biblScope unit="page" from="101" to="112" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b4">
	<analytic>
		<title level="a" type="main">Using Behavioral Analytics to Predict Customer Invoice Payment</title>
		<author>
			<persName><forename type="first">M</forename><surname>Bahrami</surname></persName>
		</author>
		<author>
			<persName><forename type="first">B</forename><surname>Bozkaya</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><surname>Balcisoy</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Big Data</title>
		<imprint>
			<biblScope unit="volume">8</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="25" to="37" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b5">
	<analytic>
		<title level="a" type="main">Comparative models in customer base analysis: parametric model and observation-driven model</title>
		<author>
			<persName><forename type="first">S.-M</forename><surname>Xie</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Business Economics and Management</title>
		<imprint>
			<biblScope unit="volume">21</biblScope>
			<biblScope unit="issue">6</biblScope>
			<biblScope unit="page" from="1731" to="1751" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b6">
	<analytic>
		<title level="a" type="main">Social Network Analysis and Churn Prediction in Telecommunications Using Graph Theory</title>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">M</forename><surname>Kostić</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">I</forename><surname>Simić</surname></persName>
		</author>
		<author>
			<persName><forename type="first">M</forename><forename type="middle">V</forename><surname>Kostić</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Entropy</title>
		<imprint>
			<biblScope unit="volume">22</biblScope>
			<biblScope unit="issue">7</biblScope>
			<biblScope unit="page">753</biblScope>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b7">
	<analytic>
		<title level="a" type="main">Predictive analytics using big data for increased customer loyalty: Syriatel Telecom Company case study</title>
		<author>
			<persName><forename type="first">W</forename><forename type="middle">N</forename><surname>Wassouf</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Big Data</title>
		<imprint>
			<biblScope unit="volume">7</biblScope>
			<biblScope unit="page">29</biblScope>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b8">
	<analytic>
		<title level="a" type="main">Profit Driven Decision Trees for Churn Prediction</title>
		<author>
			<persName><forename type="first">B</forename><surname>Baesens</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">European journal of operational research</title>
		<imprint>
			<biblScope unit="volume">284</biblScope>
			<biblScope unit="issue">3</biblScope>
			<biblScope unit="page" from="920" to="933" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b9">
	<analytic>
		<title level="a" type="main">A Hybrid Swarm Intelligent Neural Network Model for Customer Churn Prediction and Identifying the Influencing Factors</title>
		<author>
			<persName><forename type="first">H</forename><surname>Faris</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Information</title>
		<imprint>
			<biblScope unit="volume">9</biblScope>
			<biblScope unit="issue">11</biblScope>
			<biblScope unit="page">288</biblScope>
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b10">
	<analytic>
		<title level="a" type="main">Propensity to Churn in Banking: What Makes Customers Close the Relationship with a Bank?</title>
		<author>
			<persName><forename type="first">C</forename><surname>Alexandru</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Economic Computation &amp; Economic Cybernetics Studies &amp; Research</title>
		<imprint>
			<biblScope unit="volume">54</biblScope>
			<biblScope unit="issue">2</biblScope>
			<biblScope unit="page" from="77" to="94" />
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b11">
	<analytic>
		<title level="a" type="main">Forecasting client retention -a machine-learning approach</title>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">E</forename><surname>Schaeffer</surname></persName>
		</author>
		<author>
			<persName><forename type="first">S</forename><forename type="middle">V R</forename><surname>Sánchez</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Retailing and Consumer Services</title>
		<imprint>
			<biblScope unit="volume">52</biblScope>
			<biblScope unit="page">101918</biblScope>
			<date type="published" when="2020">2020</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b12">
	<analytic>
		<title level="a" type="main">Customer churn prediction in telecom using machine learning in big data platform</title>
		<author>
			<persName><forename type="first">A</forename><forename type="middle">K</forename><surname>Ahmad</surname></persName>
		</author>
		<author>
			<persName><forename type="first">A</forename><surname>Jafar</surname></persName>
		</author>
		<author>
			<persName><forename type="first">K</forename><surname>Aljoumaa</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of Big Data</title>
		<imprint>
			<biblScope unit="volume">6</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page">28</biblScope>
			<date type="published" when="2019">2019</date>
		</imprint>
	</monogr>
</biblStruct>

<biblStruct xml:id="b13">
	<analytic>
		<title level="a" type="main">Benchmarking sampling techniques for imbalance learning in churn prediction</title>
		<author>
			<persName><forename type="first">B</forename><surname>Zhu</surname></persName>
		</author>
	</analytic>
	<monogr>
		<title level="j">Journal of the Operational Research Society</title>
		<imprint>
			<biblScope unit="volume">69</biblScope>
			<biblScope unit="issue">1</biblScope>
			<biblScope unit="page" from="49" to="65" />
			<date type="published" when="2018">2018</date>
		</imprint>
	</monogr>
</biblStruct>

				</listBibl>
			</div>
		</back>
	</text>
</TEI>
