Customer Churn Prediction for Telecommunications Industry

Yabaş, Utku

Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.14365/21

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	İnce, Türker	-
dc.contributor.advisor	Çankaya, Hakkı Candan	-
dc.contributor.author	Yabaş, Utku	-
dc.date.accessioned	2023-06-16T12:27:28Z	-
dc.date.available	2023-06-16T12:27:28Z	-
dc.date.issued	2014	-
dc.identifier.uri	https://tez.yok.gov.tr/UlusalTezMerkezi/TezGoster?key=1zw6GvYMe-q3Hf6HR-3US5iZAo7DuRsgzhIKGnEM0Z9SCxURTLm_-h0hrYV90myu	-
dc.identifier.uri	https://hdl.handle.net/20.500.14365/21	-
dc.description.abstract	Müşteri kaybetmek, telekom firmaları açısından kaybettirdiği para bakımından önemli bir endişedir. Bu tez çalışmasında, en son veri madenciliği yöntemlerini analiz ederek, servislerden ayrılacak veya başka bir firmanın servisini kullanmayı düşünen müşterileri tahmin etmek için yeni metotlar geliştirdik. Önerdiğimiz yaklaşımın performansını yoğun bir şekilde değerlendirdik. Bu değerlendirmeyi yapmak için Orange Telecom tarafından "Knowledge Discovery and Data Mining 2009"(KDD) yarışması için sunduğu gerçek ve kullanıma açık bir veri kümesi kullandık. Bu veri kümesinde toplam 100.000 örnek ve 230 değişken bulunmaktadır. Bu yüzden veri kümesi "büyük veri" kapsamına girmektedir. IBM bu yarışmada birinci olmuştur, ancak önemli ölçüde bilişimsel kaynak kullanmaktadır. Biz alternatif metotlar ve daha uygun kaynaklar kullanarak, yarışmadaki en yüksek skorlara ulaşmayı hedefledik. Bu çalışmada, toplu sınıflandırıcı teknikleri üzerine yoğunlaştık. Tek ve güçlü sınıflandırıcılar ile en son toplu sınıflandırıcıları "müşteri ayrılma" problemi için karşılaştırdık. Ayrıca, bu metotların performanslarını arttırmak için iyi performans gösteren sınıflandırıcıları seçerek; bunları oylayıcı sınıflandırıcı ile birleştirdik. Genel olarak, elde ettiğimiz sonuçlar, yarışmanın en yüksek sonuç alan resmi yarışmacıları ile yakındı. Önerdiğimiz yaklaşımın, "müşteri ayrılması tahmini" dışındaki başka zorlayıcı otomatik öğrenme problem alanları için de değerli olabileceğine inanıyoruz. Yöntemimizin doğruluğunu onaylamak için, UCI Machine Learning kütüphanesinden topladığımız veri kümeleri ile deneyler yaptık. Bu deneyler sonucunda çoğu veri kümesinde yöntemimiz, içinde bulunan toplu sınıflandırıcıdaki bütün algoritmalardan daha iyi sonuçlar elde etmiştir.	en_US
dc.description.abstract	Customer churn is a concern for telecommunication service providers due to its associated costs. In this thesis, we analysed state-of-the-art data mining algorithms and developed novel methods to accurately predict customers who will change and turn to another provider for the same or similar service. We extensively evaluated performance of our proposed approach using a public and real dataset compiled by Orange Telecom for the Knowledge Discovery and Data Mining (KDD) 2009 Competition. This dataset has $100,000$ instances with $230$ attributes, which makes it a ``big data''. IBM achieved the highest score on this dataset requiring significant amount of computational resources. We aimed to find alternative methods that can match or improve the recorded highest score with more efficient use of resources. In our study, we focus on ensemble of classifiers techniques. We compared performance of single, powerful classifiers to state-of-the-art ensemble methods for churn detection problem. Additionally, we showed that these results can be further improved by combining selected subset of well performing classifiers by a voting classifier. Overall, the results with our proposed approach were similar to the official top scorers of the competition. We believe that our proposed approach can be valuable for solving other challenging machine learning problem domains (such as ``big data'' problems) rather than churn prediction.Also, we performed experiments using the selected datasets from the UCI Machine Learning repository. Our proposed approach outperforms the single powerful algorithms contained in the ensemble for most of the datasets tested.	en_US
dc.language.iso	en	en_US
dc.publisher	İzmir Ekonomi Üniversitesi	en_US
dc.rights	info:eu-repo/semantics/openAccess	en_US
dc.subject	m¨u¸steri kayıp tahmini	en_US
dc.subject	¸coklu sını?andırıcı	en_US
dc.subject	oylayan sını?andırıcı	en_US
dc.subject	otomatik ¨o^grenme	en_US
dc.subject	veri madencili^gi.	en_US
dc.subject	churn prediction	en_US
dc.subject	ensemble classi?er	en_US
dc.subject	voting classi?er	en_US
dc.subject	data mining,machine learning.	en_US
dc.subject	Bilgisayar Mühendisliği Bilimleri-Bilgisayar ve Kontrol	en_US
dc.subject	Computer Engineering and Computer Science and Control	en_US
dc.title	Customer Churn Prediction for Telecommunications Industry	en_US
dc.title.alternative	Telekomünikasyon Servislerinden Aboneliklerini İ·ptal Edecek Müşterileri Önceden Tahmin Etmek	en_US
dc.type	Master Thesis	en_US
dc.department	İEÜ, Lisansüstü Eğitim Enstitüsü, Akıllı Mühendislik Sistemleri Ana Bilim Dalı	en_US
dc.identifier.startpage	1	en_US
dc.identifier.endpage	103	en_US
dc.institutionauthor	Yabaş, Utku	-
dc.relation.publicationcategory	Tez	en_US
dc.identifier.yoktezid	353633	en_US
dc.identifier.scopusquality	N/A	-
dc.identifier.wosquality	N/A	-
item.openairecristype	http://purl.org/coar/resource_type/c_18cf	-
item.openairetype	Master Thesis	-
item.cerifentitytype	Publications	-
item.fulltext	With Fulltext	-
item.languageiso639-1	en	-
item.grantfulltext	open	-
crisitem.author.dept	05.04. Software Engineering	-
Appears in Collections:	Lisansüstü Eğitim Enstitüsü Tez Koleksiyonu

Files in This Item:

File	Size	Format
21.pdf	1.39 MB	Adobe PDF	View/Open

Show simple item record

CORE Recommender

Page view(s)

124

checked on Apr 14, 2025

Download(s)

54

checked on Apr 14, 2025

Google Scholar^TM

Check

Files in This Item:

Page view(s)

Download(s)

Google ScholarTM

Google Scholar^TM