St. Mary's University Institutional Repository

Please use this identifier to cite or link to this item: http://hdl.handle.net/123456789/7883
Full metadata record
DC Field | Value | Language
dc.contributor.author | Birhanu, Misrak | -
dc.date.accessioned | 2024-04-29T11:51:42Z | -
dc.date.available | 2024-04-29T11:51:42Z | -
dc.date.issued | 2024-02 | -
dc.identifier.uri | http://hdl.handle.net/123456789/7883 | -
dc.description.abstract | Telecommunication operators play a vital role in connecting individuals and businesses worldwide, facilitating seamless communication and fostering global connectivity. However, the telecommunications industry is vulnerable to malicious activity by fraudsters seeking to exploit weaknesses in the system. One form of fraud that has emerged as a significant challenge for telecom operators is SIM-box fraud. This thesis aims to develop a model that detects SIM-box fraudulent subscribers in near real time. To achieve this, we set up an API integration with the Ethio telecom CRM and CBS environment to retrieve call detail record (CDR) flat files (textual DB) on an hourly basis. We then developed a function in ASP.NET (C#) that preprocesses the raw text data and stores it in a database, configured on an SQL server, that holds CDR, Voice, SMS, and Data tuples. An SQL view joins the CDR, Voice, SMS, and Data tables so that all required attributes are available in one place for further analysis. Next, we aggregated the tuples using SQL queries and created a C# function that derives additional attributes to help track call behavior. Once we had analyzed and compiled the call detail records, which incorporate the Voice, SMS, and Data utilization of each subscriber, we split the data into 1_Hour, 1_Day, and 7_Day datasets. Finally, we fed the preprocessed, aggregated, and analyzed datasets to the Random Forest (RF), Support Vector Machine (SVM), and Neural Network (NN) algorithms using the scikit-learn (sklearn) Python library; 100% accuracy was recorded for the RF and NN algorithms on all of the 1_Hour, 1_Day, and 7_Day datasets. Hence, we conclude that, with a good CDR analysis engine or module, RF and NN can effectively identify possible fraudulent subscribers. | en_US
dc.language.iso | en | en_US
dc.publisher | St. Mary's University | en_US
dc.subject | Telecommunications fraud, SIM box, international call bypass, Machine learning, Call Detail Record, Classification, Voice call termination | en_US
dc.title | Near Real-time SIM-box Fraud Detection in Telecommunication System Using Machine Learning Approach in the Case of Ethio Telecom. | en_US
dc.type | Thesis | en_US
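
The abstract describes aggregating CDR tuples per subscriber and splitting them into 1-hour, 1-day, and 7-day datasets before classification. The following is a minimal sketch of that windowed aggregation step, not the thesis's implementation: the thesis performs this stage with SQL queries and C#, and Python is used here only because the classification stage uses scikit-learn. The record layout, the service labels ("voice", "sms", "data"), and the derived features (such as `callee_diversity`) are all illustrative assumptions, since the abstract does not list the actual attributes.

```python
from collections import defaultdict
from datetime import datetime, timedelta

# Hypothetical record layout: (caller, callee, start_time, duration_seconds, service)
WINDOWS = {
    "1_hour": timedelta(hours=1),
    "1_day": timedelta(days=1),
    "7_day": timedelta(days=7),
}


def aggregate(records, window_end, window):
    """Summarise each caller's activity in [window_end - window, window_end)."""
    window_start = window_end - window
    stats = defaultdict(lambda: {
        "voice_calls": 0, "voice_seconds": 0,
        "sms": 0, "data_sessions": 0, "callees": set(),
    })
    for caller, callee, start, duration, service in records:
        if not (window_start <= start < window_end):
            continue  # record falls outside this window
        s = stats[caller]
        if service == "voice":
            s["voice_calls"] += 1
            s["voice_seconds"] += duration
            s["callees"].add(callee)
        elif service == "sms":
            s["sms"] += 1
        elif service == "data":
            s["data_sessions"] += 1

    features = {}
    for caller, s in stats.items():
        calls = s["voice_calls"]
        features[caller] = {
            "voice_calls": calls,
            "voice_seconds": s["voice_seconds"],
            "distinct_callees": len(s["callees"]),
            "sms": s["sms"],
            "data_sessions": s["data_sessions"],
            # Assumed feature: share of voice calls going to distinct numbers.
            "callee_diversity": len(s["callees"]) / calls if calls else 0.0,
        }
    return features


# Made-up sample records, for illustration only.
now = datetime(2024, 1, 8, 12, 0)
records = [
    ("A", "X", now - timedelta(minutes=10), 60, "voice"),
    ("A", "Y", now - timedelta(minutes=20), 30, "voice"),
    ("A", "Z", now - timedelta(hours=5), 45, "voice"),
    ("B", "A", now - timedelta(days=3), 0, "sms"),
    ("B", None, now - timedelta(minutes=5), 0, "data"),
]

# One feature table per window, keyed like the dataset names in the abstract.
datasets = {name: aggregate(records, now, w) for name, w in WINDOWS.items()}
```

Each entry in `datasets` maps a subscriber to a feature dictionary for that window; rows of this kind could then be labelled and passed to a classifier.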
Appears in Collections: Master of computer science

Files in This Item:
File | Description | Size | Format
16. Misrak Birhanu.pdf | | 3.73 MB | Adobe PDF


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.