Comparative analysis of tree-based intrusion detection modelling and machine learning classification models using cyber-security dataset

Mokoele, Motlatso Sarel

ULSpace Home
→
Faculty of Science and Agriculture
→
School of Mathematical & Computational Sciences
→
Theses and Dissertations (Computer Science)
→
View Item

dc.contributor.advisor	Mokwena, S. N.
dc.contributor.author	Mokoele, Motlatso Sarel
dc.date.accessioned	2025-02-10T07:08:30Z
dc.date.available	2025-02-10T07:08:30Z
dc.date.issued	2024
dc.identifier.uri	http://hdl.handle.net/10386/4884
dc.description	Thesis (M.Sc. (Computer Science)) -- University of Limpopo, 2024	en_US
dc.description.abstract	Cybersecurity has become an ever-pressing concern in the modern digital landscape, demanding robust and efficient intrusion detection systems. In this research, we conducted a comparative analysis of tree-based intrusion detection modelling and several popular machine learning classification models, using the widely used KDD99 dataset. To enhance the efficiency of the proposed model, we employ a hybrid feature selection method that combines the Gini index and information gain and incorporates them using the concepts of a decision tree (DT). Models under evaluation include DT, Support Vector Machine (SVM), K-Nearest Neighbours (KNN), and Logistic Regression (LR). We present a comprehensive evaluation of these models based on various performance metrics, including accuracy, F1 score, confusion matrix, precision, recall, and execution time. The dataset is meticulously pre-processed to eliminate noise and address any biases that may affect the results. The findings of this research reveal important insights into the strengths and weaknesses of different intrusion detection models. Our analysis sheds light on the performance variation between the tree-based model and SVM, KNN, and LR. In addition, we discuss the factors that contribute to the observed effectiveness of the model. The results demonstrate the effectiveness of the hybrid feature selection approach in enhancing the performance of tree-based models. In addition, we identify the most suitable models for specific performance criteria, guiding practitioners in selecting the appropriate model for their specific intrusion detection requirements. The results of this study contribute significantly to the advancement of intrusion detection techniques and provide valuable guidance to cybersecurity practitioners and researchers. The research highlights potential areas for further investigation and improvement, paving the way for more efficient and accurate intrusion detection systems in the future.	en_US
dc.format.extent	x, 123 leaves	en_US
dc.language.iso	en	en_US
dc.relation.requires	PDF	en_US
dc.subject	Cyber-security	en_US
dc.subject	Intrusion Detection	en_US
dc.subject	Machine Learning	en_US
dc.subject	Hybrid Feature Selection	en_US
dc.subject	Tree-based Intrusion Detection Modelling	en_US
dc.subject	Support Vector Machine	en_US
dc.subject	K-Nearest Neighbours	en_US
dc.subject	logistic regression	en_US
dc.subject	Decision Tree	en_US
dc.subject.lcsh	Computer security	en_US
dc.subject.lcsh	Cyber intelligence (Computer security)	en_US
dc.subject.lcsh	Data sets	en_US
dc.subject.lcsh	Machine learning	en_US
dc.title	Comparative analysis of tree-based intrusion detection modelling and machine learning classification models using cyber-security dataset	en_US
dc.type	Thesis	en_US