Comparative analysis of tree-based intrusion detection modelling and machine learning classification models using cyber-security dataset

dc.contributor.advisorMokwena, S. N.
dc.contributor.authorMokoele, Motlatso Sarel
dc.date.accessioned2025-02-10T07:08:30Z
dc.date.available2025-02-10T07:08:30Z
dc.date.issued2024
dc.descriptionThesis (M.Sc. (Computer Science)) -- University of Limpopo, 2024en_US
dc.description.abstractCybersecurity has become an ever-pressing concern in the modern digital landscape, demanding robust and efficient intrusion detection systems. In this research, we conducted a comparative analysis of tree-based intrusion detection modelling and several popular machine learning classification models, using the widely used KDD99 dataset. To enhance the efficiency of the proposed model, we employ a hybrid feature selection method that combines the Gini index and information gain and incorporates them using the concepts of a decision tree (DT). Models under evaluation include DT, Support Vector Machine (SVM), K-Nearest Neighbours (KNN), and Logistic Regression (LR). We present a comprehensive evaluation of these models based on various performance metrics, including accuracy, F1 score, confusion matrix, precision, recall, and execution time. The dataset is meticulously pre-processed to eliminate noise and address any biases that may affect the results. The findings of this research reveal important insights into the strengths and weaknesses of different intrusion detection models. Our analysis sheds light on the performance variation between the tree-based model and SVM, KNN, and LR. In addition, we discuss the factors that contribute to the observed effectiveness of the model. The results demonstrate the effectiveness of the hybrid feature selection approach in enhancing the performance of tree-based models. In addition, we identify the most suitable models for specific performance criteria, guiding practitioners in selecting the appropriate model for their specific intrusion detection requirements. The results of this study contribute significantly to the advancement of intrusion detection techniques and provide valuable guidance to cybersecurity practitioners and researchers. The research highlights potential areas for further investigation and improvement, paving the way for more efficient and accurate intrusion detection systems in the future.en_US
dc.format.extentx, 123 leavesen_US
dc.identifier.urihttp://hdl.handle.net/10386/4884
dc.language.isoenen_US
dc.relation.requiresPDFen_US
dc.subjectCyber-securityen_US
dc.subjectIntrusion Detectionen_US
dc.subjectMachine Learningen_US
dc.subjectHybrid Feature Selectionen_US
dc.subjectTree-based Intrusion Detection Modellingen_US
dc.subjectSupport Vector Machineen_US
dc.subjectK-Nearest Neighboursen_US
dc.subjectlogistic regressionen_US
dc.subjectDecision Treeen_US
dc.subject.lcshComputer securityen_US
dc.subject.lcshCyber intelligence (Computer security)en_US
dc.subject.lcshData setsen_US
dc.subject.lcshMachine learningen_US
dc.titleComparative analysis of tree-based intrusion detection modelling and machine learning classification models using cyber-security dataseten_US
dc.typeThesisen_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
mokoele_ms_2024.pdf
Size:
2.39 MB
Format:
Adobe Portable Document Format
Description:
Thesis

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.61 KB
Format:
Item-specific license agreed upon to submission
Description: