Show simple item record

dc.contributor.advisor Modipa, T. S.
dc.contributor.author Mabunda, Judith Goodness Khanyisa
dc.date.accessioned 2024-09-04T13:24:15Z
dc.date.available 2024-09-04T13:24:15Z
dc.date.issued 2023
dc.identifier.uri http://hdl.handle.net/10386/4568
dc.description Thesis (M.Sc. (e-Science)) -- University of Limpopo, 2023 en_US
dc.description.abstract The rising amount of fraud in claims has been of great concern to the insurance companies. In this research work, we developed two machine learning models namely, Extreme Gradient Boosting (XGBoost) and Random Forest for the purpose of insurance fraud detection based on auto insurance claims data. The models detect fraudulent claims and classify them into fraudulent or non-fraudulent. Different data pre-processing techniques are used to clean, explore, and extract relevant features. The effectiveness of the algorithms are observed using performance evaluation metrics: precision, recall and f1 score and confusion matrix. We also introduced the Synthetic Minority Oversampling (SMOTE) and Random Oversampling (ROS) data augmentation techniques to handle the imbalanced data and compare the results of the models before and after the data is balanced. The comparative results of classification algorithms conclude that the XGBoost model is effective in fraud detection than the Random Forest model on imbalanced data. In addition to this, the Random Forest model was effective in predicting fraudulent claims when the data augmentation techniques were applied. en_US
dc.format.extent vii, 61 leaves en_US
dc.language.iso en en_US
dc.relation.requires PDF en_US
dc.subject Insurance fraud detection en_US
dc.subject Gradient boosting en_US
dc.subject Random forest algorithms en_US
dc.subject Insurance claims en_US
dc.subject.lcsh Artificial intelligence en_US
dc.subject.lcsh Application software en_US
dc.subject.lcsh Computer science -- Congresses en_US
dc.subject.lcsh Technology -- Congresses en_US
dc.subject.lcsh Insurance fraud en_US
dc.subject.lcsh Application software -- Development en_US
dc.title Insurance fraud detection using extreme gradient boosting and random forest algorithms en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search ULSpace


Browse

My Account