Adebisi Abraham Owodunni
Application of Machine Learning Techniques for the Prediction of Heart Disease
Owodunni, Adebisi Abraham; Jaber, Tareq; Mian, Zhibao
Authors
Dr Tareq Al Jaber T.Al-Jaber@hull.ac.uk
Lecturer/Assistant Professor
Dr Zhibao Mian Z.Mian2@hull.ac.uk
Lecturer
Abstract
As important as the heart is to humans, unfortunately, 43% of death is from heart disease [2] declared by Global Burden of Disease research. By 2030, deaths from cardiovascular disease will reach 23.6 million where heart disease takes the lead [3]. Annually, 10 million people die globally according to World Health Organization (WHO). There have been (pre)established conventional ways of detecting this disease in humans like angiography, electrocardiograms among others, which are not only expensive for the common man, but have been proven, but over 17 million individuals have lost their lives to lack of expertise, incapacitation with several side effects [4]. According to a WHO survey, only 67% of the time, doctors can accurately predict heart disease. Hence the need for noninvasive and a more efficient technique thereby leveraging on Data Science (Machine Learning - ML). This research makes use of ML techniques to classifying Heart Disease through the comparative way of their metrics to predict heart disease in individuals, ii. Investigate the most relevant features and the risk factors contributing to predicting heart disease, iii. Evaluate the performance of the developed models using appropriate metrics, iv. Provide insights and recommendations for healthcare professionals to improve early diagnosis and intervention strategies. These involve four classifiers: XGBoost, Random Forest (RF), Logistic Regression (LR), and Support Vector Machine, to classify and predict heart disease using the Framingham heart disease dataset. Different models were built after handling missing values and outliers in the dataset. Before balancing the dataset, the models built, LR and RF gave the best performance with an accuracy of 85% each. The dataset was later balanced/resampled, and important features selection was done using the XGBoost classifier, Sequential Feature Selection (SFS) and KBest methods respectively, and these improved the performance of the model. Ensemble techniques (AdaBoost and Bagging) were adopted and the AdaBoost model (RF classifier) performed as high as giving an accuracy of 93%. Hyperparameter tuning was done involving Randomized SearchCV and Grid SearchCV, but none outperformed the AdaBoost model’s performance. Lastly, the balanced dataset was split into train and test datasets (ratio of 80:20), and a model was built/trained with the train dataset and then tested with the test dataset, this gave an accuracy of 93% as that of the AdaBoost model, but a better CV_score: 0.9110, R2_score: 0.7078, AUC curve: 0.98, RSME: 0.2701, MAE: 0.0730 with Random Forest classifier.
Citation
Owodunni, A. A., Jaber, T., & Mian, Z. (2024). Application of Machine Learning Techniques for the Prediction of Heart Disease. Acta Scientific Computer Sciences, 6(3), 13-23
Journal Article Type | Article |
---|---|
Acceptance Date | Feb 22, 2024 |
Online Publication Date | Feb 22, 2024 |
Publication Date | Feb 22, 2024 |
Deposit Date | Apr 27, 2024 |
Publicly Available Date | Apr 29, 2024 |
Journal | Acta Scientific Computer Sciences |
Publisher | Acta Scientific |
Peer Reviewed | Peer Reviewed |
Volume | 6 |
Issue | 3 |
Pages | 13-23 |
Keywords | Random Forest Classifier; Logistic Regression Classifier; Sequential Feature Selection; AdaBoost and Bagging; Support Vector Machine Classifier; XGBoost classifier |
Public URL | https://hull-repository.worktribe.com/output/4634452 |
Publisher URL | https://actascientific.com/ASCS/ASCS-06-0510.php |
Files
Published article
(1.7 Mb)
PDF
Publisher Licence URL
http://creativecommons.org/licenses/by/4.0
Copyright Statement
Copyright: © 2024 Adebisi Abraham Owodunni and Tareq Al-Jaber. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
You might also like
Assessing the Impact of Usability from Evaluating Mobile Health Applications
(2024)
Journal Article
Fairness, Bias, and Ethics in AI: Exploring the Factors Affecting Student Performance
(2024)
Journal Article
Creating a Classification Module to Analysis the Usage of Mobile Health Apps
(2022)
Journal Article
Downloadable Citations
About Repository@Hull
Administrator e-mail: repository@hull.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search