A Comprehensive Investigation of the Performances of Different Machine Learning Classifiers with SMOTE-ENN Oversampling Technique and Hyperparameter Optimization for Imbalanced Heart Failure Dataset

Show simple item record

dc.contributor.author Al-Monsur, Abdullah
dc.contributor.author Ratul, Ishrak Jahan
dc.contributor.author Ar-Rafi, Abrar Mohammad
dc.date.accessioned 2023-05-04T05:55:52Z
dc.date.available 2023-05-04T05:55:52Z
dc.date.issued 2022-05-30
dc.identifier.citation [1] S. Elyassami and A. A. Kaddour, “Implementation of an incremental deep learning model for survival prediction of cardiovascular patients,” IAES International Journal of Artificial Intelligence (IJ-AI, vol. 10, no. 1, pp. 101–109, 2021. [2] S. Rahayu, J. Jaya Purnama, A. Baroqah Pohan, F. Septia Nugraha, S. Nurdiani, and S. Hadianti, “PREDICTION OF SURVIVAL OF HEART FAILURE PATIENTS USING RANDOM FOREST,” Tech. Rep. 2, sep 2020. [3] O. O. Oladimeji and O. Oladimeji, “Predicting Survival of Heart Failure Patients Using Classification Algorithms,” JITCE (Journal of Information Technology and Computer Engineering), vol. 4, pp. 90–94, sep 2020. [4] R. Gürfidan and M. Ersoy, “Classification of Death Related to Heart Failure by Machine Learning Algorithms,” Tech. Rep. 1, jan 2021. [5] A. Ishaq, S. Sadiq, M. Umer, S. Ullah, S. Mirjalili, V. Rupapara, and M. Nappi, “Improving the Prediction of Heart Failure Patients’ Survival Using SMOTE and Effective Data Mining Techniques,” IEEE Access, vol. 9, pp. 39707–39716, 2021. [6] D. Chicco and G. Jurman, “Machine learning can predict survival of patients with heart failure from serum creatinine and ejection fraction alone,” BMC Medical Informatics and Decision Making, vol. 20, no. 1, pp. 1–16, 2020. [7] T. Ahmad, A. Munir, S. H. Bhatti, M. Aftab, and M. A. Raza, “Survival analysis of heart failure patients: A case study,” PLoS ONE, vol. 12, p. e0181001, jul 2017. [8] P. E. Rubini, C. A. Subasini, A. Vanitha Katharine, V. Kumaresan, S. Gowdhamkumar, and T. M. Nithya, “A cardiovascular disease prediction using machine learning algorithms,” Annals of the Romanian Society for Cell Biology, vol. 25, no. 2, pp. 904–912, 2021. [9] L. Ali, A. Niamat, J. A. Khan, N. A. Golilarz, X. Xingzhong, A. Noor, R. Nour, and S. A. C. Bukhari, “An Optimized Stacked Support Vector Machines Based Expert System for the Effective Prediction of Heart Failure,” IEEE Access, vol. 7, pp. 54007–54014, 2019. 50 BIBLIOGRAPHY BIBLIOGRAPHY [10] B. Dhomse Kanchan and M. Mahale Kishor, “Study of machine learning algorithms for special disease prediction using principal of component analysis,” Proceedings - International Conference on Global Trends in Signal Processing, Information Computing and Communication, ICGTSPICC 2016, pp. 5–10, 2016. [11] A. Rahman, A. W. Chowdhury, H. I. Khan, K. M. Sabah, M. G. Amin, and S. M. Azad, “Association of high sensitive C - Reactive protein with severity of the left ventricular systolic dysfunction in acute anterior ST elevation myocardial infarction,” Bangladesh Medical Research Council Bulletin, vol. 44, pp. 71–76, nov 2018. [12] M. Aljanabi, M. Qutqut, and M. Hijjawi, “Machine Learning Classification Techniques for Heart Disease Prediction: A Review,” International Journal of Engineering and Technology, vol. 7, no. October, pp. 5373–5379, 2018. [13] D. S. P. Jaymin Patel, Prof.TejalUpadhyay, “Heart Disease Prediction Using Machine learning and Data Mining Technique,” International Journal of Computer Science and Engineering, vol. 7, no. 6, pp. 17–19, 2015. [14] M. C. S. Dangare and D. M. S. S. Apte, “A DATA MINING APPROACH FOR PREDICTION OF HEART DISEASE USING NEURAL NETWORKS,” nternational Journal of Computer Engineering and Technology (IJCET), ISSN 0976 – 6367(Print), ISSN 0976 – 6375(Online) Volume 3, Issue, vol. 3, no. Issue 3, October - December, pp. 30–40, 2012. [15] I. A. Zriqat, A. M. Altamimi, and M. Azzeh, “A Comparative Study for Predicting Heart Diseases Using Data Mining Classification Methods,” vol. 14, no. 12, pp. 868–879, 2016. [16] M. Shouman, T. Turner, and R. Stocker, “Applying k-Nearest Neighbour in Diagnosing Heart Disease Patients,” International Journal of Information and Education Technology, pp. 220–223, 2012. [17] S. Uddin, A. Khan, M. E. Hossain, and M. A. Moni, “Comparing different supervised machine learning algorithms for disease prediction,” BMC Medical Informatics and Decision Making, vol. 19, no. 1, pp. 1–16, 2019. [18] A. Mustaqeem, S. M. Anwar, M. Majid, and A. R. Khan, “Wrapper method for feature selection to classify cardiac arrhythmia,” Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBS, pp. 3656–3659, 2017. [19] N. Khateeb and M. Usman, “Efficient heart disease prediction system using Knearest neighbor classification technique,” ACM International Conference Proceeding Series, pp. 21–26, 2017. 51 BIBLIOGRAPHY BIBLIOGRAPHY [20] A. K. Sonam Nikhar, “Prediction of heart disease using machine learning algorithms,” International Journal of Advanced Engineering, Management and Science (IJAEMS), pp. 197–202, 2016. [21] “UCI Machine Learning Repository: Heart failure clinical records Data Set.” en_US
dc.identifier.uri http://hdl.handle.net/123456789/1877
dc.description Supervised by Mr. Fahim Faisal, Assistant Professor, Co-Supervisor Mr. Mirza Muntasir Nishat, Assistant Professor, Department of Electrical and Electronic Engineering (EEE), Islamic University of Technology (IUT), Board Bazar, Gazipur-1704, Bangladesh. This thesis is submitted in partial fulfillment of the requirements for the degree of Bachelor of Science in Electrical and Electronic Engineering, 2022. en_US
dc.description.abstract The chronic cardiac condition myocardial infarction (heart failure) is characterized by decreased blood supply to the body as a result of the heart muscles’ impaired contractile properties. Patients with heart failure, like those with any other cardiac disorder, have difficulty performing daily activities and have a shorter life expectancy, with the vast majority of cases resulting in death at some point during the patient’s lifetime. Treatment outcomes and patient quality of life improve significantly when patients with heart failure are identified early and are likely to survive. As a result, machine learning techniques can be extremely beneficial in this situation because they can be used to predict the survival of heart failure patients in advance, allowing patients to receive the most appropriate treat- ment at the earliest possible stage. As a result, six supervised machine learning algorithms were applied to a dataset of 299 people from the University of California, Irvine Machine Learning Repository in order to predict their chances of surviving heart failure. There were a variety of algorithms used in this study including Decision Tree Classifier, Logistic Regression, Gaussian Nave Bayes, Random Forest Classifier, K-Nearest Neighbors, and Support Vector Machine, among others. Prior to scaling the data, a preprocessing step was carried out, and both the standard and min-max scaling methods were employed. When it came to optimizing the hyperparameters, the techniques grid-search cross validation and random search cross validation were combined. Data resampling techniques such as the edited nearest neighbor (SMOTE-ENN) and synthetic minority oversampling (SMOTE) data resampling are also employed (SMOTE-ENN). It has been thoroughly compared and analyzed the outcomes of all of the different approaches. As a result of these findings, the Random Forest Classifier (RFC) outperforms all other approaches, achieving a test accuracy of 90 percent when compared to the other approaches when SMOTE-ENN and the standard scaling technique are employed. With the help of an imbalanced dataset, this comprehensive investigation vividly illustrates the application and compatibility of sev- eral machine learning algorithms. Among the methods for improving the performance of machine learning algorithms discussed in this investigation are the SMOTE-ENN algo- rithm and hyperparameter optimization. en_US
dc.language.iso en en_US
dc.publisher Department of Electrical and Electronic Engineering, Islamic University of Technology (IUT) The Organization of Islamic Cooperation (OIC) Board Bazar, Gazipur-1704, Bangladesh en_US
dc.subject Machine learning, hyperparameter tuning, smapling, prediction en_US
dc.title A Comprehensive Investigation of the Performances of Different Machine Learning Classifiers with SMOTE-ENN Oversampling Technique and Hyperparameter Optimization for Imbalanced Heart Failure Dataset en_US
dc.type Thesis en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search IUT Repository


Advanced Search

Browse

My Account

Statistics