Gradient Boosting for Heart Stroke Prediction: Investigating Unexpected Risk Factors
- 1 Department of AI & ML, Symbiosis Institute of Technology, Pune Campus, Symbiosis International (Deemed University), Pune, India
Abstract
Heart stroke prediction is a critical area in healthcare, aiming to identify individuals at risk and provide timely intervention. This research leverages machine learning algorithms, including Decision Tree, Random Forest, AdaBoost, and Gradient Boost, to predict the likelihood of stroke, with Gradient Boosting delivering the most accurate results. Our analysis uncovers intriguing and unexpected relationships between stroke risk and various factors such as heart disease, hypertension, and smoking habits. Contrary to conventional wisdom, our findings suggest that individuals with lower incidences of hypertension and heart disease exhibit increased stroke risk. Additionally, non-smokers appear to have a higher likelihood of experiencing a stroke compared to smokers. Furthermore, Body Mass Index (BMI), marital status, residence type, and work type also significantly influence stroke risk. These anomalous findings necessitate further investigation to understand the underlying causes and implications. This study highlights the importance of using advanced machine learning techniques to uncover complex patterns in health data, which can lead to more effective prevention strategies.
DOI: https://doi.org/10.3844/jcssp.2025.124.133
Copyright: © 2025 Aniket Kailas Shahade and Priyanka V. Deshmukh. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
- 199 Views
- 110 Downloads
- 0 Citations
Download
Keywords
- Heart Stroke Prediction
- Gradient Boosting
- Machine Learning
- Hypertension
- Heart Disease
- Smoking
- Body Mass Index
- Demographic Factors
- Health Data Analysis
- Risk Factors