Data Mining Process for Heart Disease Patients' Data Using Python

jarrodtky/HeartDisease_EDA-ML_Python

HeartDisease_EDA-ML_Python

Project: Data Mining

Topic: Data Mining Process for Heart Disease Patients' Data

Keywords: Data Mining, Exploratory Data Analysis (EDA), Machine Learning, Heart Disease, Python

Table of Contents

Project Overview

Motivation

  • In the healthcare industry, understanding what factors or indicators affect a disease is an essential part of the decision-making and problem-solving process.
  • People with cardiovascular disease, or at high cardiovascular risk because of one or more risk factors (such as hypertension, diabetes or hyperlipidaemia) or already established disease, need early detection and management. A machine learning model or statistical analysis can be of great help here.
  • These indicators allow decision-makers to identify ways to reduce risk factors for future health problems and to increase the likelihood of effective disease prevention (Santos et al., 2019).

Aim & Objective

  • Aim:
    • To improve how patients' heart disease data is analyzed in the healthcare industry, enabling earlier detection and the avoidance of heart disease and morbidity.
  • Objective:
    • To build several Machine Learning models and select the best one for classifying patients into those who will develop heart disease in the future and those who will not. Selection is based on the importance of the data variables and on model evaluation metrics (e.g. Accuracy, Recall, AUC).
    • The insights gained by analyzing the importance of each data variable to the target variable will help establish which factors or indicators are critical in causing heart disease.
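
The objective above can be illustrated with a minimal sketch. It is not the notebook's actual code: the data is synthetic (generated with scikit-learn's `make_classification`), the feature names are placeholders, and the two candidate models and the choice of AUC as the selection metric are assumptions made for illustration. It shows one way to compare models on Accuracy, Recall and AUC and then rank variables by importance.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score, recall_score, roc_auc_score
from sklearn.model_selection import train_test_split

# Assumption: synthetic stand-in data. The real project would load
# HeartDisease_Dataset.csv, whose column names are not listed in this README.
X, y = make_classification(
    n_samples=500, n_features=8, n_informative=4, random_state=42
)
feature_names = [f"feature_{i}" for i in range(X.shape[1])]  # placeholder names

# Hold out a stratified test set so both classes appear in the evaluation.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=42
)

# Assumption: two illustrative candidate models, not the notebook's model list.
candidates = {
    "logistic_regression": LogisticRegression(max_iter=1000),
    "random_forest": RandomForestClassifier(n_estimators=200, random_state=42),
}

results = {}
for name, model in candidates.items():
    model.fit(X_train, y_train)
    predicted = model.predict(X_test)
    # AUC needs a score for the positive class rather than a hard label.
    positive_probability = model.predict_proba(X_test)[:, 1]
    results[name] = {
        "accuracy": accuracy_score(y_test, predicted),
        "recall": recall_score(y_test, predicted),
        "auc": roc_auc_score(y_test, positive_probability),
    }

# Assumption: AUC is used as the selection metric here.
best_name = max(results, key=lambda n: results[n]["auc"])

# Rank variables by the random forest's impurity-based importance.
forest = candidates["random_forest"]
ranked_features = sorted(
    zip(feature_names, forest.feature_importances_),
    key=lambda pair: pair[1],
    reverse=True,
)

for name, metrics in results.items():
    print(name, {k: round(v, 3) for k, v in metrics.items()})
print("best model by AUC:", best_name)
print("most important feature:", ranked_features[0][0])
```

In a screening setting Recall often deserves extra weight, since a missed positive case is usually costlier than a false alarm. Note also that feature importance indicates association with the target, not proof of causation.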

Contents

(1) HeartDisease_Dataset.csv

  • Heart Disease dataset file in CSV format.

(2) HeartDisease_EDA-ML_Python Folder

  • Contains the main Python notebook with the implementation code and explanations for the project.

Technologies Used

  • Jupyter Notebook
  • Visual Studio Code
  • Python
  • Pandas
  • Matplotlib
  • NumPy
  • scikit-learn

License

  • None (for now)

Credits

  • Took inspiration from Kaggle