Cardiovascular Diseases Prediction

Problem statement:

Cardiovascular diseases are the leading cause of death globally. It is therefore necessary to identify the causes and develop a system to predict heart attacks in an effective manner. The data has the information about the factors that might have an impact on cardiovascular health.

Task to be performed:

1.Preliminary analysis:

Perform preliminary data inspection and report the findings on the structure of the data, missing values, duplicates, etc.
Based on these findings, remove duplicates (if any) and treat missing values using an appropriate strategy

2.Prepare a report about the data explaining the distribution of the disease and the related factors using the steps listed below:

Get a preliminary statistical summary of the data and explore the measures of central tendencies and spread of the data
Identify the data variables which are categorical and describe and explore these variables using the appropriate tools, such as count plot
Study the occurrence of CVD across the Age category
Study the composition of all patients with respect to the Sex category
Study if one can detect heart attacks based on anomalies in the resting blood pressure (trestbps) of a patient
Describe the relationship between cholesterol levels and a target variable
State what relationship exists between peak exercising and the occurrence of a heart attack
Check if thalassemia is a major cause of CVD
List how the other factors determine the occurrence of CVD
Use a pair plot to understand the relationship between all the given variables

3.Build a baseline model to predict the risk of a heart attack using a logistic regression and random forest and explore the results while using correlation analysis and logistic regression (leveraging standard error and p-values from statsmodels) for feature selection

Setup and Installation:

pip install --upgrade pip
pip install -r requirements.txt
pip list

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
README.md		README.md
data.xlsx		data.xlsx
main.ipynb		main.ipynb
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Cardiovascular Diseases Prediction

Problem statement:

Task to be performed:

Setup and Installation:

About

Releases

Packages

Languages

Jigisha-p/Cardiovascular-Disease-Risk-Prediction

Folders and files

Latest commit

History

Repository files navigation

Cardiovascular Diseases Prediction

Problem statement:

Task to be performed:

Setup and Installation:

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages