
This repository contains a KNN classification model on Pima Indians Diabetes Database


nilotpal-basu/Diabetes-Prediction-On-Pima-Indians-Diabetes-Database


Diabetes Prediction on Pima Indians Diabetes Database

About the dataset:

  • The dataset used is the Pima Indians Diabetes Database.
  • This dataset is originally from the National Institute of Diabetes and Digestive and Kidney Diseases. The objective of the dataset is to diagnostically predict whether or not a patient has diabetes, based on certain diagnostic measurements included in the dataset. Several constraints were placed on the selection of these instances from a larger database. In particular, all patients here are females at least 21 years old of Pima Indian heritage.

About the model:

  • The K-Nearest Neighbors (KNN) algorithm is used.
  • The dataset is split into training and testing sets (80% training, 20% testing).
  • The model is trained on both normalized and unnormalized feature values.
  • Data leakage is prevented by fitting the Standard Scaler on the training set only and then applying it to both the training and testing sets.
  • Performance was checked for different n_neighbors values, and the best value was selected based on testing accuracy.
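
The steps above can be sketched roughly as follows. This is a minimal illustration using scikit-learn, not the repository's actual code: the function name `evaluate_knn`, the range of `n_neighbors` values, the random seed, and the synthetic stand-in data (generated with `make_classification` instead of loading the real CSV) are all assumptions made for the example.

```python
from sklearn.datasets import make_classification
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier
from sklearn.preprocessing import StandardScaler


def evaluate_knn(X, y, k_values=range(1, 21), seed=42):
    """Return {k: test accuracy} for KNN on standardized features.

    Hypothetical helper: the name, k range and seed are assumptions.
    """
    # 80% training, 20% testing.
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.2, random_state=seed
    )

    # Fit the scaler on the training set only, then apply it to both sets,
    # so no statistics from the test set leak into preprocessing.
    scaler = StandardScaler().fit(X_train)
    X_train_scaled = scaler.transform(X_train)
    X_test_scaled = scaler.transform(X_test)

    # Try each n_neighbors value and record its accuracy on the test set.
    scores = {}
    for k in k_values:
        model = KNeighborsClassifier(n_neighbors=k)
        model.fit(X_train_scaled, y_train)
        scores[k] = model.score(X_test_scaled, y_test)
    return scores


# Synthetic stand-in data (assumption): 768 rows and 8 features,
# used here so the sketch runs without the real dataset file.
X, y = make_classification(
    n_samples=768, n_features=8, n_informative=5, random_state=0
)
scores = evaluate_knn(X, y)
best_k = max(scores, key=scores.get)
print(f"best n_neighbors={best_k}, test accuracy={scores[best_k]:.3f}")
```

To compare against the unnormalized variant, the same loop could be run on `X_train` and `X_test` directly, skipping the scaler.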
