Knowing sample dataset for use in Machine Learning

  • The Pima Indians Diabetic Dataset is used to illustrate the Machine Learning concepts.
  • The dataset describes the medical records for Pima Indians and whether or not each patient will have an onset of diabetes within five years.
# Columns description
   Pregnancies(preg)
Number of times pregnant

Glucose(plas)
Plasma glucose concentration a 2 hours in an oral glucose tolerance test

BloodPressure(pres)
Diastolic blood pressure (mm Hg)

SkinThickness(skin)
Triceps skin fold thickness (mm)

Insulin(test)
2-Hour serum insulin (mu U/ml)

BMI(mass)
Body mass index (weight in kg/(height in m)^2)

DiabetesPedigreeFunction(pedi)
Diabetes pedigree function

Age(age)
Age (years)

Outcome(class)
Class variable (0 or 1) 268 of 768 are 1, the others are 0




















































Dataset showing first 20 rows.......

Comments

Popular posts from this blog