Heart Disease

4 databases: Cleveland, Hungary, Switzerland, and the VA Long Beach

Characteristics
Multivariate
Subject Area
Health and Medicine
Associated Tasks
Classification

Attribute Type
--
# Instances
303
# Attributes
13

Info

This database contains 76 attributes, but all published experiments refer to using a subset of 14 of them. In particular, the Cleveland database is the only one that has been used by ML researchers to date. The "goal" field refers to the presence of heart disease in the patient. It is integer valued from 0 (no presence) to 4. Experiments with the Cleveland database have concentrated on simply attempting to distinguish presence (values 1,2,3,4) from absence (value 0).     The names and social security numbers of the patients were recently removed from the database, replaced with dummy values. One file has been "processed", that one containing the Cleveland database. All four unprocessed files also exist in this directory. To see Test Costs (donated by Peter Turney), please see the folder "Costs"


Introductory Paper

International application of a new probability algorithm for the diagnosis of coronary artery disease.

By R. Detrano, A. Jánosi, W. Steinbrunn, M. Pfisterer, J. Schmid, S. Sandhu, K. Guppy, S. Lee, V. Froelicher. 1989

Published in American Journal of Cardiology

Provided by
University of California, Irvine


Creators
  • Andras Janosi
  • William Steinbrunn
  • Matthias Pfisterer
  • Robert Detrano

DOI

10.24432/C52P4X

Login to Download

New to AIM-AHEAD Connect?
Create an account!

Features

Attribute Name Role Type Demographic Description Units Missing Values
age Feature Integer Age years no
sex Feature Categorical Sex no
cp Feature Categorical no
trestbps Feature Integer resting blood pressure (on admission to the hospital) mm Hg no
chol Feature Integer serum cholestoral mg/dl no
fbs Feature Categorical fasting blood sugar > 120 mg/dl no
restecg Feature Categorical no
thalach Feature Integer maximum heart rate achieved no
exang Feature Categorical exercise induced angina no
oldpeak Feature Integer ST depression induced by exercise relative to rest no
slope Feature Categorical no
ca Feature Integer number of major vessels (0-3) colored by flourosopy yes
thal Feature Categorical yes
num Target Integer diagnosis of heart disease no