Disease Prediction
CS 74/174 Project Proposal

Yilong Zhao

April 11, 2012

Problem

Healing illness is the responsibility of a doctor. However, it will be very helpful if patients can know a little bit about their health before hand. Therefore in this proect, I plan to study and apply several existing machine learning algorithms on the disease prediction puzzle to achieve the goal that by entering the body information, people can get a staightforward description of their current health situation. This contributes to the prevention of a severe disease which may come in silence since people can use it to do daily check.

Data Set Description

I will use the Cleveland database in this project. It has 76 attributes. The training goal refers to the presence of heart disease in the patient, it is basically an integer from 0 to 4. It is created by Medical Center, Long Beach and Cleveland Clinic Foundation.[1]

Learning Method

Since this project refers to prediction and classification problems. The basic machine learning algorithms I take into consider are listed as following:

In order to let the system choose the appropriate method automatically, I plan to implement some methods to combine the results of those learning algorithms and compare the result of the combination solution. The ideas are listed below.

Timeline

References

[1]    http://archive.ics.uci.edu/ml/datasets/Heart+Disease

[2]   Christopher M. Bishop, Pattern Recognition and Machine LearningSpringer 2006