Smart Stock Prediction

Chanjuan Wen

1 Goal

My goal for this project includes the following two parts:

2 Methods

2.1 Supervised Learning Algorithm

For the preliminary analysis, I mainly use Support Vector Machine(SVM) to do the prediction. SVM is acknowledged by many people as the best supervised learning algorithm for data analysis and pattern recognition[1]. In this project, we use SVM to predict the next day’s stock price. Based on the price data in the previous week, we use four kernels for SVM as following:

2.2 Unsupervised Learning Algorithm

For the classification of stocks with good performance, I plan to use some unsupervised learning algorithm, such as k-means and mixture of Gaussians. In statistics and data mining, k-means clustering algorithm aims to partition n observations into k clusters[2]. Another unsupervised learning algorithm, Mixture of Gaussians, can also be used to deal with clustering problems. In this project, we will use this two algorithms to partition a big set of stocks into some clusters, according to the measurement we define.

3 Dataset and Features

We retrieve historic stock data from Yahoo Finance by Yahoo Stock API. The dataset we collected mainly includes two parts:

4 Preliminary Results

By using four kernels of SVM, we have conducted several tests in the price prediction of seven stocks as following:

5 Remaining Tasks

My remaining tasks include the following things:

6 Timeline

Period Task
May 10 - May 20 Implement the classifer for optimal set of stocks
May 21 - May 30 Optimization, testing and polishing results
May 31 Final report

References

[1] http://en.wikipedia.org/wiki/Support_vector_machine

[2] http://en.wikipedia.org/wiki/K-means_clustering

[3] Christopher King, Christophe Vandrot, John Weng. A SVM Approach to Stock Trading, 2009.

[4] Hung Pham, Andrew Chien, Youngwhan Lim. A Framework for Stock Prediction, 2009.