Telugu
Letter Classifier
Problem:
Handwriting varies tremendously, based on various factors. We are trying to find a method that will recognize the letters regardless of the exact handwriting, so documents can be scanned in and then edited using a normal word processor.
Method:
This is
fundamentally a classification problem, and therefore, solving it will require
the implementation of a classifier.
I could not find a
paper specific to this problem.
However, these were the methods mentioned in reference to the
identification of handwritten numbers.
Gaussian Classifier
Baseline Nearest Neighbor Classifer
K-nearest neighbor classifer
Pairwise Linear Classifer
Principal Component Analysis
Radial Basis Function Network
Large Fully Connected Multi-Layer Neural Network:
Tangent Distance Classier (TDC)
PCA+quadratic
LeNet 1
LeNet 4
LeNet 4 / Local
LeNet 4 / K−NN
LeNet 5
Boosted LeNet 4
K−NN Euclidean
Tangent Distance
Soft Margin
Optimal Margin Classier (OMC)
Data Set:
There are no known databases of the Telugu script.
Therefore, I will have to collect my own data
By the Milestone:
I hope to have a complete Data set and narrow down the best method to distinguish Telugu letters. I also plan to get some part of the programming done.