Finding a handwritten dataset with an already extracted features

Question

I want to test my clustering algorithms on data of handwritten text, so I'm searching for a dataset of handwritten text (e.g. words) with already extracted features (the goal is to test my clustering algorithms on, not to extract features). Does anyone have any information on that ?

Thanks.

score 0 · Answer 1 · answered Dec 22 '11 at 11:22

0

There is a dataset of images of handwritten digits : http://yann.lecun.com/exdb/mnist/ .

answered Dec 22 '11 at 11:22

cyborg

9,989
4
38
56

Yes, I've already tested on this database using the 28*28 pixels values of each image as feature vector. But I want more to have an extracted features (descriptors) from a set of handwritten words, characters, or digits ... – shn Dec 22 '11 at 14:01

score 0 · Answer 2 · answered Jan 12 '12 at 16:07

0

Texmex has 128d SIFT vectors "to evaluate the quality of approximate nearest neighbors search algorithm on different kinds of data and varying database sizes", but I don't know what their images are of; you could try asking the authors.

answered Jan 12 '12 at 16:07

denis

21,378
10
65
88

The dataset corpus-texmex is intended for the evaluation of approximate nearest neighbor search methods only. – shn Feb 03 '12 at 16:30

Finding a handwritten dataset with an already extracted features

2 Answers2