8

I am testing computer vision algorithms for image categorization. I would like to find a dataset with a few categories of objects e.g. cats and dogs. This dataset should have all the variability within each class be due to the class's intrinsic variability. That is, I don't want to have to worry about pictures taken from different viewpoints or under different lighting conditions etc. Almost all the variability within a category should be due to the intrinsic variability of that category e.g. the category of cats would contain many different images because cats actually look different from one another, not because the images were produced under different conditions.

Preferably, the objects will be "cut out" (on a uniform background). The size of the dataset is not important. Synthetic images (perhaps produced with 3D graphics software) are also ok. The images need to come labeled with their category.

Does anyone know of a dataset like this?

7 Answers7

6

Computer Vision Online seems to have a very comprehensive list of CV and image processing datasets: http://computervisiononline.com/datasets

Community
  • 1
  • 1
bjoernz
  • 3,852
  • 18
  • 30
5

I tried SVMs a few years ago with this library:

http://www.cs.columbia.edu/CAVE/software/softlib/coil-100.php

code-gijoe
  • 6,949
  • 14
  • 67
  • 103
5

Try the caltech 256 dataset: http://www.vision.caltech.edu/Image_Datasets/Caltech256/

It will give you a good basis of comparison against other algorithms used in the community as well.

jeff7
  • 2,172
  • 14
  • 9
  • I'm familiar with caltech101. The purpose of datasets like caltech 101 and 256 is similar, but not the same as what I'm looking for. In caltech 101 the objects are mostly the same size and centered in the image but this is not well controlled (that isn't really the point). I am specifically interested in a dataset where all the objects are normalized to the same view and preferably cut out on a gray background. This means I'm looking for a very "easy" dataset... – innocuousQualia Mar 31 '11 at 12:23
  • Caltech is easy, MSR color is also easy. See this (http://people.csail.mit.edu/torralba/publications/datasets_cvpr11.pdf) – bendervader Aug 18 '12 at 22:35
3

I've listed some of the image data sets that we've been working on during my PhD time - but you should find really plenty of them on the web. From what you described you are looking for an object recognition task or segmentation dataset with groundtruth.

The ALOI Dataset might be interesting for you: "ALOI is a color image collection of one-thousand small objects, recorded for scientific purposes. In order to capture the sensory variation in object recordings, we systematically varied viewing angle, illumination angle, and illumination color for each object, and additionally captured wide-baseline stereo images. We recorded over a hundred images of each object, yielding a total of 110,250 images for the collection."

The traffic sign recognition datset might also be interesting for you. IIRC there are also ground truth segmentation masks for the traffic signs.

In either cases you should be able to replace the background with whatever you want (if you want to make the task a abit more complicated).

Good luck with the recognition task (if it is still relevant).

Locked
  • 186
  • 13
2

I think CIFAR dataset might help you. It says that it involves labeled subsets of the 80 million tiny images dataset. Loading the dataset seems to be easy on Python, Matlab or C (binary version) via provided ways.

petrichor
  • 6,459
  • 4
  • 36
  • 48
  • CIFAR doesn't work for the same reasons that Caltech256 doesn't work. I really need normalized images e.g. cutout cats and dogs of the same size, facing the same direction, same lighting etc. Thanks anyway though.. – innocuousQualia Apr 03 '11 at 14:19
1

That is, I don't want to have to worry about pictures taken from different viewpoints or under different lighting conditions etc. Almost all the variability within a category should be due to the intrinsic variability of that category e.g. the category of cats would contain many different images because cats actually look different from one another, not because the images were produced under different conditions.

This is not very realistic. In fact, you will end up with a system that does not work in practice at all - or only in very constrained environments.

One such dataset I know (I did not care for such datasets) is the UIUC car datasets. This database contains images of side views of cars for use in evaluating object detection algorithms. (single-scale + multi-scale)

Stefan
  • 1,131
  • 2
  • 12
  • 30
0

Try PASCAL, ImageNet & SUN. Caltech and MSRC are old and off.

You might want to read this paper as well: Unbiased Look at Dataset Bias - Torralba(MIT) & Efros (CMU): people.csail.mit.edu/torralba/publications/datasets_cvpr11.pdf‎

Arturo
  • 3,679
  • 6
  • 34
  • 43