COIL 20

Labelme

A large dataset of annotated images.

natural-image

Vision

STL-10 dataset

is an image recognition dataset for developing unsupervised feature learning, deep learning, self-taught learning algorithms. Like CIFAR-10 with some modi…

natural-image

Vision

Caltech 256

Pictures of objects belonging to 256 categoriesPictures of objects belonging to 256 categories.

classification, natural-image

Vision

OpenStreetMap

Vector data for the entire planet under a free license. It contains (an older version of) the US Census Bureaus data.

geospatial, natural-image

Vision

Googles Open Images

A collection of 9 million URLs to images that have been annotated with labels spanning over 6,000 categories under Creative Commons.

natural-image

Vision

LSUN

Scene understanding with many ancillary tasks (room layout estimation, saliency prediction, etc.) and an associated competition.

natural-image

Vision

The Street View House Numbers…

House numbers from Google Street View. Think of this as recurrent MNIST in the wild.

natural-image

Vision

COIL100

COIL100 : Different objects imaged at every angle in a 360 rotation.

natural-image

Vision

Pascal VOC

Generic image Segmentation / classificationnot terribly useful for building real-world image annotation, but great for baselines

natural-image

Vision

NEXRAD

Doppler radar scans of atmospheric conditions in the US.

geospatial, natural-image

Vision

MS COCO

Generic image understanding / captioning, with an associated competition.

natural-image

Vision

NORB

Binocular images of toy figurines under various illumination and pose.

natural-image

Vision

ImageNet

The de-facto image dataset for new algorithms. Many image API companies have labels from their REST interfaces that are suspiciously close to the 1000 cat…

natural-image

Vision

MNIST handwritten digits

MNIST: handwritten digits: The most commonly used sanity check. Dataset of 25x25, centered, B&W handwritten digits. It is an easy taskjust because somethi…

natural-image

Vision

CALTECH 101

The CALTECH 101 dataset by Li Fei-Fei contains images for 101 categories with about 40 to 800 images per category. Most categories have about 50 images at…

centered, image classification, natural-image, object, scene

Vision

Landsat8

Satellite shots of the entire Earth surface, updated every several weeks.

geospatial, natural-image

Vision

CIFAR10 / CIFAR100

32x32 color images with 10 / 100 categories. Not commonly used anymore, though once again, can be an interesting sanity check.

natural-image

Vision

Related datasets