Vision

812 Datasets

Datasets


CVL OCR DB

CVL OCR DB is a public annotated image dataset of 120 binary annotated (text/non-text) images of text in natural scenes. Images include signboards, shop...

ocr, sign recognition

Dubrovnik6K and Rome16K

The Dubrovnik6K and Rome16K datasets are image collections for SfM reconstruction, where the suffix refers to the number of images in the dataset. Dub...

urban, 3d reconstruction, dubrovnik, sfm, landmark, rome

ICDAR 2011

This challenge is set up around three tasks: Text Localisation, Text Segmentation and Word Recognition. Participation in any or all tasks is welcome. Ch...

text recognition, text detection, classification

Ljubljana CVL Face Database

Database contains 798 images of 114 persons, with 7 images per person and is freely available for research purposes. All images were taken in supervised...

face, person, human, lighting, recognition, illumination, pedestrian, biometry

Annotated Web Ears Dataset ...

Dataset contains 1000 images of 100 persons, with 10 images per person and is freely available. All images were acquired by cropping ears from images fr...

person, pedestrian, ear, recognition, human, lighting, biometry

MIT LaMem: Large-Scale Imag...

This database contains 60,000 images with memorability scores. The images come from a variety of datasets including SUN, COCO, image popularity, AVA, an...

objects, scenes, aesthetics, popularity, memorability

WIDER Attribute Dataset

WIDER ATTRIBUTE dataset is a human attribute recognition benchmark dataset, of which images are selected from the publicly available WIDER dataset. Ther...

human attribute, attribute recognition

Procedural texture perceptu...

The procedural texture perceptual similarity dataset contains a list of procedural textures along with their pairwise distances, as defined by a percept...

study, benchmark, procedural, texture

General 100

General-100 dataset contains 100 bmp-format images (with no compression). We used this dataset in our FSRCNN ECCV 2016 paper. The size of these 100 imag...

superresolution, image

LabelMeFacade

The LabelMeFacade dataset contains buildings, windows, sky and a limited number of unlabeled regions (maximally 20% covering of the image). This procedu...

segmentation, urban, semantic, recognition, facade, rectified