Vision

812 Datasets

Stroke Width Transform Text

The Stroke Width Transform Text dataset, by Boris Epstein, consists of 307 images and XXX text instances. Detecting Text in Natural Scenes with Stro...

text recognition, text detection, classification

Daimler Pedestrian Detectio...

15,560 pedestrian and non-pedestrian samples (image cut-outs) and 6744 additional full images not containing pedestrians for bootstrapping. The test set...

detection

Street View Text

The Street View Text (SVT) dataset contains 647 words and 3796 letters in 249 images harvested from Google Street View. The dataset is more challengin...

urban, text recognition, text detection, classification, outdoor

Hieroglyph Dataset

Ancient Egyptian Hieroglyph Dataset.

recognition

Oxford reconstruction data ...

Oxford colleges

multiview

Change Detection

The dataset folder contains 7 folders (one for each category). Each category folder contains 4 to 6 folders (one for each video). Each video folder co...

change detection, background modelling
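
The category/video folder layout described for Change Detection can be walked with a few lines of Python. This is only a sketch under stated assumptions: the function name `list_videos` and the root path are hypothetical, and nothing is assumed about what each video folder contains, since the description above is cut off at that point.

```python
from pathlib import Path


def list_videos(root):
    """Map each category folder name to the sorted names of its video folders.

    Assumes the layout root/<category>/<video>/...; `root` is a hypothetical
    path to wherever the dataset was unpacked.
    """
    root = Path(root)
    layout = {}
    # First level: one folder per category (plain files are skipped).
    for category in sorted(p for p in root.iterdir() if p.is_dir()):
        # Second level: one folder per video inside that category.
        layout[category.name] = sorted(
            v.name for v in category.iterdir() if v.is_dir()
        )
    return layout
```

Calling `list_videos("dataset")`, where `"dataset"` is a placeholder path, returns a dictionary that can be checked against the expected 7 categories with 4 to 6 videos each.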

ICG Graz240

The ICG Graz240 dataset consists of 240 buildings with 5400 redundant images with a total of 5542 window instances. Window detection itself is difficult...

urban, semantic segmentation, semantic, object detection, graz

Aachen Retrieval

The Aachen dataset consists of 4479 images taken with multiple cameras (3GB), 369 query images taken with the camera of a mobile phone together with the...

image retrieval, 3d reconstruction, aachen, sfm, landmark

Ikonos Aerial

Since its launch in September 1999, Space Imaging's IKONOS Earth-imaging satellite has provided a reliable stream of image data that has become the standa...

urban, 3d reconstruction, photogrammetry, aerial, sfm

ECP Paris 2010

The Ecole Centrale Paris 2010 (Paris 2010) dataset consists of 30 images of densely annotated building facades in seven classes - wall, window, sky, sho...

urban, semantic segmentation, semantic, paris, procedural reconstruction