Vision

812 Datasets


PETS 2016 IPATCH dataset

The PETS 2016 IPATCH dataset contains a set of fourteen multi-camera recordings (visible, thermal) collected off the coast of Brest, France, in collabora...

visible, thermal, multimodal, vessel, maritime, boat, gps, tracking, detection, radar

Inria Aerial Image Labeling

The Inria Aerial Image Labeling dataset addresses a core topic in remote sensing: the automatic pixelwise labeling of aerial imagery. Dataset ...

house, urban, aerial, building, segmentation, footprint, groundtruth, city, semantic

Swedish Traffic Sign Recogn...

The Swedish Traffic Sign Recognition dataset provides MATLAB code for parsing the annotation files and displaying the results. Part0 for each set contains the a...

urban, traffic, detection, city, sign, recognition

SydneyHouse HouseCraft

In HouseCraft, we utilize rental ads to create realistic textured 3D models of building exteriors. In particular, we exploit the address of the property...

house, urban, registration, floorplan, building, streetview, segmentation, localization, city, semantic

Zurich Summer Dataset

The Zurich Summer v1.0 dataset is a collection of 20 chips (crops), taken from a QuickBird acquisition of the city of Zurich (Switzerland) in August 200...

annotation, urban, pan, gsd, superpixel, nir, aerial, satellite, segmentation, zurich, rgb, city, semantic

Multispectral Imaging (MSI)

Multispectral Imaging (MSI) datasets were acquired using IRIS II, a lightweight portable system comprising a high-resolution camera, a novel ...

illumination, wavelength, registration, alignment, matching, groundtruth, multi-spectral

GeoFaces

A large dataset of geotagged face images collected from Flickr. The zip file contains text files listing the image URLs. Face2GPS: Estimating G...

gender, face, geotagged, classification, age, localization, human

Berkeley DeepDrive Video

The Berkeley DeepDrive Video Dataset contains two orders of magnitude more video training data.

driving, urban, learning, endtoend, deep, autonomous

Visual Discriminative Quest...

The dataset contains 11,202 ambiguous image pairs collected from Visual Genome. Each image pair is annotated with 4.6 discriminative questions and 5.9 no...

question, vqa, genome, vision, biology, language

Osnabruck - Synthetic Scala...

Voxel Based Dataset for Systematic 3D reconstruction by artificial neural networks (ANNs). A synthetic scalable cube dataset for training, testing and...

deep learning, synthetic, city, urban, 3d, sfm, reconstruction