Vision

812 Datasets

Datasets


KAIST Multispectral Pedestr...

We developed imaging hardware consisting of a color camera, a thermal camera and a beam splitter to capture the aligned multispectral (RGB color + Therm...

pedestrian, thermal, rgb

ETHZ Multi-Person Tracking

Robust Multi-Person Tracking from Mobile Platforms In all cases, data was recorded using a pair of AVT Marlins F033C mounted on a chariot respectively...

tracking, pedestrian, color, sequence

NYU Symmetry Database

The mirror symmetry database contains 176 single-symmetry and 63 multyple-symmetry images (.png files) with accompanying ground-truth annotations (.mat ...

symmetry, detection, groundtruth, mirror

The Oxford RobotCar Dataset

The Oxford RobotCar Dataset contains over 100 repetitions of a consistent route through Oxford, UK, captured over a period of over a year. The dataset c...

driving, street, urban, time, recognition, autonomous, video, segmentation, robot, classification, detection, car, year

ImageNET

The ImageNET dataset is the latest dataset by Li Fei-Fei containing various dataset ranging from 1000 to 10000 categories.

image classification, object segmentation, retrieval

IMPART multi-modal/multi-view

The multi-modal/multi-view datasets are created in a cooperation between University of Surrey and Double Negative within the EU FP7 IMPART project. Th...

rgbd, color, dynamic, multi-view, action, outdoor, video, 3d, face, emotion, lidar, human, indoor, multi-mode, model

Facial Expression Research ...

FERG-DB is a database of stylized characters with annotated facial expressions. The database contains multiple face images of six stylized characters. T...

facial expressions, joy, cardinal classification, deep learning, stylization, animation, fear, human transfer, face, disgust, neutral, anger, annotation emotion, surprise, sad, image retrieval, facial expression

COCO-Stuff

COCO-Stuff augments the COCO dataset with pixel-level stuff annotations for 10,000 images. These annotations can be used for scene understanding tasks l...

annotation, benchmark, coco, segmentation, things, captioning, stuff, groundtruth, semantic

4D Light Field Dataset (HCI...

A synthetic light field dataset with 24 scenes. Data provided for each scene: - 9x9x512x512x3 light fields as individual PNGs - config files with c...

ground truth, light field, disparity, depth, synthetic

CMLA Subpixel Stereo Dataset

A 66 stereo pairs dataset with their subpixel ground truths. The construction and improvement of algorithms for subpixel stereovision requires very pr...

stereo, depth, pointcloud, noise, stereovision, 3d, groundtruth, subpixel