Vision

812 Datasets

Crowd Dataset

The crowd datasets are collected from a variety of sources, such as UCF and data-driven crowd datasets. The sequences are diverse, representing dense cr...

video, pedestrian, scene, crowd, human, understanding, anomaly, detection

PHOS (Evaluating illuminati...

PHOS is a color image database of 15 scenes captured under different illumination conditions. Every scene of the database contains 15 different images: ...

real lighting conditions, uneven illumination, shadows, feature detection, illumination invariance

MPI Multi-View Collection G...

Welcome to the homepage of the gvvperfcapeva datasets. This site serves as a hub to access a wide range of datasets that have been created for projects ...

face, reconstruction, depth, mesh, human, action, video, pose, multiview, tracking

Eurasian Cities dataset

The Eurasian Cities dataset contains 103 images of outdoor urban scenes taken in Eurasian cities. It is annotated with horizontal and vertical vanishing...

urban, vanishing, reconstruction, manhattan, outdoor, line, pose, point, geometry

MOT Challenge 2D and 3D

The MOT Challenge is a framework for the fair evaluation of multiple people tracking algorithms. In this framework we provide: - A large collection of...

multiple, benchmark, evaluation, http://motchallenge.net/, dataset, target, video, pedestrian, 3d, tracking, surveillance, people

Bristol Egocentric Object I...

The BEOID dataset includes object interactions ranging from preparing a coffee to operating a weight lifting machine and opening a door. The dataset is ...

video, object, egocentric, 3d, interaction, pose, tracking

ETHZ CVL Video SumMe

The Video Summarization (SumMe) dataset consists of 25 videos, each annotated with at least 15 human summaries (390 in total). The data consists of vide...

video, benchmark, summary, event, human, groundtruth, action

Pedestrian Parsing on Surve...

The Pedestrian Parsing dataset contains 3,673 images from 171 videos of different Surveillance Scenes (PPSS), where 2,064 images are occluded and 1,609 ...

segmentation, pedestrian, parsing

ChokePoint Dataset

We collected a video dataset, termed ChokePoint, designed for experiments in person identification/verification under real-world surveillance conditions...

face, real, human, recognition, world, pedestrian, identification, clustering, multiview, surveillance, detection, sequence

Street View House Number (S...

SVHN is a real-world image dataset for developing machine learning and object recognition algorithms with minimal requirement on data preprocessing and ...

urban, real, recognition, text, streetside, world, streetview, classification, detection, number