Vision

812 Datasets

Datasets


Stanford Dogs Dataset

The Stanford Dogs dataset contains images of 120 breeds of dogs from around the world. This dataset has been built using images and annotation from Imag...

fine-grained categorization, dogs, detection, classification

Video classification USAA d...

The USAA dataset includes 8 different semantic class videos which are home videos of social occassions which feature activities of group of people. It c...

classification

McGill Real-World Face Vide...

This database contains 18000 video frames of 640x480 resolution from 60 video sequences, each of which recorded from a different subject (31 female and ...

classification

e-Lab Video Data Set

Video data sets to train machines to recognise objects in our environment. e-VDS35 has 35 classes and a total of 2050 videos of roughly 10 seconds each.

classification

Face and Gesture Recognitio...

Face and Gesture Recognition Working Group FGnet

recognition

PUT face

9971 images of 100 people

recognition

Labeled Faces in the Wild

A database of face photographs designed for studying the problem of unconstrained face recognition

recognition

Urban scene recognition

Traffic Lights Recognition, Lara's public benchmarks.

recognition

PubFig: Public Figures Face...

The PubFig database is a large, real-world face dataset consisting of 58,797 images of 200 people collected from the internet. Unlike most other existin...

recognition

YouTube Faces

The data set contains 3,425 videos of 1,595 different people. The shortest clip duration is 48 frames, the longest clip is 6,070 frames, and the average...

recognition