Dataset Category

MPII Cooking Activities Datas…

Cooking Activities dataset.

action

Vision

GTEA Gaze+ Dataset

This dataset consists of seven meal-preparation activities, each performed by 10 subjects. Subjects perform the activities based on the given cooking reci…

action

Vision

UTD-MHAD: multimodal human ac…

The dataset consists of four temporally synchronized data modalities. These modalities include RGB videos, depth videos, skeleton positions, and inertial …

action

Vision

AFEW (Acted Facial Expression…

Dynamic temporal facial expressions data corpus consisting of close to real world environment extracted from movies.

human pose/expression

Vision

Expression in-the-Wild (ExpW)…

Contains 91,793 faces manually labeled with expressions. Each of the face images was manually annotated as one of the seven basic expression categories: a…

human pose/expression

Vision

ETHZ CALVIN Dataset

CALVIN research group datasets

human pose/expression

Vision

HandNet (annotated depth imag…

This dataset includes 214971 annotated depth images of hands captured by a RealSense RGBD sensor of hand poses. Annotations: per pixel classes, 6D fingert…

human pose/expression

Vision

3D Human Pose Estimation

Depth videos + ground truth human poses from 2 viewpoints to improve 3D human pose estimation.

human pose/expression

Vision

IPM Vision Group Image Stitch…

Images and parameters for registeration

image stitching

Vision

VIP Laparoscopic / Endoscopic…

Collection of endoscopic and laparoscopic (mono/stereo) videos and images

medical

Vision

Vision

684 Datasets

Datasets

MPII Cooking Activities Datas…

GTEA Gaze+ Dataset

UTD-MHAD: multimodal human ac…

AFEW (Acted Facial Expression…

Expression in-the-Wild (ExpW)…

ETHZ CALVIN Dataset

HandNet (annotated depth imag…

3D Human Pose Estimation

IPM Vision Group Image Stitch…

VIP Laparoscopic / Endoscopic…