This dataset comprises of 10 actions related to breakfast preparation, performed by 52 different individuals in 18 different kitchens.
actionThis dataset consists of seven meal-preparation activities, each performed by 10 subjects. Subjects perform the activities based on the given cooking re...
actionThe dataset consists of four temporally synchronized data modalities. These modalities include RGB videos, depth videos, skeleton positions, and inertia...
actionDynamic temporal facial expressions data corpus consisting of close to real world environment extracted from movies.
human pose/expressionContains 91,793 faces manually labeled with expressions. Each of the face images was manually annotated as one of the seven basic expression categories:...
human pose/expressionThis dataset includes 214971 annotated depth images of hands captured by a RealSense RGBD sensor of hand poses. Annotations: per pixel classes, 6D finge...
human pose/expressionDepth videos + ground truth human poses from 2 viewpoints to improve 3D human pose estimation.
human pose/expression