This dataset consists of seven meal-preparation activities, each performed by 10 subjects. Subjects perform the activities based on the given cooking reci…
actionThe dataset consists of four temporally synchronized data modalities. These modalities include RGB videos, depth videos, skeleton positions, and inertial …
actionDynamic temporal facial expressions data corpus consisting of close to real world environment extracted from movies.
human pose/expressionContains 91,793 faces manually labeled with expressions. Each of the face images was manually annotated as one of the seven basic expression categories: a…
human pose/expressionThis dataset includes 214971 annotated depth images of hands captured by a RealSense RGBD sensor of hand poses. Annotations: per pixel classes, 6D fingert…
human pose/expressionDepth videos + ground truth human poses from 2 viewpoints to improve 3D human pose estimation.
human pose/expressionCollection of endoscopic and laparoscopic (mono/stereo) videos and images
medical