Vision

684 Datasets

Datasets


AVA: A Large-Scale Database f…

Aesthetic Visual Analysis (AVA) contains over 250,000 images along with a rich variety of meta-data including a large number of aesthetic scores for each …

VascuSynth

The VascuSynth dataset contains 10 groups of data, each group is composed of 12 volumetric images with bifurcation numbers ranging from 1 to 56. All image…

WorkoutSU-10

The WorkoutSU-10exercisedataset comprises a collection of sequences of human body movements represented by 3D positions of skeletal joints. The dataset wa…

FlickrLogos-32

The FlickrLogos-32 dataset contains photos depicting logos and is meant for the evaluation of multi-class logo detection/recognition as well as logo retri…

YouCook

YouCook is an Annotated Data Set of Unconstrained Third-Person Cooking Videos and is prepared from 88 open-source YouTube cooking videos. The YouCook data…

UNICT-FD889

UNICT-FD889 dataset is a food dataset composed by 889 distinct plates of food. Each dish has been acquired with a smartphone multiple times to introduce p…

50 Salads

The dataset captures 25 people preparing 2 mixed salads each and contains over 4h of annotated accelerometer and RGB-D video data. Annotated activities co…

action, activity, classification, detection, recognition, tracking, video

JPL First-Person Interaction

JPL First-Person Interaction dataset (JPL-Interaction dataset) is composed of human activity videos taken from a first-person viewpoint. The dataset parti…

action, human, interactive, motion, recognition, video

ALOT: Amsterdam Library Of Te…

ALOT is a color image collection of 250 rough textures, recorded for scientific purposes. In order to capture the sensory variation in object recordings, …

Berkeley Multimodal Human Act…

The Berkeley Multimodal Human Action Database (MHAD) contains 11 actions performed by 7 male and 5 female subjects in the range 23-30 years of age except …