Vision

684 Datasets


WordNet

WordNet is a large lexical database of English. Nouns, verbs, adjectives and adverbs are grouped into sets of cognitive synonyms (synsets), each expressin…

category, classification, hierarchy, imagenet, language

MIT Traffic Data Set

The MIT traffic data set is for research on activity analysis and crowded scenes. It includes a 90-minute traffic video sequence. It is recorded by a…

tracking

MSRC Kinect Gesture Dataset

The Microsoft Research Cambridge-12 Kinect gesture dataset consists of sequences of human movements, represented as body-part locations, and the associate…

action, gesture, human, kinect, recognition

Colosseum and San Marco

The Colosseum and San Marco are two image datasets for dense multiview stereo reconstruction, used for evaluating visual photorealism. The datasets…

3d reconstruction, aerial, flickr, landmark, photo-realism, sfm, streetside, urban

People in WBCN

This dataset is for people tracking in wide baseline camera networks and was designed as a contest at ICPR 2012. The contest consists of two challenges…

aerial, crowd, object detection, object tracking, occlusion, overlap, pedestrian, trajectory

SDHA Contest

The Semantic Description of Human Activities (SDHA) was a contest at ICPR 2010. The contest is composed of three different types of activity recognitio…

aerial, crowd, object detection, object tracking, occlusion, overlap, pedestrian, trajectory

NYU Depth v2

The NYU-Depth V2 data set comprises video sequences from a variety of indoor scenes as recorded by both the RGB and Depth cameras from the Microsoft…

depth, kinect, label, reconstruction, semantic segmentation

NYU Depth v1

The NYU-Depth data set comprises video sequences from a variety of indoor scenes as recorded by both the RGB and Depth cameras from the Microsoft Ki…

depth, kinect, label, reconstruction, semantic segmentation

Multiple Instance Learning da…

MIL data sets used in our 2002 NIPS paper for Elephant, Musk, and TREC: http://www.cs.cmu.edu/~juny/MILL/MIL-experiments.htm

classification, machine learning

Person identification in TV s…

Face tracks, features, and shot boundaries from our latest CVPR 2013 paper. The data is obtained from 6 episodes of Buffy the Vampire Slayer and 6 episodes of Bi…

recognition