Vision

684 Datasets

Datasets


Stanford 40 Actions

The Stanford 40 Actions dataset contains images of humans performing 40 actions. In each image, we provide a bounding box of the person who is performing …

action, boundingbox, detection, human, recognition

Labeling in 3D Scenes

This dataset package contains the software and data used for Detection-based Object Labeling on the RGB-D Scenes Dataset as implemented in the paper: De…

3d, depth, indoor, kinect, object, recognition, reconstruction

B3DO: Berkeley 3D Object Data…

For the first few decades of the fields existence, computer vision has been focused on algorithmic, logical approaches to perception. But it was only with…

3d, depth, indoor, kinect, object, recognition, reconstruction

HUJI Multi-illuminant Image S…

The Multi-illuminant Image Sequences dataset contains 16 video sequences (13 with single light source and 3 with two global light sources), recorded with…

balance, chromaticity, color, constancy, dichromatic, illumination, light, nature, object, physics, white

3DVis

The 3DVis dataset includes a set of 12 heterogeneous scenes for testing 3D scene registration and analysis methods. Models include homogeneous shapes, rep…

3d, matching, reconstruction, registration, shape, symmetry

Paris Art Deco Facades

The Paris Art Deco Facades dataset consists of 79 / 80 images of rectified facades of the architectural style Art Deco, which has different sizes of windo…

architecture, city, facade, grammar, paris, procedural, recognition, segmentation, semantic, urban

Robotic 3D Scan Repository

The Robotic 3D Scan Repository from Osnabrueck contains 23 different datasets showing a veriaty of 3D scans for objects, humans, cities, university campus…

3d, aerial, bremen, city, germany, heat, human, laser, lidar, osnabrueck, reconstruction, scan, urban

Salient Montages: Human-centr…

The Salient Montages is a human-centric video summarization dataset from the paper [1]. In [1], we present a novel method to generate salient montages f…

human, montage, saliency, summarization, video, wearable

Domain-specific Personal Vide…

The domain-specific personal videos highlight dataset from the paper [1] describes a fully automatic method to train domain-specific highlight ranker for…

action, domain, human, recognition, saliency, summarization, video, wearable

Crowd Dataset

The crowd datasets are collected from a variety of sources, such as UCF and data-driven crowd datasets. The sequences are diverse, representing dense crow…

anomaly, crowd, detection, human, pedestrian, scene, understanding, video