The Multi-illuminant Image Sequences dataset contains 16 video sequences (13 with single light source and 3 with two global light sources), recorded with a HD video camera (1820x1080pix, 60ps). A flat gray card with spectrally uniform reflectance appears in each video sequence. The single-illuminant dataset includes three outdoor scenes and six indoor scenes recorded under normal lighting. Additionnally, four sequences were recorded using red and blue filters to simulate extreme lighting conditions. The two-illuminant dataset (three sequences) consists of video acquired under artificial lights, sun and skylight, artificial light and natural daylight. Reference : Illuminant Chromaticity from Image Sequences Veronique Prinet, Dani Lischinski, Michael Werman ICCV 2013
This is a subset of the dataset introduced in the SIGGRAPH Asia 2009 paper, Webcam Clip Art: Appearance and Illuminant Transfer from Time-lapse Sequences.…
camera, change, illumination, light, nature, static, time, urban, video, webcamThe CIFAR-10 dataset consists of 60000 32x32 colour images in 10 classes, with 6000 images per class. There are 50000 training images and 10000 test image…
color, image classification, object, patch, scene, tinyThe UBO 2014 consists of 7 semantic categories. Each of these 7 material categories contains measurements of 12 different material instances for being cap…
classification, illumination, light, material, recognition, textureThe Daimler Mono Pedestrian Classification Benchmark dataset consists of two parts: a base data set. The base data set contains a total of 4000 pedestri…
classification, illumination, object, outdoor, pedestrian, scale, urbanThe Weather and Illumination Database (WILD) is an extensive database of high quality images of an outdoor urban scene, acquired every hour over all seaso…
camera, change, depth, estimation, illumination, light, newyork, static, time, urban, video, weather, webcamThe UK Bench dataset from Henrik Stewenius and David Nister contains 10200 images of N=2550 groups with each four images at size 640x480. The images are r…
centered, image retrieval, object, rotationThe multi-modal/multi-view datasets are created in a cooperation between University of Surrey and Double Negative within the EU FP7 IMPART project. The …
3d, action, color, dynamic, emotion, face, human, indoor, lidar, model, multi-mode, multi-view, outdoor, rgbd, videoThis dataset package contains the software and data used for Detection-based Object Labeling on the RGB-D Scenes Dataset as implemented in the paper: De…
3d, depth, indoor, kinect, object, recognition, reconstructionDatabase contains 798 images of 114 persons, with 7 images per person and is freely available for research purposes. All images were taken in supervised c…
biometry, face, human, illumination, lighting, pedestrian, person, recognitionThe TRaffic ANd COngestionS (TRANCOS) dataset, a novel benchmark for (extremely overlapping) vehicle counting in traffic congestion situations. It consist…
car, detection, highway, object, spain, traffic, transportation, urban, vehicleThe ICG Multi-Camera and Virtual PTZ dataset contains the video streams and calibrations of several static Axis P1347 cameras and one panoramic video from…
calibration, camera, crowd, detection, graz, multitarget, multiview, network, object, outdoor, panorama, pedestrian, tracking, videoThe ICG Multi-Camera datasets consist of Easy Data Set (just one person) Medium Data Set (3-5 persons, used for the experiments) Hard Data Set (crowd…
calibration, camera, detection, graz, indoor, multitarget, multiview, object, pedestrian, tracking, videoWe present the 2017 DAVIS Challenge, a public competition specifically designed for the task of video object segmentation. Following the footsteps of othe…
benchmark, code, hd, object, quality, resolution, segmentation, tracking, video segmentationThe Symmetry set dataset is a collection of images at different illuminations for the purpose of image matching using local symmetry features. Image Mat…
building, feature, illumination, image, lighting, matching, symmetry, urbanThe DTU Robot dataset consists of color images of 60 scenes acquired in a controlled setup from 119 different positions and under different lighting. For …
feature description, feature detection, feature matching, illumination, reconstruction, sfmThe multi-scale Weizmann horses (originally from Eran Borenstein, adapted by Jamie Shotton) consists of 656 images which is split into 50+50training, 50+5…
clutter, horse, nature, object detection by shape, object segmentationThe GaTech VideoContext dataset consists of over 100 groundtruth annotated outdoor videos with over 20000 frames for the task of geometric context evalua…
classification, context, geometry, nature, outdoor, segmentation, semantic, supervised, unsupervised, urban, videoSome datasets and evaluation tools are provided on this page for four different computer vision and computer graphics problems. Population counting Lin…
3d, counting, crowd, detection, groundtruth, line, network, object, pedestrian, pointcloud, reconstruction, road, surface, urbanRobust Multi-Person Tracking from Mobile Platforms In all cases, data was recorded using a pair of AVT Marlins F033C mounted on a chariot respectively a…
color, pedestrian, sequence, trackingThe Daimler Mono Pedestrian Detection Benchmark dataset contains a large training and test set. The training set contains 15.560 pedestrian samples (image…
detection, mono, object, outdoor, pedestrian, scale, urbanThe ETHZ Shape classes dataset from Vittorio Ferrari [?] consists of five object classes and a total of 255 images. All classes contain significant intra-…
applelogo, bottle, clutter, giraffe, matching, mug, nature, object detection by shape, segmentation, swanThe YouTube-Objects dataset is composed of videos collected from YouTube by querying for the names of 10 object classes. It contains between 9 and 24 vide…
detection, flow, object, optical, segmentation, videoThe Microsoft COCO (mscoco) is an image recognition and segmentation dataset which contains more 300k images for more than 70 categories. Other features…
benchmark, context, detection, object, recognition, segmentation, semanticMany different labeled video datasets have been collected over the past few years, but it is hard to compare them at a glance. So we have created a handy …
action, benchmark, classification, detection, object, recognition, videoA dataset acquired with 3 synchronized sensors (Primesense Carmine 1.09, Microsoft Kinect v2, Canon IXUS 950 IS), featuring: * 30 industry-relevant obje…
3d, estimation, object, pose, rgbd, texture-lessThe CHALEARN Multi-modal Gesture Challenge is a dataset +700 sequences for gesture recognition using images, kinect depth, segmentation and skeleton data.…
action, depth, gesture, human, illumination, kinect, recognition, segmentation, skeletonThe Stanford Background Dataset is a new dataset introduced in Gould et al. (ICCV 2009) for evaluating methods for geometric and semantic scene understand…
classification, geometry, nature, segmentation, semantic, urbanThe dataset contains 15 documentary films that are downloaded from YouTube, whose durations vary from 9 minutes to as long as 50 minutes, and the total nu…
detection, object, videoScanNet is an RGB-D video dataset containing 2.5 million views in more than 1500 scans, annotated with 3D camera poses, surface reconstructions, and insta…
3d, cad, indoor, layout, object, realism, recognition, rendering, room, scene, segmentation, syntheticThe UCF Person and Car VideoSeg dataset consists of six videos with groundtruth for video object segmentation. Surfing, jumping, skiing, sliding, big ca…
camera, groundtruth, model, motion, object, segmentation, videoThis site is dedicated to provide datasets for the Robotics community with the aim to facilitate result evaluations and comparisons. The datasets presente…
3d, city, laser, nature, urbanFor the first few decades of the fields existence, computer vision has been focused on algorithmic, logical approaches to perception. But it was only with…
3d, depth, indoor, kinect, object, recognition, reconstructionThe Farman Institute 3D Point Sets dataset contains 11 objects by a 3D laser scanner. This dataset was peer-reviewed by Image Processing On Line: Farman I…
3d, laser, model, object, point, reconstruction, scannerThe Tiny Images dataset consists of 79,302,017 images, each being a 32x32 color image. This data is stored in the form of large binary files which can be …
color, image classification, image retrieval, tinyThe Person Re-ID (PRID) 2011 dataset was created in co-operation with the Austrian Institute of Technology for the purpose of testing person re-identifica…
appearance, change, classification, graz, identification, illumination, multiview, pedestrian, trajectoryThe Aspect Layout dataset is designed to allow evaluation of object detection for aspect ratios in perspective images. Author text: In this project we…
aspect, detection, layout, object, perspective, ratioThis material is supplementary to Michael Stark, Bernt Schiele. How Good are Local Features for Classes of Geometric Objects. Eleventh IEEE Internatio…
binary, classification, object, shape, toolThe Shefeld Kinect Gesture (SKIG) dataset contains 2160 hand gesture sequences (1080 RGB sequences and 1080 depth sequences) collected from 6 subjects. Al…
action, depth, gesture, human, illumination, kinect, recognitionThe Longterm Pedestrian dataset consists of images from a stationary camera running 24 hours for 7 days at about 1 fps. It used for adaptive detection an…
background, change, coffee, detection, graz, illumination, indoor, multitarget, pedestrian, robustThe Our Database of Faces (ORL) dataset contains ten different images of each of 40 distinct subjects. For some subjects, the images were taken at differe…
expression, face, human, illumination, recognitionThe Visual Attributes dataset contains visual attribute annotations for over 500 object classes (animate and inanimate) which are all represented in Image…
attribute, classification, imagenet, object, recognitionThis data set comprises 144 images of an edge profile cutting head of a milling machine. The head tool contains a total of 30 cutting inserts. The cutting…
cutting, edge, head, inserts, localization, milling, monitoring, object, profile, tool, tools, wearThe Video Segmentation Benchmark (VSB100) provides ground truth annotations for the Berkeley Video Dataset, which consists of 100 HD quality videos divide…
benchmark, groundtruth, motion, object, pedestrian, segmentation, tracking, videoMultispectral Imaging (MSI) datasets were acquired using IRIS II which is a lightweight portable system comprising of a high resolution camera, a novel fi…
alignment, groundtruth, illumination, matching, multi-spectral, registration, wavelengthThe GaTech VideoSeg dataset consists of two (waterski and yunakim?) video sequences for object segmentation. There exists no groundtruth segmentation an…
camera, model, motion, object, segmentation, videoThe Kendall Square webcam dataset consists of two streams for one sunny day and one cloudy day of a city square. It is used for tracking and analyzing col…
appearance, change, color, detection, sky, weather, webcamThe BEOID dataset includes object interactions ranging from preparing a coffee to operating a weight lifting machine and opening a door. The dataset is re…
3d, egocentric, interaction, object, pose, tracking, videoTo encourage the open comparison of single image shadow removal in community, we provide an online benchmark site and a dataset. Our quantitatively verifi…
benchmark, illumination, removal, shadow, singleviewThe Fish4Knowledge project (groups.inf.ed.ac.uk/f4k/) is pleased to announce the availability of 2 subsets of our tropical coral reef fish video and ext…
animal, camera, classification, fish, motion, nature, recognition, video, waterThe Yale Face dataset from A. Georghiades contains 5760 single light source images of ten subjects, each shown in 9 poses and 64 illumination setups (lead…
face detection, illumination, pedestrian, pose estimationThe SegTrack dataset consists of six videos (five are used) with ground truth pixelwise segmentation (6th penguin is not usable). The dataset is used for …
camera, flow, groundtruth, model, motion, object, optical, proposal, segmentation, stationary, videoTo evaluate our method we designed a new ground truth database of 50 images. The following zip-files contain: Data, Segmentation, Labelling - Lasso, Label…
background, boundingbox, color, image segmentation, optimizationThe CALTECH 256 dataset by Li Fei-Fei contains 30607 images for 256 categories.
centered, classification, detection, image, object, sceneThe VOT2016 pixel-wise annotations dataset contains pixel-wise per-frame annotations for sequences from VOT2016 dataset. The annotation is in a form of BW…
annotation, mask, object, segmentation, tracking, visualThe Comprehensive Cars (CompCars) dataset contains data from two scenarios, including images from web-nature and surveillance-nature. The web-nature data …
attribute, car, classification, fine-grained, object, recognition, urban, vehicleThe PASCAL VOC is augmented with segmentation annotation for semantic parts of objects. For example, for the person category, we provide segmentation mask…
detection, human, object, part, pascal, pedestrian, recognition, segmentation, semanticThe ICG Lab 6 (Multi-Camera Multi-Object Tracking) dataset contains 6 indoor people tracking scenarios recorded at our laboratory using 4 static Axis P134…
calibration, camera, detection, evaluation, graz, laboratory, multiview, object, pedestrian, segmentation, trackingThe CALTECH 101 dataset by Li Fei-Fei contains images for 101 categories with about 40 to 800 images per category. Most categories have about 50 images at…
centered, image classification, natural-image, object, sceneThe SUNCG dataset is a Large 3D Model Repository for Indoor Scenes. SUNCG is an ongoing effort to establish a richly-annotated, large-scale dataset of…
3d, indoor, layout, object, realism, recognition, rendering, room, scene, segmentation, syntheticThe INRIA Horses dataset from Frederic Jurie and Vittorio Ferrari consists of 170 images with one or more horses in side-view at several scales and clutte…
clutter, horse, nature, object detection by shape, segmentationThe Freiburg-Berkeley Motion Segmentation Dataset (FBMS-59) is an extension of the BMS dataset with 33 additional video sequences. A total of 720 frames i…
benchmark, groundtruth, motion, object, pedestrian, segmentation, tracking, videoThe KTH Multiview Football dataset contains 771 images of football players includes images taken from 3 views at 257 time instances 14 annotated body join…
camera, detection, game, multitarget, multiview, object, outdoor, pedestrian, pose, recognition, soccer, trackingLASIESTA is composed by many real indoor and outdoor sequences organized in different categories, each of one covering a specific challenge in moving obje…
background, camera, challenge, dataset, detection, foreground, groundtruth, motion, object, stationary, subtraction