The following are multiview stereo data sets captured in our lab: a set of images, camera parameters and extracted apparent contours of a single rigid obj…
3d reconstruction, dense, depth, mesh, sfm

The object is a plaster dinosaur (stegosaurus). Click on thumbnail for a full-sized (640x480) image. Resolution of ground truth model: 0.00025m (you may w…
3d, 3d reconstruction, benchmark, multiview, sfm

The Cambridge-driving Labeled Video Database (CamVid) dataset from Gabriel Brostow [?] contains ten minutes of video footage and corresponding semanticall…
3d reconstruction, depth, semantic, semantic segmentation, sfm, urban

The object is a plaster reproduction of the Temple of the Dioskouroi in Agrigento, Sicily. Click on thumbnail for a full-sized (640x480) image. Resolution of …
3d, 3d reconstruction, benchmark, multiview, sfm

Aiguille du Midi, France. Camera: Mamiya ZD, 55mm - Resolution: 5 Mpixels, 53 images - Photographer: B. Vallet (Imagine/EVD - 2006…
3d reconstruction, large scale, mesh, outdoor, sfm

The Google Street View Pittsburgh Research dataset is a street-level image collection provided by Google for research purposes. The dataset provided he…
3d reconstruction, panorama, pittsburgh, sfm, urban

Zurich City Hall dataset (also CIPA dataset) Information: Place: City Hall, Zurich, Switzerland Number of Images: 15, 1280 x 1000 pixels Camera: Fuji …
3d reconstruction, photogrammetry, sfm, urban, zurich

The CMU Geometric Context dataset by Derek Hoiem, Alexei A. Efros, Martial Hebert consists of 300 images used for training and testing the geometric conte…
3d reconstruction, context, depth, geometry, single view

The Quad 6K dataset is a Structure-from-Motion dataset taken at Arts Quad at Cornell University campus and consists of 6514 images with ground truth posit…
3d gps, 3d reconstruction, groundtruth, landmark, sfm, urban

We would like to announce the release of PASCAL-Context dataset. We augmented PASCAL VOC 2010 dataset with annotations for 400+ additional categories. In …
benchmark, category, dense, pascal, recognition, segmentation, semantic, shape

The Notre Dame de Paris dataset is used for 3D SfM reconstruction and contains 715 images provided by Noah Snavely. There are also version for NotreDame b…
3d, 3d reconstruction, flickr, frontview, landmark, limited, paris, pointcloud, sfm

We take advantage of our autonomous driving platform Annieway to develop novel challenging real-world computer vision benchmarks. Our tasks of interest ar…
depth, detection tracking, object detection, object tracking, odometry, optical flow, reconstruction, segmentation, semantic car depth, sfm, stereo

The Dubrovnik6K and Rome16K datasets are image collections for SfM reconstruction, where the suffix refers to the number of images in the dataset. Dubro…
3d reconstruction, dubrovnik, landmark, rome, sfm, urban

The Colosseum and San Marco are two image datasets for dense multiview stereo reconstructions used for evaluating the visual photo realism. The datasets…
3d reconstruction, aerial, flickr, landmark, photo-realism, sfm, streetside, urban

The SAMANTHA (Structure-and-Motion Pipeline on a Hierarchical Cluster Tree) dataset contains 4 sequences for 3D reconstruction: Pozzoveggiani, Piazza Dant…
3d reconstruction, geometry, landmark, model fitting, sfm

CMU/VMR Urban Image+Laser dataset contains 372 images linked with 3D laser points projections. There are additional images (due to the laser scanner being…
3d reconstruction, laser, semantic segmentation, sfm, urban

These sequences were used for our video interpolation work described in High-quality video view interpolation using a layered representation, C.L. Zitn…
3d reconstruction, camera, depth, segmentation

Due to size limitations, the images of the Stable Structure from Motion datasets cannot be put online. Instead here are the tracked image points and the final re…
3d, 3d reconstruction, church, geometry, landmark, robust, sfm, stability

This repository contains labeled 3-D point cloud laser data collected from a moving platform in an urban environment. Data are provided for research purpos…
3d reconstruction, laser, semantic segmentation, sfm, urban

The Symmetric Bundle Adjustment dataset contains four sequences of the CAB building, Barcelona, Redmond and Capitole for 3D reconstruction considering sym…
3d reconstruction, bundle adjustment, sfm, symmetry, urban

The Leuven Stereo Scene dataset is a scene and depth dataset. There exist two variants of this dataset - a CVPR 2007 paper [1] by Leibe et al. for detecti…
3d, depth, leuven, reconstruction, segmentation, semantic, sfm, stereo, urban

Welcome to the homepage of the gvvperfcapeva datasets. This site serves as a hub to access a wide range of datasets that have been created for projects of…
action, depth, face, human, mesh, multiview, pose, reconstruction, tracking, video

This ETHZ CVL RueMonge 2014 dataset is used for 3D reconstruction and semantic mesh labelling for urban scene understanding. It was first published in [1] …
3d, architecture, benchmark, classification, code, mesh, outdoor, paris, pointcloud, recognition, reconstruction, segmentation, semantic, source, urban

The Paris500k dataset consists of 501,356 geotagged images collected from Flickr and Panoramio. The dataset was collected from a geographic bounding box r…
3d reconstruction, flickr, geotag, image retrieval, landmark, panoramio, paris, sfm

The HCI 4D Lightfields dataset contains 11 objects with corresponding lightfields for depth estimation. Datasets can be downloaded individually below. F…
3d, 4d, benchmark, depth, evaluation, lightfield, reconstruction

The Aachen dataset consists of 4479 images taken with multiple cameras (3GB), 369 query images taken with the camera of a mobile phone together with their…
3d reconstruction, aachen, image retrieval, landmark, sfm

Since its launch in September 1999, Space Imaging IKONOS earth imaging satellite has provided a reliable stream of image data that has become the standard…
3d reconstruction, aerial, photogrammetry, sfm, urban

The CHALEARN Multi-modal Gesture Challenge is a dataset of 700+ sequences for gesture recognition using images, kinect depth, segmentation and skeleton data.…
action, depth, gesture, human, illumination, kinect, recognition, segmentation, skeleton

Instance recognition from depth data. Contains various challenges of Pose, Clutter, Occlusion and similar looking objects (Bonde, U., Badrinarayanan, V., …
depth, detection, instance, pose

Scene Background Initialization (SBI) dataset The SBI dataset has been assembled in order to evaluate and compare the results of background initializati…
background, benchmark, change, detection, foreground, initialization

For the first few decades of the field's existence, computer vision has been focused on algorithmic, logical approaches to perception. But it was only with…
3d, depth, indoor, kinect, object, recognition, reconstruction

The xawAR16 dataset is a multi-RGBD camera dataset, generated inside an operating room (IHU Strasbourg), which was designed to evaluate tracking/relocaliz…
depth, medicine, operation, recognition, surgery, table, video

Fine-Grained Visual Classification of Aircraft (FGVC-Aircraft) is a benchmark dataset for the fine grained visual categorization of aircraft. Data, anno…
aircraft, airplane, benchmark, classification, evaluation, fine-grained, recognition

The Sheffield Kinect Gesture (SKIG) dataset contains 2160 hand gesture sequences (1080 RGB sequences and 1080 depth sequences) collected from 6 subjects. Al…
action, depth, gesture, human, illumination, kinect, recognition

The Video Segmentation Benchmark (VSB100) provides ground truth annotations for the Berkeley Video Dataset, which consists of 100 HD quality videos divide…
benchmark, groundtruth, motion, object, pedestrian, segmentation, tracking, video

The Graffiti dataset by Krystian Mikolajczyk and Cordelia Schmid contains 48 images split into 8 sequences with 6 images each showing different structured…
benchmark, feature description, feature detection, image rectification

The Prague Texture Segmentation Datagenerator and Benchmark is designed to mutually compare and rank different (dynamic/static) texture segmenters (superv…
benchmark, synthetic, texture classification, texture segmentation

The Textures volume currently contains 154 images, all monochrome, 129 512x512 and 25 1024x1024. For the Brodatz texture images, the number in parenthes…
benchmark, classification, evaluation, segmentation, synthetic, texture

This dataset contains two image collections, TempleOfHeaven and SportsArena, that are deemed hard for Structure-from-Motion (SfM). The method is descri…
3d reconstruction, ambiguous structures, structure-from-motion

The MSR RGB-D Dataset 7-Scenes dataset is a collection of tracked RGB-D camera frames. The dataset may be used for evaluation of methods for different app…
depth, kinect, location, reconstruction, tracking, video

Unlike the previous SHREC contests, the objective of this SHREC 2012 contest is to evaluate the performance of 3D-mesh segmentation techniques instead of …
3d, mesh, part, segmentation

The RGB-D Person Re-identification dataset is for person re-identification using depth information. The main motivation is that the standard techniques (s…
3d, classification, depth, identification, pedestrian, shape

ChairGest is an open challenge / benchmark. The task consists in spotting and recognizing gestures from multiple synchronized sensors: 1 Kinect and 4 Xse…
benchmark, detection, gesture, human, kinect, recognition

The 3D shape description dataset consists of multiple sub-datasets Descriptor Matching - Dataset 1 & 2 (Stanford) These datasets, created from some of …
3d, benchmark, description, matching, reconstruction, registration, shape

To encourage the open comparison of single image shadow removal in the community, we provide an online benchmark site and a dataset. Our quantitatively verifi…
benchmark, illumination, removal, shadow, singleview

The Outex dataset is part of a framework for empirical evaluation of texture classification and segmentation algorithms. The framework is being construct…
benchmark, classification, segmentation, synthetic, texture

The Berkeley Video Segmentation Dataset (BVSD) contains videos for segmentation (boundary?) Dataset train Dataset test
benchmark, segmentation, video

ISPRS Test Project on Urban Classification, 3D Building Reconstruction and Semantic Labeling. In this part of our working group site you will get further …
3d, aerial, benchmark, canada, city, germany, multiview, photogrammetry, recognition, segmentation, semantic, urban

ISPRS / EuroSDR Benchmark for Multi-Platform Photogrammetry In these pages you can get information about the BENCHMARK FOR MULTI-PLATFORM PHOTOGRAMMETRY…
3d, aerial, benchmark, city, germany, multiview, photogrammetry, reconstruction, switzerland, urban

We wanted to have a collection of action recognition papers and results that everybody can use for reference. The site will work by the community principl…
action, benchmark, dataset, recognition

The procedural texture perceptual similarity dataset contains a list of procedural textures along with their pairwise distances, as defined by a perceptua…
benchmark, procedural, study, texture

The NBVbench is a reference object and benchmark criteria for defining and evaluating the performance of a next best view (NBV) method.
3d reconstruction, geometry, next best view, planning

The Weather and Illumination Database (WILD) is an extensive database of high quality images of an outdoor urban scene, acquired every hour over all seaso…
camera, change, depth, estimation, illumination, light, newyork, static, time, urban, video, weather, webcam

The NYU-Depth V2 data set is comprised of video sequences from a variety of indoor scenes as recorded by both the RGB and Depth cameras from the Microsoft…
depth, kinect, label, reconstruction, semantic segmentation

The data is taken from Photo Tourism reconstructions from Trevi Fountain (Rome), Notre Dame (Paris) and Half Dome (Yosemite). Each dataset consists of a s…
feature description, feature matching, pair, sfm

The Extreme Classification Repository: Multi-label Datasets & Code Kush Bhatia Himanshu Jain Prateek Jain Manik Varma The objective in extreme mult…
benchmark, classification, evaluation, learning, machine, multilabel

The 2D-3D-S dataset provides a variety of mutually registered modalities from 2D, 2.5D and 3D domains, with instance-level semantic and geometric annotati…
3d, building, depth, indoor, large-scale, normal, panorama, reconstruction, segmentation, semantic

COCO-Stuff augments the COCO dataset with pixel-level stuff annotations for 10,000 images. These annotations can be used for scene understanding tasks lik…
annotation, benchmark, captioning, coco, groundtruth, segmentation, semantic, stuff, things

The 1DSfM Landmarks is a collection of community-based image reconstruction by Kyle Wilson and is comprised of 14 datasets with comparison to bundler grou…
3d, benchmark, city, groundtruth, landmark, reconstruction, urban

The TVPR dataset includes 23 registration sessions. Each of the 23 folders contains the video of one registration session. Acquisitions have been performe…
clothing, depth, gender, identification, indoor, people, person, recognition, reidentification, top-view, video

The Lane Level Localization dataset was collected on a highway in San Francisco with the following properties: * Reasonable traffic * Multiple lane hig…
3d, autonomous, benchmark, car, driving, gps, localization, map, road, video

The Symmetry Facades dataset contains 9 building facades with multiple images. It is used for coupled symmetry and structure from motion detection. Couple…
3d, building, facade, reconstruction, repetition, sfm, symmetry, urban

The Stanford 3D Scanning Repository dataset is a compilation of 3D scans of objects like Stanford Bunny, Happy Buddha, Dragon, Armadillo and Lucy. These c…
3d reconstruction, bunny, laser, triangulation

The Video Summarization (SumMe) dataset consists of 25 videos, each annotated with at least 15 human summaries (390 in total). The data consists of videos…
action, benchmark, event, groundtruth, human, summary, video

The Freiburg-Berkeley Motion Segmentation Dataset (FBMS-59) is an extension of the BMS dataset with 33 additional video sequences. A total of 720 frames i…
benchmark, groundtruth, motion, object, pedestrian, segmentation, tracking, video

The Brodatz dataset consists of 112 textures in grayscale images of various texture types. http://www.ee.oulu.fi/research/imag/texture/image_data/Brodat…
benchmark, classification, segmentation, synthetic, texture

The VidPairs dataset contains 133 pairs of images, taken from 1080p HD (~2 megapixel) official movie trailers. Each pair consists of images of the same sc…
dense, description, flow, matching, optical, pair, patch, video

The dataset is composed of 150 synthetic scenes, captured with a (perspective) virtual camera, and each scene contains 3 to 5 objects. The model set is co…
mesh, recognition, segmentation, synthetic

This dataset package contains the software and data used for Detection-based Object Labeling on the RGB-D Scenes Dataset as implemented in the paper: De…
3d, depth, indoor, kinect, object, recognition, reconstruction

It is composed of ADL (activity daily living) and fall actions simulated by 11 volunteers. The people involved in the test are aged between 22 and 39, wit…
accelerometer, action, depth, fall detection - adl, human, kinect, recognition, video, wearable

The MOT Challenge is a framework for the fair evaluation of multiple people tracking algorithms. In this framework we provide: - A large collection of d…
http://motchallenge.net/
3d, benchmark, dataset, evaluation, multiple, pedestrian, people, surveillance, target, tracking, video

Voxel Based Dataset for Systematic 3D reconstruction by artificial neural networks (ANNs). A synthetic scalable cube dataset for training, testing and v…
3d, deep learning, reconstruction, sfm, synthetic city urban

We present the 2017 DAVIS Challenge, a public competition specifically designed for the task of video object segmentation. Following the footsteps of othe…
benchmark, code, hd, object, quality, resolution, segmentation, tracking, video segmentation

MICCAI 2015 Challenge on Liver Ultrasound Tracking Munich, October 9, 2015 (Full Day) Outline Ultrasound (US) imaging is a widely used medical imaging…
benchmark, human, liver, medical, organ, real, therapy, tracking, ultrasound

A dataset of 66 stereo pairs with their subpixel ground truths. The construction and improvement of algorithms for subpixel stereovision requires very prec…
3d, depth, groundtruth, noise, pointcloud, stereo, stereovision, subpixel

The DTU Robot dataset consists of color images of 60 scenes acquired in a controlled setup from 119 different positions and under different lighting. For …
feature description, feature detection, feature matching, illumination, reconstruction, sfm

The SPHERE human skeleton movements dataset was created using a Kinect camera, that measures distances and provides a depth map of the scene instead of th…
action, behavior, depth, human, kinect, motion, movement, skeleton, video

The Ford Car dataset is a joint effort of Pandey et al. (for collecting images, Lidar points, calibration etc.) and us (for annotation of 2D and 3D objects)…
3d, car, detection, groundtruth, lidar, sfm

A synthetic light field dataset with 24 scenes. Data provided for each scene: - 9x9x512x512x3 light fields as individual PNGs - config files with cam…
depth, disparity, ground truth, light field, synthetic

ISPRS and EuroSDR - Benchmark on High Density Aerial Image Matching Background and Scope of the project Innovations in matching algorithms as well as …
3d, aerial, benchmark, city, germany, multiview, photogrammetry, reconstruction, switzerland, urban

The Microsoft COCO (mscoco) is an image recognition and segmentation dataset which contains more than 300k images for more than 70 categories. Other features…
benchmark, context, detection, object, recognition, segmentation, semantic

Many different labeled video datasets have been collected over the past few years, but it is hard to compare them at a glance. So we have created a handy …
action, benchmark, classification, detection, object, recognition, video

The High Definition Analytics (HDA) dataset is a multi-camera High-Resolution image sequence dataset for research on High-Definition surveillance: Pedestr…
benchmark, camera, detection, high-definition, human, indoor, lisbon, multiview, network, pedestrian, re-identification, surveillance, tracking, video

The NYU-Depth data set is comprised of video sequences from a variety of indoor scenes as recorded by both the RGB and Depth cameras from the Microsoft Ki…
depth, kinect, label, reconstruction, semantic segmentation

Scene Parsing Benchmark Scene parsing data and part segmentation data derived from ADE20K dataset can be downloaded from MIT Scene Parsing Benchmark. m…
annotation, benchmark, recognition, scene, segmentation, semantic