The following are multiview stereo data sets captured in our lab: a set of images, camera parameters and extracted apparent contours of a single rigid object. Each data set consists of 24 images. Image resolutions range from 1400x1300 pixels^2 to 2000x1800 pixels^2 depending on the data set. For calibration, we have used "Camera Calibration Toolbox for Matlab" by Jean-Yves Bouguet to estimate both the intrinsic and the extrinsic camera parameters. All the images have been corrected to remove radial and tangential distortions. For contour extraction, first, Photoshop has been used to segment the foreground from each image(pixel-level). Second, segmentation results have been used to initialize the apparent contour(s) of an object. Last, a b-spline snake has been applied to extract apparent contours in a sub-pixel level. Images are provided in the JPEG format. Camera parameters are provided in the same format as that of Camera Calibration Toolbox for Matlab. Apparent contours are provided in our own format, but we hope are fairly easy to interpret. Please see the other sections at the bottom of this website for more details. Unfortunately, we do not have ground truth for all the data sets, but we believe that we can develop some photometric ways to evaluate the reconstruction results. This page is still under construction, and we keep on updating the contents as well as adding more data sets as we capture. Note: we also provide some visual hull data sets, too. Toy Dinosaur, Toy Mummy, etc
An evaluation benchmark for dense MVS for these datasets fountain-P11, Herz-Jesu-P8, entry-P10, castle-P19, Herz-Jesu-P25, castle-P30 . Images (correcte…
3d reconstruction, benchmark, dense, depth, mesh, sfmThe Cambridge-driving Labeled Video Database (CamVid) dataset from Gabriel Brostow [?] contains ten minutes of video footage and corresponding semanticall…
3d reconstruction, depth, semantic, semantic segmentation, sfm, urbanAiguille du Midi. France showing photographs with Camera: Mamiya ZD. 55mm. - Resolution: 5Mpixels, 53 images - Photographer: B. Vallet (Imagine/EVD - 2006…
3d reconstruction, large scale, mesh, outdoor, sfmThe Stable Structure from Motion datasets due to size limitations cannot put the images online. Instead here are the tracked image points and the final re…
3d, 3d reconstruction, church, geometry, landmark, robust, sfm, stabilityThe Symmetric Bundle Adjustment dataset contains four sequences of the CAB building, Barcelona, Redmond and Capitole for 3D reconstruction considering sym…
3d reconstruction, bundle adjustment, sfm, symmetry, urbanThe object is a plaster dinosaur (stegosaurus). Click on thumbnail for a full-sized (640x480) image. Resolution of ground truth model: 0.00025m (you may w…
3d, 3d reconstruction, benchmark, multiview, sfmThe Notre Dame de Paris dataset used for 3D SfM reconstruction and contains 715 images provided by Noah Snavely. There are also version for NotreDame b…
3d, 3d reconstruction, flickr, frontview, landmark, limited, paris, pointcloud, sfmThe Leuven Stereo Scene dataset is a scene and depth dataset. There exist two variants of this dataset - a CVPR 2007 paper [1] by Leibe et al. for detecti…
3d, depth, leuven, reconstruction, segmentation, semantic, sfm, stereo, urbanThe Google Street View Pittsburgh Research dataset is a street-level image collection provided by Google for research purposes. The dataset provided he…
3d reconstruction, panorama, pittsburgh, sfm, urbanWe take advantage of our autonomous driving platform Annieway to develop novel challenging real-world computer vision benchmarks. Our tasks of interest ar…
depth, detection tracking, object detection, object tracking, odometry, optical flow, reconstruction, segmentation, semantic car depth, sfm, stereoThe CMU Geometric Context dataset by Derek Hoiem, Alexei A. Efros, Martial Hebert consists of 300 images used for training and testing the geometric conte…
3d reconstruction, context, depth, geometry, single viewThe Colosseum and San Marco are two image datasets for dense multiview stereo reconstructions used for evaluating the visual photo realism. The datasets…
3d reconstruction, aerial, flickr, landmark, photo-realism, sfm, streetside, urbanThe Paris500k dataset consists of 501,356 geotagged images collected from Flickr and Panoramio. The dataset was collected from a geographic bounding box r…
3d reconstruction, flickr, geotag, image retrieval, landmark, panoramio, paris, sfmSince its launch in September 1999, Space Imaging IKONOS earth imaging satellite has provided a reliable stream of image data that has become the standard…
3d reconstruction, aerial, photogrammetry, sfm, urbanThe Quad 6K dataset is a Structure-from-Motion dataset taken at Arts Quad at Cornell University campus and consists of 6514 images with ground truth posit…
3d gps, 3d reconstruction, groundtruth, landmark, sfm, urbanWelcome to the homepage of the gvvperfcapeva datasets. This site serves as a hub to access a wide range of datasets that have been created for projects of…
action, depth, face, human, mesh, multiview, pose, reconstruction, tracking, videoZurich City Hall dataset (also CIPA dataset) nformation: Place: City Hall, Zurich, Switzerland Number of Images: 15, 1280 x 1000 pixels Camera: Fuji …
3d reconstruction, photogrammetry, sfm, urban, zurichThe Dubrovnik6K and Rome16K datasets are image collections for SfM reconstruction, where the suffix refers to the number of images in the dataset. Dubro…
3d reconstruction, dubrovnik, landmark, rome, sfm, urbanThe SAMANTHA (Structure-and-Motion Pipeline on a Hierarchical Cluster Tree) dataset contains 4 sequences for 3D reconstruction: Pozzoveggiani, Piazza Dant…
3d reconstruction, geometry, landmark, model fitting, sfmThe Aachen dataset consists of 4479 images taken with multiple cameras (3GB), 369 query images taken with the camera of a mobile phone together with their…
3d reconstruction, aachen, image retrieval, landmark, sfmCMU/VMR Urban Image+Laser dataset contains 372 images linked with 3D laser points projections. There are additional images (due to the laser scanner being…
3d reconstruction, laser, semantic segmentation, sfm, urbanThese sequences were used for our video interpolation work described in High-quality video view interpolation using a layered representation, C.L. Zitn…
3d reconstruction, camera, depth, segmentationThis repository contains labeled 3-D point cloud laser data collected from a moving platform in a urban environment. Data are provided for research purpos…
3d reconstruction, laser, semantic segmentation, sfm, urbanThe object is a plaster reproduction of Temple of the Dioskouroi in Agrigento, Sicily. Click on thumbnail for a full-sized (640x480) image. Resolution of …
3d, 3d reconstruction, benchmark, multiview, sfmThe SPHERE human skeleton movements dataset was created using a Kinect camera, that measures distances and provides a depth map of the scene instead of th…
action, behavior, depth, human, kinect, motion, movement, skeleton, videoThe Ford Car dataset is joint effort of Pandey et al. (for collecting images, Lidar points, calibration etc.) and us (for annotation of 2D and 3D objects)…
3d, car, detection, groundtruth, lidar, sfmA synthetic light field dataset with 24 scenes. Data provided for each scene: - 9x9x512x512x3 light fields as individual PNGs - config files with cam…
depth, disparity, ground truth, light field, syntheticThe NBVbench is a reference object and benchmark criteria for defining and evaluating the performance of a next best view (NBV) method.
3d reconstruction, geometry, next best view, planningThe NYU-Depth data set is comprised of video sequences from a variety of indoor scenes as recorded by both the RGB and Depth cameras from the Microsoft Ki…
depth, kinect, label, reconstruction, semantic segmentationThe CHALEARN Multi-modal Gesture Challenge is a dataset +700 sequences for gesture recognition using images, kinect depth, segmentation and skeleton data.…
action, depth, gesture, human, illumination, kinect, recognition, segmentation, skeletonInstance recognition from depth data. Contains various challenges of Pose, Clutter, Occlusion and similar looking objects (Bonde, U., Badrinarayanan, V., …
depth, detection, instance, poseThe 2D-3D-S dataset provides a variety of mutually registered modalities from 2D, 2.5D and 3D domains, with instance-level semantic and geometric annotati…
3d, building, depth, indoor, large-scale, normal, panorama, reconstruction, segmentation, semanticFor the first few decades of the fields existence, computer vision has been focused on algorithmic, logical approaches to perception. But it was only with…
3d, depth, indoor, kinect, object, recognition, reconstructionThe xawAR16 dataset is a multi-RGBD camera dataset, generated inside an operating room (IHU Strasbourg), which was designed to evaluate tracking/relocaliz…
depth, medicine, operation, recognition, surgery, table, videoThe Shefeld Kinect Gesture (SKIG) dataset contains 2160 hand gesture sequences (1080 RGB sequences and 1080 depth sequences) collected from 6 subjects. Al…
action, depth, gesture, human, illumination, kinect, recognitionThis ETHZ CVL RueMonge 2014 dataset used for 3D reconstruction and semantic mesh labelling for urban scene understanding. It was first published in [1] …
3d, architecture, benchmark, classification, code, mesh, outdoor, paris, pointcloud, recognition, reconstruction, segmentation, semantic, source, urbanThe Weather and Illumination Database (WILD) is an extensive database of high quality images of an outdoor urban scene, acquired every hour over all seaso…
camera, change, depth, estimation, illumination, light, newyork, static, time, urban, video, weather, webcamThe data is taken from Photo Tourism reconstructions from Trevi Fountain (Rome), Notre Dame (Paris) and Half Dome (Yosemite). Each dataset consists of a s…
feature description, feature matching, pair, sfmThis dataset contains two image collections, TempleOfHeaven and SportsArena, that are deemed hard for Structure-from-Motion (SfM). The method is descri…
3d reconstruction, ambiguous structures, structure-from-motionThe Symmetry Facades dataset contains 9 building facades with multiple images. It used for coupled symmetry and structure from motion detection. Couple…
3d, building, facade, reconstruction, repetition, sfm, symmetry, urbanThe MSR RGB-D Dataset 7-Scenes dataset is a collection of tracked RGB-D camera frames. The dataset may be used for evaluation of methods for different app…
depth, kinect, location, reconstruction, tracking, videoThe VidPairs dataset contains 133 pairs of images, taken from 1080p HD (~2 megapixel) official movie trailers. Each pair consists of images of the same sc…
dense, description, flow, matching, optical, pair, patch, videoUnlike the previous SHREC contests, the objective of this SHREC 2012 contest is to evaluate the performance of 3D-mesh segmentation techniques instead of …
3d, mesh, part, segmentationThe dataset is composed of 150 synthetic scenes, captured with a (perspective) virtual camera, and each scene contains 3 to 5 objects. The model set is co…
mesh, recognition, segmentation, syntheticThe HCI 4D Lightfields dataset contains 11 objects with corresponding lightfields for depth estimation. Datasets can be downloaded individually below. F…
3d, 4d, benchmark, depth, evaluation, lightfield, reconstructionThis dataset package contains the software and data used for Detection-based Object Labeling on the RGB-D Scenes Dataset as implemented in the paper: De…
3d, depth, indoor, kinect, object, recognition, reconstructionThe TVPR dataset includes 23 registration sessions. Each of the 23 folders contains the video of one registration session. Acquisitions have been performe…
clothing, depth, gender, identification, indoor, people, person, recognition, reidentification, top-view, videoIt is composed of ADL (activity daily living) and fall actions simulated by 11 volunteers. The people involved in the test are aged between 22 and 39, wit…
accelerometer, action, depth, fall detection - adl, human, kinect, recognition, video, wearableThe NYU-Depth V2 data set is comprised of video sequences from a variety of indoor scenes as recorded by both the RGB and Depth cameras from the Microsoft…
depth, kinect, label, reconstruction, semantic segmentationVoxel Based Dataset for Systematic 3D reconstruction by artificial neural networks (ANNs). A synthetic scalable cube dataset for training, testing and v…
3d, deep learning, reconstruction, sfm, synthetic city urbanThe RGB-D Person Re-identification dataset is for person re-identification using depth information. The main motivation is that the standard techniques (s…
3d, classification, depth, identification, pedestrian, shapeA 66 stereo pairs dataset with their subpixel ground truths. The construction and improvement of algorithms for subpixel stereovision requires very prec…
3d, depth, groundtruth, noise, pointcloud, stereo, stereovision, subpixelThe DTU Robot dataset consists of color images of 60 scenes acquired in a controlled setup from 119 different positions and under different lighting. For …
feature description, feature detection, feature matching, illumination, reconstruction, sfmThe Stanford 3D Scanning Repository dataset is a compilation of 3D scans of objects like Stanford Bunny, Happy Buddha, Dragon, Armadillo and Lucy. These c…
3d reconstruction, bunny, laser, triangulationWe would like to announce the release of PASCAL-Context dataset. We augmented PASCAL VOC 2010 dataset with annotations for 400+ additional categories. In …
benchmark, category, dense, pascal, recognition, segmentation, semantic, shape