The All I Have Seen (AIHS) dataset was created to study the properties of total visual input in humans. For around two weeks, Nebojsa Jojic wore a camera that captured, on average, one image every 20 seconds of his waking hours. The resulting dataset contains a mix of indoor and outdoor scenes as well as numerous foreground objects. The creators' first analysis goal is to create a visual summary of the subject's two weeks of life using unsupervised algorithms that automatically discover recurrent scenes, familiar faces or common actions. Direct application of existing algorithms, such as panoramic stitching (e.g. Photosynth) or appearance-based clustering models (e.g. the epitome), is impractical due to either the large dataset size or the dramatic variation in the lighting conditions. The authors dubbed this type of data "All I Have Seen" (AIHS, meant to be pronounced similarly to "eyes"). While these types of datasets have been assembled before, the authors believe that with the proliferation of mobile devices and the availability of cloud computing, the time is now more appropriate than ever for research into this type of data acquisition, unsupervised techniques for data analysis, and applications built on top of them.
Reference: Nebojsa Jojic, Alessandro Perina and Vittorio Murino, "Structural epitome: a way to summarize one's visual experience", NIPS 2010.
The VSUMM (Video SUMMarization) dataset consists of 50 videos from Open Video. All videos are in MPEG-1 format (30 fps, 352 x 240 pixels), in color and with sou…
keyframe, similarity, static, study, summary, type, user, video
The multi-modal/multi-view datasets are created in a cooperation between University of Surrey and Double Negative within the EU FP7 IMPART project. The …
3d, action, color, dynamic, emotion, face, human, indoor, lidar, model, multi-mode, multi-view, outdoor, rgbd, video
The SUNCG dataset is a Large 3D Model Repository for Indoor Scenes. SUNCG is an ongoing effort to establish a richly-annotated, large-scale dataset of…
3d, indoor, layout, object, realism, recognition, rendering, room, scene, segmentation, synthetic
ScanNet is an RGB-D video dataset containing 2.5 million views in more than 1500 scans, annotated with 3D camera poses, surface reconstructions, and insta…
3d, cad, indoor, layout, object, realism, recognition, rendering, room, scene, segmentation, synthetic
SceneNet RGB-D is a dataset comprising 5 million Photorealistic Images of Synthetic Indoor Trajectories with Ground Truth. It expands the previous work of…
3d, indoor, lighting, navigation, reconstruction, rendering, robot, scene, segmentation, slam, synthetic, trajectory
The Yotta dataset consists of 70 images for semantic labeling given in 11 classes. It also contains multiple videos and camera matrices for 14km of drivin…
3d, camera, classification, reconstruction, segmentation, semantic, urban, video
The ICG Multi-Camera and Virtual PTZ dataset contains the video streams and calibrations of several static Axis P1347 cameras and one panoramic video from…
calibration, camera, crowd, detection, graz, multitarget, multiview, network, object, outdoor, panorama, pedestrian, tracking, video
The MSR Action datasets are a collection of various 3D datasets for action recognition. See details http://research.microsoft.com/en-us/um/people/zliu/a…
3d, action, detection, recognition, reconstruction, video
The ICG Multi-Camera datasets consist of: Easy Data Set (just one person), Medium Data Set (3-5 persons, used for the experiments), Hard Data Set (crowd…
calibration, camera, detection, graz, indoor, multitarget, multiview, object, pedestrian, tracking, video
The MOT Challenge is a framework for the fair evaluation of multiple people tracking algorithms. In this framework we provide: - A large collection of d…
http://motchallenge.net/
3d, benchmark, dataset, evaluation, multiple, pedestrian, people, surveillance, target, tracking, video
An indoor action recognition dataset which consists of 18 classes performed by 20 individuals. Each action is individually performed for 8 times (4 daytim…
action, cross-view, indoor, multi-camera, open-view, recognition, video
The crowd datasets are collected from a variety of sources, such as UCF and data-driven crowd datasets. The sequences are diverse, representing dense crow…
anomaly, crowd, detection, human, pedestrian, scene, understanding, video
The UMD Dynamic Scene Recognition dataset consists of 13 classes and 10 videos per class and is used to classify dynamic scenes. The dataset has been de…
classification, dynamic, motion, recognition, scene, video
This dataset package contains the software and data used for Detection-based Object Labeling on the RGB-D Scenes Dataset as implemented in the paper: De…
3d, depth, indoor, kinect, object, recognition, reconstruction
This ETHZ CVL RueMonge 2014 dataset is used for 3D reconstruction and semantic mesh labelling for urban scene understanding. It was first published in [1] …
3d, architecture, benchmark, classification, code, mesh, outdoor, paris, pointcloud, recognition, reconstruction, segmentation, semantic, source, urban
The TVPR dataset includes 23 registration sessions. Each of the 23 folders contains the video of one registration session. Acquisitions have been performe…
clothing, depth, gender, identification, indoor, people, person, recognition, reidentification, top-view, video
Yahoo Flickr Creative Commons 100M (YFCC100M) dataset contains a list of photos and videos. This list is compiled from data available on Yahoo! Flickr. Al…
3d, clustering, community, detection, flickr, image, internet, landmark, recognition, reconstruction, social
The BEOID dataset includes object interactions ranging from preparing a coffee to operating a weight lifting machine and opening a door. The dataset is re…
3d, egocentric, interaction, object, pose, tracking, video
The GaTech VideoContext dataset consists of over 100 groundtruth annotated outdoor videos with over 20000 frames for the task of geometric context evalua…
classification, context, geometry, nature, outdoor, segmentation, semantic, supervised, unsupervised, urban, video
The 2D-3D-S dataset provides a variety of mutually registered modalities from 2D, 2.5D and 3D domains, with instance-level semantic and geometric annotati…
3d, building, depth, indoor, large-scale, normal, panorama, reconstruction, segmentation, semantic
The Airport MotionSeg dataset contains 12 sequences of videos of an airport scenario with small and large moving objects and various speeds. It is challen…
airport, camera, clustering, motion, segmentation, video, zoom
The Lane Level Localization dataset was collected on a highway in San Francisco with the following properties: * Reasonable traffic * Multiple lane hig…
3d, autonomous, benchmark, car, driving, gps, localization, map, road, video
For the first few decades of the field's existence, computer vision has been focused on algorithmic, logical approaches to perception. But it was only with…
3d, depth, indoor, kinect, object, recognition, reconstruction
The Make3D Depth dataset is designed to learn features to estimate scene depth from a single image. This dataset contains aligned image and range data: …
depth estimation, indoor, learning, outdoor, single view
The Video2GIF dataset contains over 100,000 pairs of GIFs and their source videos. The GIFs were collected from two popular GIF websites (makeagif.com, gi…
gif, scene, summarization, summary, understanding, video highlight detection
The High Definition Analytics (HDA) dataset is a multi-camera High-Resolution image sequence dataset for research on High-Definition surveillance: Pedestr…
benchmark, camera, detection, high-definition, human, indoor, lisbon, multiview, network, pedestrian, re-identification, surveillance, tracking, video
Abstract: Scene understanding has (again) become a focus of computer vision research, leveraging advances in detection, context modeling, and tracking. In…
3d, car, classification, pedestrian, scene, segmentation, semantic, understanding
The Mall dataset was collected from a publicly accessible webcam for crowd counting and profiling research. Ground truth: Over 60,000 pedestrians were …
counting, crowd, detection, indoor, pedestrian, tracking, video, webcam
The Video Summarization (SumMe) dataset consists of 25 videos, each annotated with at least 15 human summaries (390 in total). The data consists of videos…
action, benchmark, event, groundtruth, human, summary, video
We collected a video dataset, termed ChokePoint, designed for experiments in person identification/verification under real-world surveillance conditions u…
clustering, detection, face, human, identification, multiview, pedestrian, real, recognition, sequence, surveillance, world
The Hollywood-2 dataset contains 12 classes of human actions and 10 classes of scenes distributed over 3669 video clips and approximately 20.1 hours of video i…
action classification, segmentation, video
This dataset comprises information regarding the ADLs performed by two users on a daily basis in their own homes. This dataset is composed of two instanc…
classification, clustering, multivariate, sequential, time-series
Paris-rue-Madame dataset contains 3D Mobile Laser Scanning (MLS) data from rue Madame, a street in the 6th Parisian district (France). The test zone conta…
3d, classification, laser, pointcloud, segmentation, semantic
The Notre Dame de Paris dataset is used for 3D SfM reconstruction and contains 715 images provided by Noah Snavely. There are also versions for NotreDame b…
3d, 3d reconstruction, flickr, frontview, landmark, limited, paris, pointcloud, sfm
The Interactive Segmentation (IcgBench) dataset from Jakob Santner contains 243 images and 262 segmentations. Some images have multiple segmentations. The …
interactive segmentation, user
The MSRC v1 dataset from Microsoft Research in Cambridge contains 240 images and 9 object classes with coarse pixel-wise labeled images. The dataset is c…
outdoor, semantic, semantic segmentation
It is composed of ADL (activities of daily living) and fall actions simulated by 11 volunteers. The people involved in the test are aged between 22 and 39, wit…
accelerometer, action, depth, fall detection - adl, human, kinect, recognition, video, wearable
Estimate robust and reliable depth or motion fields on our challenging real world videos!
optical flow, outdoor, stereo depth
Voxel Based Dataset for Systematic 3D reconstruction by artificial neural networks (ANNs). A synthetic scalable cube dataset for training, testing and v…
3d, deep learning, reconstruction, sfm, synthetic city urban
The CMP Facade dataset consists of facade images assembled at the Center for Machine Perception, which includes 600 rectified images of facades from vario…
classification, facade, recognition, rectification, segmentation, semantic, similarity, structure, urban
52 columns for 52 weeks; normalised values are provided too.
clustering, multivariate, time-series
The Multicamera Human Action Video Data (MuHAVi) Manually Annotated Silhouette Data (MAS) are two datasets consisting of selected action sequences for th…
action, background, behavior, human, segmentation, video
Ongoing research on university faculty perceptions and practices of using Wikipedia as a teaching resource. Based on a Technology Acceptance Model, the re…
causal-discovery, clustering, multivariate, regression
The UrbanStreet dataset used in the paper can be downloaded here [188M]. It contains 18 stereo sequences of pedestrians taken from a stereo rig mounted o…
detection, human, multitarget, pedestrian, recognition, segmentation, tracking, urban, video
This dataset was used in several classification tasks related to the challenge of anuran species recognition through their calls. It is a multilabel data…
classification, clustering, multivariate
A dataset of 66 stereo pairs with their subpixel ground truths. The construction and improvement of algorithms for subpixel stereovision requires very prec…
3d, depth, groundtruth, noise, pointcloud, stereo, stereovision, subpixel
This web page presents visual-inertial datasets collected on-board a Micro Aerial Vehicle (MAV). The datasets contain stereo images, synchronized IMU meas…
aerial vehicles, global shutter, indoor, slam
This data set contains 13,910 measurements from 16 chemical sensors exposed to 6 gases at different concentration levels. This dataset is an extension of …
causal-discovery, classification, clustering, multivariate, regression, time-series
This dataset provides a collection of web images and 3D models for research on landmark recognition (especially for methods based on 3D models). We hope i…
3d, classification, codebook, feature, flickr, landmark, matching, recognition, reconstruction, retrieval
The Where Who Why (WWW) dataset provides 10,000 videos with over 8 million frames from 8,257 diverse scenes, therefore offering a superior comprehensive d…
crowd, detection, flow, optical, pedestrian, recognition, surveillance, video
The users' knowledge classes were classified by the authors using intuitive knowledge classifier (a hybrid ML technique of k-NN and meta-heuristic…
classification, clustering, multivariate
The SPHERE human skeleton movements dataset was created using a Kinect camera, which measures distances and provides a depth map of the scene instead of th…
action, behavior, depth, human, kinect, motion, movement, skeleton, video
Some datasets and evaluation tools are provided on this page for four different computer vision and computer graphics problems. Population counting Lin…
3d, counting, crowd, detection, groundtruth, line, network, object, pedestrian, pointcloud, reconstruction, road, surface, urban
The Ford Car dataset is a joint effort of Pandey et al. (for collecting images, Lidar points, calibration etc.) and us (for annotation of 2D and 3D objects)…
3d, car, detection, groundtruth, lidar, sfm
The Daimler Mono Pedestrian Detection Benchmark dataset contains a large training and test set. The training set contains 15,560 pedestrian samples (image…
detection, mono, object, outdoor, pedestrian, scale, urban
The Daimler Urban Segmentation Dataset consists of video sequences recorded in urban traffic. The dataset consists of 5000 rectified stereo image pairs wi…
motion, outdoor, segmentation, semantic, stereo, urban
The YouTube-Objects dataset is composed of videos collected from YouTube by querying for the names of 10 object classes. It contains between 9 and 24 vide…
detection, flow, object, optical, segmentation, video
ISPRS and EuroSDR - Benchmark on High Density Aerial Image Matching Background and Scope of the project Innovations in matching algorithms as well as …
3d, aerial, benchmark, city, germany, multiview, photogrammetry, reconstruction, switzerland, urban
The automated analysis of facial expressions has been widely used in different research areas, such as biometrics or emotional analysis. Special import…
classification, clustering, multivariate, sequential
Many different labeled video datasets have been collected over the past few years, but it is hard to compare them at a glance. So we have created a handy …
action, benchmark, classification, detection, object, recognition, video
Brief Description of the Dataset: Each of the 19 activities is performed by eight subjects (4 female, 4 male, between th…
classification, clustering, multivariate, time-series
The data is in the transactional form. It contains the Latin names (species or genus) and state abbreviations.
clustering, multivariate
The examined group comprised kernels belonging to three different varieties of wheat: Kama, Rosa and Canadian, 70 elements each, randomly selected for the…
classification, clustering, multivariate
The automotive multi-sensor (AMUSE) dataset consists of inertial and other complementary sensor data combined with monocular, omnidirectional, high frame …
api, city, image, inertial, streetside, traffic, urban, video
The domain-specific personal videos highlight dataset from the paper [1] describes a fully automatic method to train a domain-specific highlight ranker for…
action, domain, human, recognition, saliency, summarization, video, wearable
The 3D Mask Attack Database (3DMAD) is a biometric (face) spoofing database. It currently contains 76500 frames of 17 persons, recorded using Kinect for b…
3d, biometry, emotion, face, frontview, recognition, segmentation
The 3DVis dataset includes a set of 12 heterogeneous scenes for testing 3D scene registration and analysis methods. Models include homogeneous shapes, rep…
3d, matching, reconstruction, registration, shape, symmetry
A Vicon motion capture camera system was used to record 12 users performing 5 hand postures with markers attached to a left-handed glove. A rigid patter…
classification, clustering, multivariate
ISPRS Test Project on Urban Classification and 3D Building Reconstruction The ISPRS working group III/4 announces the release of the 2D semantic labelin…
3d, building, city, classification, recognition, reconstruction, semantic, urban
A dataset acquired with 3 synchronized sensors (Primesense Carmine 1.09, Microsoft Kinect v2, Canon IXUS 950 IS), featuring: * 30 industry-relevant obje…
3d, estimation, object, pose, rgbd, texture-less
Welcome to the homepage of the gvvperfcapeva datasets. This site serves as a hub to access a wide range of datasets that have been created for projects of…
action, depth, face, human, mesh, multiview, pose, reconstruction, tracking, video
The PIROPO database (People in Indoor ROoms with Perspective and Omnidirectional cameras) comprises multiple sequences recorded in two different indoor ro…
detection, fisheye, human, indoor, omnidirectional, people, perspective, room, surveillance
This dataset consists of 51 oral presentations recorded with 2 ambient visual sensors (web-cam), 3 First Person View (FPV) cameras (1 on presenter and 2 on rand…
analysis, kinect, multi-sensor, presentation, quality, video
KEGG Metabolic pathways can be realized as a network. Two kinds of network / graph can be formed. These include Reaction Network and Relation Network. In …
classification, clustering, multivariate, regression, text, univariate
Scene Parsing Benchmark Scene parsing data and part segmentation data derived from ADE20K dataset can be downloaded from MIT Scene Parsing Benchmark. m…
annotation, benchmark, recognition, scene, segmentation, semantic
Zurich Hoengg (Switzerland) is an aerial dataset. The dataset consists of 4 aerial images in colour (Figures 2-5), scanned with 14 microns, the format …
aerial, outdoor, semantic segmentation
At Udacity, we believe in democratizing education. How can we provide opportunity to everyone on the planet? We also believe in teaching really amazing an…
autonomous, car, classification, detection, driving, recognition, robot, segmentation, street, synthetic, time, urban, video
Background information: The data set concerns the earliest history of mankind. Prehistoric men created the desired shape of a stone tool by striking on a …
causal-discovery, classification, clustering, multivariate
The GaTech VideoStab dataset consists of N videos for the task of video stabilization. This code is implemented in Youtube video editor for stabilization.…
camera, path, stabilization, video
The dataset is the subset of RCV1. This corpus has already been used in author identification experiments. In the top 50 authors (with respect to total s…
classification, clustering, domain-theory, multivariate, text
The dataset contains 15 documentary films that are downloaded from YouTube, whose durations vary from 9 minutes to as long as 50 minutes, and the total nu…
detection, object, video
The dataset collects data from an Android smartphone positioned in the chest pocket. Accelerometer Data are collected from 22 participants walking in the …
classification, clustering, sequential, time-series, univariate
The goal of LabelMe is to provide an online annotation tool to build image databases for computer vision research. You can contribute to the database by v…
object detection, outdoor, semantic, semantic segmentation, software, urban
The CIFAR-10 dataset consists of 60000 32x32 colour images in 10 classes, with 6000 images per class. There are 50000 training images and 10000 test image…
color, image classification, object, patch, scene, tiny
This is a transnational data set which contains all the transactions occurring between 01/12/2010 and 09/12/2011 for a UK-based and registered non-store o…
classification, clustering, multivariate, sequential, time-series
The Webcam Interestingness dataset consists of 20 different webcam streams, with 159 images each. It is annotated with interestingness ground truth, acqui…
classification, interest, ranking, retrieval, video, weather, webcam
The Rent3D dataset comprises floorplans and images. The goal of this work is to enable a 3D virtual-tour of an apartment given a small set of monocular im…
apartment, building, floorplan, indoor, layout, reconstruction, urban
This archive contains 2075259 measurements gathered between December 2006 and November 2010 (47 months). Notes: 1.(global_active_power*1000/60 - sub_mete…
clustering, multivariate, regression, time-series
The UCF Person and Car VideoSeg dataset consists of six videos with groundtruth for video object segmentation. Surfing, jumping, skiing, sliding, big ca…
camera, groundtruth, model, motion, object, segmentation, video
Please see the README for the details on the data organization, and so on.
classification, clustering, multivariate, text
This site is dedicated to providing datasets for the Robotics community with the aim to facilitate result evaluations and comparisons. The datasets presente…
3d, city, laser, nature, urban
The Robotic 3D Scan Repository from Osnabrueck contains 23 different datasets showing a variety of 3D scans for objects, humans, cities, university campus…
3d, aerial, bremen, city, germany, heat, human, laser, lidar, osnabrueck, reconstruction, scan, urban
The Farman Institute 3D Point Sets dataset contains 11 objects scanned by a 3D laser scanner. This dataset was peer-reviewed by Image Processing On Line: Farman I…
3d, laser, model, object, point, reconstruction, scanner
This web page contains video data and ground truth for 16 dances with two different dance patterns. The style of dancing is inspired by Scottish Ceilidh d…
action, analysis, background, chemistry, dance, motion, pattern, video
The xawAR16 dataset is a multi-RGBD camera dataset, generated inside an operating room (IHU Strasbourg), which was designed to evaluate tracking/relocaliz…
depth, medicine, operation, recognition, surgery, table, video
This dataset comes from the daily measures of sensors in an urban waste water treatment plant. The objective is to classify the operational state of the pl…
clustering, multivariate
The Pittsburgh Fast-food Image dataset (PFID) consists of 4545 still images, 606 stereo pairs, 303 360° videos for structure from motion, and 27 privacy-pr…
classification, food, laboratory, real, recognition, reconstruction, video
The Places205 database contains 2.5 million images from 205 scene categories for the academic public. The image dataset contains 2,448,873 images from 205 sc…
feature, learning, place, recognition, scene, urban
The experiments have been carried out with a group of 115 first-year undergraduate Engineering students of the University of Genoa. We carried…
classification, clustering, multivariate, regression, sequential, time-series
CSV format where each row is a paper and each column is an attribute.
clustering, multivariate
A Vicon motion capture camera system was used to record 12 users performing 5 hand postures with markers attached to a left-handed glove. A rigid patter…
classification, clustering, multivariate
The Longterm Pedestrian dataset consists of images from a stationary camera running 24 hours for 7 days at about 1 fps. It is used for adaptive detection an…
background, change, coffee, detection, graz, illumination, indoor, multitarget, pedestrian, robust
The Daimler Mono Pedestrian Classification Benchmark dataset consists of two parts: a base data set. The base data set contains a total of 4000 pedestri…
classification, illumination, object, outdoor, pedestrian, scale, urban
JPL First-Person Interaction dataset (JPL-Interaction dataset) is composed of human activity videos taken from a first-person viewpoint. The dataset parti…
action, human, interactive, motion, recognition, video
The data set has no missing values. Values are in kW for each 15-minute interval. To convert the values to kWh, they must be divided by 4. Each column represents one client. S…
clustering, regression, time-series
Style, Price, Rating, Size, Season, NeckLine, SleeveLength, waiseline, Material, FabricType, Decoration, Pattern, Type, Recommendation are Attributes in d…
classification, clustering, text
These are Atlantic-Mediterranean marine sponges that belong to O.Hadromerida (Demospongiae.Porifera).
clustering, multivariate
Background Models Challenge (BMC) is a complete dataset and competition for the comparison of background subtraction algorithms. The main topics concern:…
background, change, detection, modeling, motion, segmentation, surveillance, video
KEGG Metabolic pathways can be realized as a network. Two kinds of network / graph can be formed. These include Reaction Network and Relation Network. In …
classification, clustering, multivariate, regression, text, univariate
The Video Segmentation Benchmark (VSB100) provides ground truth annotations for the Berkeley Video Dataset, which consists of 100 HD quality videos divide…
benchmark, groundtruth, motion, object, pedestrian, segmentation, tracking, video
The PETS 2006 dataset contains 7 parts showing multi-sensor sequences containing left-luggage scenarios with increasing scene complexity at a train statio…
frontview, indoor, multitarget, object detection, object tracking, pedestrian
The GaTech VideoSeg dataset consists of two (waterski and yunakim?) video sequences for object segmentation. There exists no groundtruth segmentation an…
camera, model, motion, object, segmentation, video
The dataset represents 10 years (1999-2008) of clinical care at 130 US hospitals and integrated delivery networks. It includes over 50 features representi…
classification, clustering, multivariate
This dataset contains 600 examples of control charts synthetically generated by the process in Alcock and Manolopoulos (1999). There are six different cla…
classification, clustering, time-series
Aiguille du Midi. France showing photographs with Camera: Mamiya ZD. 55mm. - Resolution: 5Mpixels, 53 images - Photographer: B. Vallet (Imagine/EVD - 2006…
3d reconstruction, large scale, mesh, outdoor, sfm
The MSRC vNIPS dataset is the MSRC v2 dataset with new annotations for much more accurate segmentations for 93 images. Efficient Inference in Fully Conn…
outdoor, semantic, semantic segmentation
The MSR RGB-D Dataset 7-Scenes dataset is a collection of tracked RGB-D camera frames. The dataset may be used for evaluation of methods for different app…
depth, kinect, location, reconstruction, tracking, video
The Dataset for ADL Recognition with Wrist-worn Accelerometer is a public collection of labelled accelerometer data recordings to be used for the creation…
classification, clustering, multivariate, time-series
The York Urban Line Segment Database is a compilation of 102 images (45 indoor, 57 outdoor) of urban environments consisting mostly of scenes from the cam…
geometry, manhattan, outdoor, pose estimation, reconstruction, urban, vanishing point
Unlike the previous SHREC contests, the objective of this SHREC 2012 contest is to evaluate the performance of 3D-mesh segmentation techniques instead of …
3d, mesh, part, segmentation
Indoor localization is a key topic for mobile computing. However, it is still very difficult for the mobile sensing community to compare state-of-the-art Indo…
classification, clustering, multivariate, regression, sequential, time-series
Gaze data on video stimuli for computer vision and visual analytics. Converted 318 video sequences from several different gaze tracking data sets with p…
gaze data, metadata, polygon annotation, segmentation, video
The PD and control handwriting database consists of 62 PWP (People with Parkinson's) and 15 healthy individuals who appealed at the Department of Neurology …
classification, clustering, multivariate, regression
Indoor localisation is a key topic for the Ambient Intelligence (AmI) research community. In this scenario, recent advancements in wearable technologie…
classification, clustering, multivariate, regression, sequential, time-series
The city planar and non-planar dataset consists of urban scenes accompanied by text files describing the plane/non-plane locations. Training Set (Univer…
3d, building, detection, estimation, plane, urban
The database of nude and non-nude videos contains a collection of 179 video segments collected from the following movies: Alpha Dog, Basic Instinct, Befor…
movie, nude detection, video
The HCI 4D Lightfields dataset contains 11 objects with corresponding lightfields for depth estimation. Datasets can be downloaded individually below. F…
3d, 4d, benchmark, depth, evaluation, lightfield, reconstruction
The RGB-D Person Re-identification dataset is for person re-identification using depth information. The main motivation is that the standard techniques (s…
3d, classification, depth, identification, pedestrian, shape
In predicting stock prices you collect data over some period of time - day, week, month, etc. But you cannot take advantage of data from a time period unt…
classification, clustering, time-series
The Street View Text (SVT) dataset contains 647 words and 3796 letters in 249 images harvested from Google Street View. The dataset is more challenging …
classification, outdoor, text detection, text recognition, urban
The current video database contains six types of human actions (walking, jogging, running, boxing, hand waving and hand clapping) performed several time…
action classification, segmentation, video
The Babenko tracking dataset contains 12 video sequences for single object tracking. For each clip they provide (1) a directory with the original ima…
animal, face, object tracking, occlusion, single, video
The 3D shape description dataset consists of multiple sub-datasets Descriptor Matching - Dataset 1 & 2 (Stanford) These datasets, created from some of …
3d, benchmark, description, matching, reconstruction, registration, shape
The dataset (movement_libras) contains 15 classes of 24 instances each, where each class refers to a hand movement type in LIBRAS. In the video pre-p…
classification, clustering, multivariate, sequential
The dataset collects data from a wearable accelerometer mounted on the chest. Sampling frequency of the accelerometer: 52 Hz. Accelerom…
classification, clustering, sequential, time-series, univariate
News are grouped into clusters that represent pages discussing the same news story. The dataset also includes references to web pages that, at the access…
classification, clustering, multivariate
The data set was gathered while we were working on a project for Bahrain University between 2002 and 2003.
classification, clustering, domain-theory, univariate
Automatic identification of commercial blocks in news videos finds a lot of applications in the domain of television broadcast analysis and monitoring. Co…
classification, clustering, multivariate
The Fish4Knowledge project (groups.inf.ed.ac.uk/f4k/) is pleased to announce the availability of 2 subsets of our tropical coral reef fish video and ext…
animal, camera, classification, fish, motion, nature, recognition, video, water
The dataset consists of a total of 3600 documents including 600 news/texts from six categories: economy, culture-arts, health, politics, sports and techno…
classification, clustering, text
Real-time readings collected for residential, commercial, industrial and agriculture consumers, to find the accuracy of consumption in Tamil Nadu around Thanjavur.
classification, clustering, multivariate, regression
For complete information see the official challenge page: [Web Link]
causal-discovery, clustering, domain-theory, multivariate, sequential, time-seriesThe dataset is in the form of a 11463 x 5812 matrix of word counts, containing 11463 words and 5811 NIPS conference papers (the first column contains the …
clustering, text

The Oxford RobotCar Dataset contains over 100 repetitions of a consistent route through Oxford, UK, captured over a period of over a year. The dataset cap…
autonomous, car, classification, detection, driving, recognition, robot, segmentation, street, time, urban, video, year

The SegTrack dataset consists of six videos (five are used) with ground truth pixelwise segmentation (6th penguin is not usable). The dataset is used for …
camera, flow, groundtruth, model, motion, object, optical, proposal, segmentation, stationary, video

The Leeds Cows dataset by Derek Magee consists of 14 different video sequences showing a total of 18 cows walking from right to left in front of different…
animal, background, cow, detection, segmentation, video

The DrivFace database contains image sequences of subjects while driving in real scenarios. It is composed of 606 samples of 640x480 pixels each, acquired…
classification, clustering, multivariate, regression

Please find the original data at '[Web Link]'
classification, clustering, multivariate, time-series

The Berkeley Video Segmentation Dataset (BVSD) contains videos for segmentation (boundary?), split into a train set and a test set.
benchmark, segmentation, video

ISPRS Test Project on Urban Classification, 3D Building Reconstruction and Semantic Labeling. In this part of our working group site you will get further …
3d, aerial, benchmark, canada, city, germany, multiview, photogrammetry, recognition, segmentation, semantic, urban

ISPRS / EuroSDR Benchmark for Multi-Platform Photogrammetry. In these pages you can get information about the BENCHMARK FOR MULTI-PLATFORM PHOTOGRAMMETRY…
3d, aerial, benchmark, city, germany, multiview, photogrammetry, reconstruction, switzerland, urban

The QMUL Junction dataset is a busy traffic scenario for research on activity analysis and behavior understanding. Video length: 1 hour (90000 frames)…
behavior, counting, crowd, detection, motion, pedestrian, tracking, video

The experiments have been carried out with a group of 30 volunteers within an age bracket of 19-48 years. Each person performed six activities (WALKING, W…
classification, clustering, multivariate, time-series

Documents are first obtained via a Web search using AMIEI: an integrated platform for delivering enterprise intelligence, developed by AMI Software ([Web …
clustering, multivariate, sequential, text

The Traffic Video dataset consists of X video of an overhead camera showing a street crossing with multiple traffic scenarios. The dataset can be downlo…
detection, overhead, road, tracking, traffic, urban, video, view

Open University Learning Analytics Dataset (OULAD) contains data about courses, students and their interactions with Virtual Learning Environment (VLE) fo…
classification, clustering, multivariate, regression, sequential, time-series

This is a subset of the dataset introduced in the SIGGRAPH Asia 2009 paper, Webcam Clip Art: Appearance and Illuminant Transfer from Time-lapse Sequences.…
camera, change, illumination, light, nature, static, time, urban, video, webcam

The procedural texture perceptual similarity dataset contains a list of procedural textures along with their pairwise distances, as defined by a perceptua…
benchmark, procedural, study, texture

The HandNet dataset contains depth images of 10 participants' hands deforming non-rigidly in front of a RealSense RGB-D camera. This dataset includes 214…
articulation, classification, detection, fingertip, hand, pose, rgbd, segmentation, video

This dataset was constructed by adding elevation information to a 2D road network in North Jutland, Denmark (covering a region of 185 x 135 km^2). Elevati…
clustering, regression, sequential, text

We present a new large-scale dataset that contains a diverse set of stereo video sequences recorded in street scenes from 50 different cities, with high q…
car, cities, detection, pedestrian, person, segmentation, semantic, stereo, urban, video, weakly

HTRU2 is a data set which describes a sample of pulsar candidates collected during the High Time Resolution Universe Survey (South) [1]. Pulsars are a r…
classification, clustering, multivariate

- The leaves were placed on a white background and then photographed. - The pictures were taken in broad daylight to ensure optimum light intensity.
classification, clustering, multivariate

The CALTECH 101 dataset by Li Fei-Fei contains images for 101 categories with about 40 to 800 images per category. Most categories have about 50 images at…
centered, image classification, natural-image, object, scene

CSV format where each row is a paper and each column an attribute.
clustering, multivariate

For each text collection, D is the number of documents, W is the number of words in the vocabulary, and N is the total number of words in the collection (…
clustering, text

The Weather and Illumination Database (WILD) is an extensive database of high quality images of an outdoor urban scene, acquired every hour over all seaso…
camera, change, depth, estimation, illumination, light, newyork, static, time, urban, video, weather, webcam

The Cholec80 dataset contains 80 videos of cholecystectomy surgeries performed by 13 surgeons. The videos are captured at 25 fps. The dataset is labeled w…
medicine, phase, recognition, surgery, tool, video

The MSRC v2 dataset is an extension of the MSRC v1 dataset from Microsoft Research in Cambridge. It contains 591 images and 23 object classes with accurat…
outdoor, semantic, semantic segmentation

The dataset captures 25 people preparing 2 mixed salads each and contains over 4h of annotated accelerometer and RGB-D video data. Annotated activities co…
action, activity, classification, detection, recognition, tracking, video

Provide all relevant information about your data set.
classification, clustering, multivariate

The measurements were created to ease the development, comparison and evaluation of fingerprinting based hybrid indoor positioning methods. The measuremen…
causal-discovery, classification, clustering, text

This dataset contains 7 challenging volleyball activity classes annotated in 6 videos from professionals in the Austrian Volley League (season 2011/12). A…
action, activity recognition, analysis, detection, sport, video, volleyball

The object is a plaster reproduction of the Temple of the Dioskouroi in Agrigento, Sicily. Click on thumbnail for a full-sized (640x480) image. Resolution of …
3d, 3d reconstruction, benchmark, multiview, sfm

We introduce the Shelf dataset for multiple human pose estimation from multiple views. In addition we annotate the body joints in the Campus dataset from …
3d, capture, estimation, human, motion, multiple, pose, view

The 1DSfM Landmarks dataset is a collection of community-based image reconstructions by Kyle Wilson, comprising 14 datasets with comparison to bundler grou…
3d, benchmark, city, groundtruth, landmark, reconstruction, urban

The video co-segmentation dataset contains 4 video sets with 11 videos in total, with 5 frames of each video labeled with the pixel-level ground-trut…
co-segmentation, dataset, segmentation, video

The data set consists of the expression levels of 77 proteins/protein modifications that produced detectable signals in the nuclear fraction of cortex. Th…
classification, clustering, multivariate

We present a dataset to address the problem of visual privacy - where users unintentionally leak private information when sharing personal images online, …
classification, flickr, multilabel, privacy, regression, scene

ShakeFive2: a collection of 8 dyadic human interactions with accompanying skeleton metadata. The metadata is frame-based XML data containing the skeleton…
human, interaction, kinect, video

This corpus has been collected from free or free for research sources on the Internet: -> A collection of 425 SMS spam messages was manually extracted fr…
classification, clustering, domain-theory, multivariate, text

The data was collected as part of the 1990 census. There are 68 categorical attributes. This data set was derived from the USCensus1990raw data set. The…
clustering, multivariate

The CALTECH 256 dataset by Li Fei-Fei contains 30607 images for 256 categories.
centered, classification, detection, image, object, scene

* Audio track (encoded as mp3) of each of the 106,574 tracks. It is on average 10 million samples per track. * Nine audio features (consisting of 518 attr…
classification, clustering, multivariate, time-series

The dataset is composed of features extracted from 7 videos with people gesticulating, aiming at studying Gesture Phase Segmentation. Each video is repres…
classification, clustering, multivariate, sequential, time-series

The Symmetry Facades dataset contains 9 building facades with multiple images. It is used for coupled symmetry and structure from motion detection. Couple…
3d, building, facade, reconstruction, repetition, sfm, symmetry, urban

The Leuven Stereo Scene dataset is a scene and depth dataset. There exist two variants of this dataset - a CVPR 2007 paper [1] by Leibe et al. for detecti…
3d, depth, leuven, reconstruction, segmentation, semantic, sfm, stereo, urban

Due to size limitations, the images of the Stable Structure from Motion datasets cannot be put online. Instead here are the tracked image points and the final re…
3d, 3d reconstruction, church, geometry, landmark, robust, sfm, stability

The characters here were used for a PhD study on primitive extraction using HMM based models. The data consists of 2858 character samples, contained in th…
classification, clustering, time-series

The Heterogeneity Dataset for Human Activity Recognition from Smartphone and Smartwatch sensors consists of two datasets devised to investigate sensor het…
classification, clustering, multivariate, time-series

The Pornography database contains nearly 80 hours of 400 pornographic and 400 non-pornographic videos. For the pornographic class, we have browsed website…
pornography, video, video frames, video shots

The Weizmann actions dataset by Blank, Gorelick, Shechtman, Irani, and Basri consists of ten different types of actions: bending, jumping jack, jumping, j…
action, action classification, segmentation, video

The object is a plaster dinosaur (stegosaurus). Click on thumbnail for a full-sized (640x480) image. Resolution of ground truth model: 0.00025m (you may w…
3d, 3d reconstruction, benchmark, multiview, sfm

The Freiburg-Berkeley Motion Segmentation Dataset (FBMS-59) is an extension of the BMS dataset with 33 additional video sequences. A total of 720 frames i…
benchmark, groundtruth, motion, object, pedestrian, segmentation, tracking, video

The Landmark 1000 or 1k dataset is a collection of the 1000 most popular landmarks mined from Flickr. It is maintained by Noah Snavely and publish…
3d, estimation, landmark, location, pointcloud, pose, reconstruction, world

Samples (instances) are stored row-wise. Variables (attributes) of each sample are RNA-Seq gene expression levels measured by the Illumina HiSeq platform.
classification, clustering, multivariate

The KTH Multiview Football dataset contains 771 images of football players, including images taken from 3 views at 257 time instances, with 14 annotated body join…
camera, detection, game, multitarget, multiview, object, outdoor, pedestrian, pose, recognition, soccer, tracking

Salient Montages is a human-centric video summarization dataset from the paper [1]. In [1], we present a novel method to generate salient montages f…
human, montage, saliency, summarization, video, wearable

The Eurasian Cities dataset contains 103 images of outdoor urban scenes taken in Eurasian cities. It is annotated with horizontal and vertical vanishing p…
geometry, line, manhattan, outdoor, point, pose, reconstruction, urban, vanishing

The PETS 2009 dataset contains 3 parts showing multi-view sequences containing pedestrians walking in an outdoor environment. The parts are used for perso…
detection, frontview, human, occlusion, multitarget, outdoor, overlap, pedestrian, tracking

The multiple foreground video co-segmentation dataset consists of four sets, each with a video pair and two foreground objects in common. The dataset …
co-segmentation, segmentation, video

The TUG (Timed Up and Go test) dataset consists of actions performed three times by 20 volunteers. The people involved in the test are aged between 22 and…
accelerometer, action, depth image processing - tug, human, kinect, recognition, time, video, wearable

This is a sparse data set: less than 10% of the attributes are used for each sample. The link is to a '*.tgz' file which contains two files: [amzn-anon-ac…
causal-discovery, clustering, domain-theory, regression, time-series

These datasets were generated for the M2CAI challenges, a satellite event of MICCAI 2016 in Athens. Two datasets are available for two different challenge…
challenge, medicine, recognition, surgery, video, workflow

The New College Data Set contains 30GB of data intended for use by the mobile robotics and vision research communities. Our anticipated users are parties …
3d, navigation, odometry, panorama, path, reconstruction, stereo, urban

The VidPairs dataset contains 133 pairs of images, taken from 1080p HD (~2 megapixel) official movie trailers. Each pair consists of images of the same sc…
dense, description, flow, matching, optical, pair, patch, video