The NYU-Depth V2 data set is comprised of video sequences from a variety of indoor scenes as recorded by both the RGB and Depth cameras from the Microsoft Kinect. It features: 1449 densely labeled pairs of aligned RGB and depth images 464 new scenes taken from 3 cities 407,024 new unlabeled frames Each object is labeled with a class and an instance number (cup1, cup2, cup3, etc) The dataset has several components: Labeled: A subset of the video data accompanied by dense multi-class labels. This data has also been preprocessed to fill in missing depth labels. Raw: The raw rgb, depth and accelerometer data as provided by the Kinect. Toolbox: Useful functions for manipulating the data and labels. 464 different indoor scenes 26 scene types 407,024 unlabeled frames 1449 densely labeled frames 1000+ Classes Inpainted and raw depth available Both object and instance labels
The NYU-Depth data set is comprised of video sequences from a variety of indoor scenes as recorded by both the RGB and Depth cameras from the Microsoft ...
semantic segmentation, kinect, label, reconstruction, depthFor the first few decades of the fields existence, computer vision has been focused on algorithmic, logical approaches to perception. But it was only wi...
object, 3d, kinect, reconstruction, depth, recognition, indoorThis dataset package contains the software and data used for Detection-based Object Labeling on the RGB-D Scenes Dataset as implemented in the paper: ...
object, 3d, kinect, reconstruction, depth, recognition, indoorThe MSR RGB-D Dataset 7-Scenes dataset is a collection of tracked RGB-D camera frames. The dataset may be used for evaluation of methods for different a...
video, kinect, location, reconstruction, depth, trackingThe CHALEARN Multi-modal Gesture Challenge is a dataset +700 sequences for gesture recognition using images, kinect depth, segmentation and skeleton dat...
gesture, skeleton, kinect, depth, human, recognition, action, illumination, segmentationThe 2D-3D-S dataset provides a variety of mutually registered modalities from 2D, 2.5D and 3D domains, with instance-level semantic and geometric annota...
reconstruction, depth, large-scale, indoor, normal, building, panorama, segmentation, 3d, semanticThe Cambridge-driving Labeled Video Database (CamVid) dataset from Gabriel Brostow [?] contains ten minutes of video footage and corresponding semantica...
urban, 3d reconstruction, semantic segmentation, semantic, sfm, depthWe take advantage of our autonomous driving platform Annieway to develop novel challenging real-world computer vision benchmarks. Our tasks of interest ...
stereo, object tracking, depth, reconstruction, detection tracking, object detection, segmentation, odometry, optical flow, semantic car depth, sfmThe SPHERE human skeleton movements dataset was created using a Kinect camera, that measures distances and provides a depth map of the scene instead of ...
motion, skeleton, kinect, movement, depth, human, action, video, behaviorIt is composed of ADL (activity daily living) and fall actions simulated by 11 volunteers. The people involved in the test are aged between 22 and 39, w...
wearable, kinect, fall detection - adl, depth, human, recognition, action, accelerometer, videoThe HCI 4D Lightfields dataset contains 11 objects with corresponding lightfields for depth estimation. Datasets can be downloaded individually below....
3d, benchmark, evaluation, reconstruction, depth, 4d, lightfieldThe Shefeld Kinect Gesture (SKIG) dataset contains 2160 hand gesture sequences (1080 RGB sequences and 1080 depth sequences) collected from 6 subjects. ...
illumination, gesture, kinect, depth, recognition, human, actionThe Leuven Stereo Scene dataset is a scene and depth dataset. There exist two variants of this dataset - a CVPR 2007 paper [1] by Leibe et al. for detec...
urban, stereo, depth, reconstruction, leuven, segmentation, 3d, semantic, sfmWelcome to the homepage of the gvvperfcapeva datasets. This site serves as a hub to access a wide range of datasets that have been created for projects ...
face, reconstruction, depth, mesh, human, action, video, pose, multiview, trackingThe ECP New York dataset contains 10 manually segmented buildings from New York City, USA. Segmentation evaluating using Dice coefficient is calculated ...
urban, newyork, semantic segmentation, semantic, procedural reconstructionSome datasets and evaluation tools are provided on this page for four different computer vision and computer graphics problems. Population counting L...
urban, surface, reconstruction, pointcloud, object, road, pedestrian, network, line, 3d, crowd, counting, detection, groundtruthThe DTU Robot dataset consists of color images of 60 scenes acquired in a controlled setup from 119 different positions and under different lighting. Fo...
illumination, feature matching, sfm, reconstruction, feature detection, feature descriptionThis ETHZ CVL RueMonge 2014 dataset used for 3D reconstruction and semantic mesh labelling for urban scene understanding. It was first published in [1...
benchmark, paris, reconstruction, pointcloud, outdoor, 3d, source, architecture, semantic, code, urban, mesh, recognition, segmentation, classificationThe MSRC vNIPS dataset is the MSRC v2 dataset with new annotations for much more accurate segmentations for 93 images. Efficient Inference in Fully Co...
semantic segmentation, semantic, outdoorChairGest is an open challenge / benchmark. The task consists in spotting and recognizing gestures from multiple synchronized sensors: 1 Kinect and 4 X...
gesture, detection, benchmark, kinect, recognition, humanThe Farman Institute 3D Point Sets dataset contains 11 objects by a 3D laser scanner. This dataset was peer-reviewed by Image Processing On Line: Farman...
object, scanner, 3d, reconstruction, point, model, laserThe York Urban Line Segment Database is a compilation of 102 images (45 indoor, 57 outdoor) of urban environments consisting mostly of scenes from the c...
vanishing point, urban, reconstruction, outdoor, pose estimation, manhattan, geometryThe 3DVis dataset includes a set of 12 heterogeneous scenes for testing 3D scene registration and analysis methods. Models include homogeneous shapes, r...
3d, registration, reconstruction, shape, matching, symmetryThe Robotic 3D Scan Repository from Osnabrueck contains 23 different datasets showing a veriaty of 3D scans for objects, humans, cities, university camp...
lidar, scan, urban, reconstruction, human, laser, heat, aerial, germany, 3d, bremen, city, osnabrueckISPRS / EuroSDR Benchmark for Multi-Platform Photogrammetry In these pages you can get information about the BENCHMARK FOR MULTI-PLATFORM PHOTOGRAMMET...
urban, benchmark, reconstruction, aerial, photogrammetry, germany, 3d, multiview, switzerland, cityThe RGB-D Person Re-identification dataset is for person re-identification using depth information. The main motivation is that the standard techniques ...
pedestrian, 3d, identification, classification, depth, shapeThis dataset provides a collection of web images and 3D models for research on landmark recognition (especially for methods based on 3D models). We hope...
codebook, reconstruction, matching, recognition, retrieval, 3d, classification, feature, flickr, landmarkThe TVPR dataset includes 23 registration sessions. Each of the 23 folders contains the video of one registration session. Acquisitions have been perfor...
person, depth, recognition, indoor, top-view, video, clothing, gender, reidentification, identification, peopleThis is a dataset of rectified facade images and semantic labels. The goal of the annotation is to study the layout of the facades. It contains 50 im...
urban, semantic segmentation, semantic, procedural reconstruction, grazThe eTrims dataset is comprised of two datasets, the 4-Class eTRIMS Dataset with 4 annotated object classes and the 8-Class eTRIMS Dataset with 8 annota...
urban, semantic segmentation, procedural reconstructionThe Eurasian Cities dataset contains 103 images of outdoor urban scenes taken in Eurasian cities. It is annotated with horizontal and vertical vanishing...
urban, vanishing, reconstruction, manhattan, outdoor, line, pose, point, geometryThe New College Data Set contains 30GB of data intended for use by the mobile robotics and vision research communities. Our anticipated users are partie...
urban, stereo, reconstruction, path, panorama, 3d, odometry, navigationThe 3D shape description dataset consists of multiple sub-datasets Descriptor Matching - Dataset 1 & 2 (Stanford) These datasets, created from some o...
description, 3d, benchmark, registration, reconstruction, shape, matchingt is composed of food intake movements, recorded with Kinect V1 (320240 depth frame resolution), simulated by 35 volunteers for a total of 48 tests. The...
kinect, age, intake, pointcloud, human, tracking, monitoring, groundtruth, food, behaviorThe Pittsburgh Fast-food Image dataset (PFID) consists of 4545 still images, 606 stereo pairs, 3033600 videos for structure from motion, and 27 privacy-...
video, laboratory, classification, reconstruction, real, food, recognitionA 66 stereo pairs dataset with their subpixel ground truths. The construction and improvement of algorithms for subpixel stereovision requires very pr...
stereo, depth, pointcloud, noise, stereovision, 3d, groundtruth, subpixelThe TUG (Timed Up and Go test) dataset consists of actions performed three times by 20 volunteers. The people involved in the test are aged between 22 a...
wearable, kinect, time, human, recognition, action, depth image processing - tug, accelerometer, videoSceneNet RGB-D is dataset comprised of 5 million Photorealistic Images of Synthetic Indoor Trajectories with Ground Truth. It expands the previous work ...
trajectory, reconstruction, scene, slam, lighting, indoor, segmentation, robot, rendering, 3d, synthetic, navigationISPRS Test Project on Urban Classification and 3D Building Reconstruction The ISPRS working group III/4 announces the release of the 2D semantic label...
urban, reconstruction, recognition, building, 3d, classification, city, semanticThe Weather and Illumination Database (WILD) is an extensive database of high quality images of an outdoor urban scene, acquired every hour over all sea...
urban, estimation, depth, weather, time, newyork, webcam, video, illumination, change, static, camera, lightThe Synthetic CAD Models dataset consists of X synthetic CAD models for detection (planar) primitives. Efficient RANSAC for Point-Cloud Shape Detectio...
ransac, reconstruction, synthetic, primitive, model fitting, 3d objectInstance recognition from depth data. Contains various challenges of Pose, Clutter, Occlusion and similar looking objects (Bonde, U., Badrinarayanan, V....
detection, instance, depth, poseVoxel Based Dataset for Systematic 3D reconstruction by artificial neural networks (ANNs). A synthetic scalable cube dataset for training, testing and...
deep learning, synthetic city urban, 3d, sfm, reconstructionISPRS and EuroSDR - Benchmark on High Density Aerial Image Matching Background and Scope of the project Innovations in matching algorithms as well a...
urban, benchmark, reconstruction, aerial, photogrammetry, germany, 3d, multiview, switzerland, cityThe goal of LabelMe is to provide an online annotation tool to build image databases for computer vision research. You can contribute to the database by...
urban, semantic segmentation, software, semantic, outdoor, object detectionThe Microsoft Research Cambridge-12 Kinect gesture dataset consists of sequences of human movements, represented as body-part locations, and the associa...
gesture, recognition, human, action, kinectThe xawAR16 dataset is a multi-RGBD camera dataset, generated inside an operating room (IHU Strasbourg), which was designed to evaluate tracking/relocal...
video, medicine, table, depth, operation, recognition, surgeryShakeFive2 A collection of 8 dyadic human interactions with accompanying skeleton metadata. The metadata is frame based xml data containing the skelet...
video, human, kinect, interactionThe Ecole Centrale Paris 2010 (Paris 2010) dataset consists of 30 images of densely annotated building facades in seven classes - wall, window, sky, sho...
urban, semantic segmentation, semantic, paris, procedural reconstructionThe 1DSfM Landmarks is a collection of community-based image reconstruction by Kyle Wilson and is comprised of 14 datasets with comparison to bundler gr...
urban, 3d, benchmark, city, reconstruction, landmark, groundtruthThe Landmark 1000 or 1k dataset is a collection of the top 1000 popular flickr landmarks mined from flickr. It is maintained by Noah Snavely and publi...
estimation, location, reconstruction, pointcloud, world, 3d, pose, landmarkA synthetic light field dataset with 24 scenes. Data provided for each scene: - 9x9x512x512x3 light fields as individual PNGs - config files with c...
ground truth, light field, disparity, depth, syntheticThe ICG Graz240 dataset consists of 240 buildings with 5400 redundant images with a total of 5542 window instances. Window detection itself is difficult...
urban, semantic segmentation, semantic, object detection, grazAn evaluation benchmark for dense MVS for these datasets fountain-P11, Herz-Jesu-P8, entry-P10, castle-P19, Herz-Jesu-P25, castle-P30 . Images (correc...
3d reconstruction, benchmark, sfm, depth, dense, meshYahoo Flickr Creative Commons 100M (YFCC100M) dataset contains a list of photos and videos. This list is compiled from data available on Yahoo! Flickr. ...
internet, reconstruction, recognition, image, community, social, 3d, clustering, detection, flickr, landmarkCMU/VMR Urban Image+Laser dataset contains 372 images linked with 3D laser points projections. There are additional images (due to the laser scanner bei...
urban, 3d reconstruction, laser, semantic segmentation, sfmThe following are multiview stereo data sets captured in our lab: a set of images, camera parameters and extracted apparent contours of a single rigid o...
3d reconstruction, sfm, depth, dense, meshZurich Hoengg (Switzerland) is an aerial dataset. The dataset consists of 4 aerial images in colour (Figures 2-5), scanned with 14 microns, the forma...
semantic segmentation, aerial, outdoorThe MSR Action datasets is a collection of various 3D datasets for action recognition. See details http://research.microsoft.com/en-us/um/people/zliu...
video, detection, 3d, action, reconstruction, recognitionThese sequences were used for our video interpolation work described in High-quality video view interpolation using a layered representation, C.L. Zi...
segmentation, 3d reconstruction, camera, depthThe MSRC v2 dataset is an extension of the MSRC v1 dataset from Microsoft Research in Cambridge. It contains 591 images and 23 object classes with accur...
semantic segmentation, semantic, outdoorThis dataset consist 51 oral presentation recorded with 2 ambient visual sensor (web-cam), 3 First Person View (FPV) cameras (1 on presenter and 2 on ra...
video, quality, kinect, multi-sensor, presentation, analysisThe ECP Paris 2011 dataset consists of 104 images taken from rue Monge in the fifth district of Paris, we kept only 20 for training and 10 for testing. ...
urban, semantic segmentation, semantic, paris, procedural reconstructionThis repository contains labeled 3-D point cloud laser data collected from a moving platform in a urban environment. Data are provided for research purp...
urban, 3d reconstruction, laser, semantic segmentation, sfmThe Rent3D dataset comprises floorplans and images. The goal of this work is to enable a 3D virtual-tour of an apartment given a small set of monocular ...
building, urban, reconstruction, floorplan, layout, apartment, indoorThe Yotta dataset consists of 70 images for semantic labeling given in 11 classes. It also contains multiple videos and camera matrices for 14km or driv...
urban, reconstruction, video, segmentation, 3d, classification, camera, semanticThe MSRC v1 dataset from Microsoft Research in Cambridge contains 240 images and 9 object classes with coarse pixel-wise labeled images. The dataset is...
semantic segmentation, semantic, outdoorThe CMU Geometric Context dataset by Derek Hoiem, Alexei A. Efros, Martial Hebert consists of 300 images used for training and testing the geometric con...
single view, 3d reconstruction, geometry, depth, contextThe Symmetry Facades dataset contains 9 building facades with multiple images. It used for coupled symmetry and structure from motion detection. Coup...
urban, reconstruction, facade, building, 3d, repetition, symmetry, sfm