The CMU Geometric Context dataset by Derek Hoiem, Alexei A. Efros, Martial Hebert consists of 300 images used for training and testing the geometric context method. We extend our framework from Automatic Photo Pop-up by subclassifying vertical regions into planar (facing left, center, or right) and non- planar (porous and solid). We also provide extensive quantitative evaluation and demonstrate the usefulness of the geometric labels as context for object detection. Note that all images were contained using Google image search, using keywords such as "city", "outdoor", "field", and "road". The original content providers maintain copyrights on these images. Geometric Context from a Single Image D. Hoiem, A.A. Efros, and M. Hebert, ICCV 2005. Contents: *.jpg: 300 images used for training and testing allimsegs2.mat: contains ground truth imsegs: for each image contains superpixel images (segimage) and ground truth label for each superpixel (vert_labels, horz_labels) rand_indices.mat: cluster_images: indices for learning segmentation cv_images: indices for cross-validation (in blocks of 50) http://www.cs.illinois.edu/homes/dhoiem/projects/context/ http://www.cs.illinois.edu/homes/dhoiem/projects/data.html http://www.cs.illinois.edu/homes/dhoiem/projects/software.html
The Stable Structure from Motion datasets due to size limitations cannot put the images online. Instead here are the tracked image points and the final ...
church, stability, 3d reconstruction, 3d, robust, geometry, landmark, sfmAn evaluation benchmark for dense MVS for these datasets fountain-P11, Herz-Jesu-P8, entry-P10, castle-P19, Herz-Jesu-P25, castle-P30 . Images (correc...
3d reconstruction, benchmark, sfm, depth, dense, meshThe SAMANTHA (Structure-and-Motion Pipeline on a Hierarchical Cluster Tree) dataset contains 4 sequences for 3D reconstruction: Pozzoveggiani, Piazza Da...
3d reconstruction, geometry, sfm, landmark, model fittingThe GaTech VideoContext dataset consists of over 100 groundtruth annotated outdoor videos with over 20000 frames for the task of geometric context eval...
urban, nature, outdoor, video, segmentation, supervised, classification, context, unsupervised, geometry, semanticThese sequences were used for our video interpolation work described in High-quality video view interpolation using a layered representation, C.L. Zi...
segmentation, 3d reconstruction, camera, depthThe Cambridge-driving Labeled Video Database (CamVid) dataset from Gabriel Brostow [?] contains ten minutes of video footage and corresponding semantica...
urban, 3d reconstruction, semantic segmentation, semantic, sfm, depthThe NBVbench is a reference object and benchmark criteria for defining and evaluating the performance of a next best view (NBV) method.
planning, 3d reconstruction, next best view, geometryThe following are multiview stereo data sets captured in our lab: a set of images, camera parameters and extracted apparent contours of a single rigid o...
3d reconstruction, sfm, depth, dense, meshThe NYU-Depth data set is comprised of video sequences from a variety of indoor scenes as recorded by both the RGB and Depth cameras from the Microsoft ...
semantic segmentation, kinect, label, reconstruction, depthZurich City Hall dataset (also CIPA dataset) nformation: Place: City Hall, Zurich, Switzerland Number of Images: 15, 1280 x 1000 pixels Camera: Fuj...
urban, 3d reconstruction, photogrammetry, sfm, zurichFor the first few decades of the fields existence, computer vision has been focused on algorithmic, logical approaches to perception. But it was only wi...
object, 3d, kinect, reconstruction, depth, recognition, indoorThe Make3D Depth dataset s designed to learn features to estimate scene depth from a single image. This dataset contains aligned image and range data:...
single view, learning, indoor, outdoor, depth estimationThis repository contains labeled 3-D point cloud laser data collected from a moving platform in a urban environment. Data are provided for research purp...
urban, 3d reconstruction, laser, semantic segmentation, sfmThe SPHERE human skeleton movements dataset was created using a Kinect camera, that measures distances and provides a depth map of the scene instead of ...
motion, skeleton, kinect, movement, depth, human, action, video, behaviorInstance recognition from depth data. Contains various challenges of Pose, Clutter, Occlusion and similar looking objects (Bonde, U., Badrinarayanan, V....
detection, instance, depth, poseThe Stanford Background Dataset is a new dataset introduced in Gould et al. (ICCV 2009) for evaluating methods for geometric and semantic scene understa...
segmentation, urban, geometry, semantic, classification, natureThe xawAR16 dataset is a multi-RGBD camera dataset, generated inside an operating room (IHU Strasbourg), which was designed to evaluate tracking/relocal...
video, medicine, table, depth, operation, recognition, surgeryThe Quad 6K dataset is a Structure-from-Motion dataset taken at Arts Quad at Cornell University campus and consists of 6514 images with ground truth pos...
urban, 3d reconstruction, groundtruth, sfm, landmark, 3d gpsThe Aachen dataset consists of 4479 images taken with multiple cameras (3GB), 369 query images taken with the camera of a mobile phone together with the...
image retrieval, 3d reconstruction, aachen, sfm, landmarkThe Shefeld Kinect Gesture (SKIG) dataset contains 2160 hand gesture sequences (1080 RGB sequences and 1080 depth sequences) collected from 6 subjects. ...
illumination, gesture, kinect, depth, recognition, human, actionSince its launch in September 1999, Space Imaging IKONOS earth imaging satellite has provided a reliable stream of image data that has become the standa...
urban, 3d reconstruction, photogrammetry, aerial, sfmWelcome to the homepage of the gvvperfcapeva datasets. This site serves as a hub to access a wide range of datasets that have been created for projects ...
face, reconstruction, depth, mesh, human, action, video, pose, multiview, trackingThe MSR RGB-D Dataset 7-Scenes dataset is a collection of tracked RGB-D camera frames. The dataset may be used for evaluation of methods for different a...
video, kinect, location, reconstruction, depth, trackingThe CHALEARN Multi-modal Gesture Challenge is a dataset +700 sequences for gesture recognition using images, kinect depth, segmentation and skeleton dat...
gesture, skeleton, kinect, depth, human, recognition, action, illumination, segmentationA synthetic light field dataset with 24 scenes. Data provided for each scene: - 9x9x512x512x3 light fields as individual PNGs - config files with c...
ground truth, light field, disparity, depth, syntheticThe Colosseum and San Marco are two image datasets for dense multiview stereo reconstructions used for evaluating the visual photo realism. The datase...
urban, aerial, streetside, 3d reconstruction, photo-realism, flickr, landmark, sfmThe Paris500k dataset consists of 501,356 geotagged images collected from Flickr and Panoramio. The dataset was collected from a geographic bounding box...
panoramio, paris, image retrieval, 3d reconstruction, geotag, flickr, landmark, sfmThe 2D-3D-S dataset provides a variety of mutually registered modalities from 2D, 2.5D and 3D domains, with instance-level semantic and geometric annota...
reconstruction, depth, large-scale, indoor, normal, building, panorama, segmentation, 3d, semanticThe NYU-Depth V2 data set is comprised of video sequences from a variety of indoor scenes as recorded by both the RGB and Depth cameras from the Microso...
semantic segmentation, kinect, label, reconstruction, depthThe York Urban Line Segment Database is a compilation of 102 images (45 indoor, 57 outdoor) of urban environments consisting mostly of scenes from the c...
vanishing point, urban, reconstruction, outdoor, pose estimation, manhattan, geometryThis dataset package contains the software and data used for Detection-based Object Labeling on the RGB-D Scenes Dataset as implemented in the paper: ...
object, 3d, kinect, reconstruction, depth, recognition, indoorThe RGB-D Person Re-identification dataset is for person re-identification using depth information. The main motivation is that the standard techniques ...
pedestrian, 3d, identification, classification, depth, shapeThe TVPR dataset includes 23 registration sessions. Each of the 23 folders contains the video of one registration session. Acquisitions have been perfor...
person, depth, recognition, indoor, top-view, video, clothing, gender, reidentification, identification, peopleThe Notre Dame de Paris dataset used for 3D SfM reconstruction and contains 715 images provided by Noah Snavely. There are also version for NotreDame...
paris, pointcloud, frontview, limited, 3d reconstruction, 3d, flickr, landmark, sfmWe take advantage of our autonomous driving platform Annieway to develop novel challenging real-world computer vision benchmarks. Our tasks of interest ...
stereo, object tracking, depth, reconstruction, detection tracking, object detection, segmentation, odometry, optical flow, semantic car depth, sfmThe object is a plaster reproduction of Temple of the Dioskouroi in Agrigento, Sicily. Click on thumbnail for a full-sized (640x480) image. Resolution o...
3d reconstruction, 3d, benchmark, sfm, multiviewThe Eurasian Cities dataset contains 103 images of outdoor urban scenes taken in Eurasian cities. It is annotated with horizontal and vertical vanishing...
urban, vanishing, reconstruction, manhattan, outdoor, line, pose, point, geometryIt is composed of ADL (activity daily living) and fall actions simulated by 11 volunteers. The people involved in the test are aged between 22 and 39, w...
wearable, kinect, fall detection - adl, depth, human, recognition, action, accelerometer, videoThe Dubrovnik6K and Rome16K datasets are image collections for SfM reconstruction, where the suffix refers to the number of images in the dataset. Dub...
urban, 3d reconstruction, dubrovnik, sfm, landmark, romeCMU/VMR Urban Image+Laser dataset contains 372 images linked with 3D laser points projections. There are additional images (due to the laser scanner bei...
urban, 3d reconstruction, laser, semantic segmentation, sfmThe object is a plaster dinosaur (stegosaurus). Click on thumbnail for a full-sized (640x480) image. Resolution of ground truth model: 0.00025m (you may...
3d reconstruction, 3d, benchmark, sfm, multiviewThe Weather and Illumination Database (WILD) is an extensive database of high quality images of an outdoor urban scene, acquired every hour over all sea...
urban, estimation, depth, weather, time, newyork, webcam, video, illumination, change, static, camera, lightA 66 stereo pairs dataset with their subpixel ground truths. The construction and improvement of algorithms for subpixel stereovision requires very pr...
stereo, depth, pointcloud, noise, stereovision, 3d, groundtruth, subpixelThe Leuven Stereo Scene dataset is a scene and depth dataset. There exist two variants of this dataset - a CVPR 2007 paper [1] by Leibe et al. for detec...
urban, stereo, depth, reconstruction, leuven, segmentation, 3d, semantic, sfmAiguille du Midi. France showing photographs with Camera: Mamiya ZD. 55mm. - Resolution: 5Mpixels, 53 images - Photographer: B. Vallet (Imagine/EVD - 20...
3d reconstruction, large scale, sfm, outdoor, meshThe HCI 4D Lightfields dataset contains 11 objects with corresponding lightfields for depth estimation. Datasets can be downloaded individually below....
3d, benchmark, evaluation, reconstruction, depth, 4d, lightfieldThe Stanford 3D Scanning Repository dataset is a compilation of 3D scans of objects like Stanford Bunny, Happy Buddha, Dragon, Armadillo and Lucy. These...
triangulation, 3d reconstruction, laser, bunnyThe Google Street View Pittsburgh Research dataset is a street-level image collection provided by Google for research purposes. The dataset provided ...
panorama, pittsburgh, urban, 3d reconstruction, sfmThe Symmetric Bundle Adjustment dataset contains four sequences of the CAB building, Barcelona, Redmond and Capitole for 3D reconstruction considering s...
urban, 3d reconstruction, symmetry, sfm, bundle adjustmentThe Microsoft COCO (mscoco) is an image recognition and segmentation dataset which contains more 300k images for more than 70 categories. Other featur...
object, segmentation, benchmark, semantic, context, recognition, detectionThis dataset contains two image collections, TempleOfHeaven and SportsArena, that are deemed hard for Structure-from-Motion (SfM). The method is desc...
ambiguous structures, 3d reconstruction, structure-from-motion