In HouseCraft, we utilize rental ads to create realistic textured 3D models of building exteriors. In particular, we exploit the address of the property and its floorplan, which are typically available in the ad. The address allows us to extract Google StreetView images around the building, while the buildings floorplan allows for an efficient parametrization of the building in 3D via a small set of random variables. Our approach is able to precisely estimate the geometry and location of the property, and can create realistic 3D building models. The original SydneyHouse dataset contains 174 random houses in Sydney with: Annotated floorplan Adress and map Up to three streetview images, with computed semantic features Accurate house location and vertical heights (in accordance with streetview observations) Hang Chu, Shenlong Wang, Raquel Urtasun, Sanja Fidler. HouseCraft: Building Houses from Rental Ads and Street Views. ECCV 2016.
The Inria Aerial Image Labeling addresses a core topic in remote sensing: the automatic pixelwise labeling of aerial imagery (link to paper). Dataset ...
house, urban, aerial, building, segmentation, footprint, groundtruth, city, semanticISPRS Test Project on Urban Classification, 3D Building Reconstruction and Semantic Labeling. In this part of our working group site you will get furthe...
urban, benchmark, recognition, aerial, canada, segmentation, photogrammetry, germany, 3d, multiview, city, semanticThe Zurich Summer v1.0 dataset is a collection of 20 chips (crops), taken from a QuickBird acquisition of the city of Zurich (Switzerland) in August 200...
annotation, urban, pan, gsd, superpixel, nir, aerial, satellite, segmentation, zurich, rgb, city, semanticThe Paris Art Deco Facades dataset consists of 79 / 80 images of rectified facades of the architectural style Art Deco, which has different sizes of win...
urban, paris, grammar, facade, recognition, segmentation, procedural, architecture, semantic, cityISPRS Test Project on Urban Classification and 3D Building Reconstruction The ISPRS working group III/4 announces the release of the 2D semantic label...
urban, reconstruction, recognition, building, 3d, classification, city, semanticThis ETHZ CVL RueMonge 2014 dataset used for 3D reconstruction and semantic mesh labelling for urban scene understanding. It was first published in [1...
benchmark, paris, reconstruction, pointcloud, outdoor, 3d, source, architecture, semantic, code, urban, mesh, recognition, segmentation, classificationThe GaTech VideoContext dataset consists of over 100 groundtruth annotated outdoor videos with over 20000 frames for the task of geometric context eval...
urban, nature, outdoor, video, segmentation, supervised, classification, context, unsupervised, geometry, semanticThe Yotta dataset consists of 70 images for semantic labeling given in 11 classes. It also contains multiple videos and camera matrices for 14km or driv...
urban, reconstruction, video, segmentation, 3d, classification, camera, semanticThe Rent3D dataset comprises floorplans and images. The goal of this work is to enable a 3D virtual-tour of an apartment given a small set of monocular ...
building, urban, reconstruction, floorplan, layout, apartment, indoorThe San Francisco Landmark Dataset for Mobile Landmark Recognition is a set of images and query images for localization. We present the San Francisco ...
urban, mobile, sanfrancisco, gps, retrieval, localization, landmark, city, calibrationThe 2D-3D-S dataset provides a variety of mutually registered modalities from 2D, 2.5D and 3D domains, with instance-level semantic and geometric annota...
reconstruction, depth, large-scale, indoor, normal, building, panorama, segmentation, 3d, semanticThe LabelMeFacade dataset contains buildings, windows, sky and a limited number of unlabeled regions (maximally 20% covering of the image). This procedu...
segmentation, urban, semantic, recognition, facade, rectifiedThe Stanford Background Dataset is a new dataset introduced in Gould et al. (ICCV 2009) for evaluating methods for geometric and semantic scene understa...
segmentation, urban, geometry, semantic, classification, natureThe Leuven Stereo Scene dataset is a scene and depth dataset. There exist two variants of this dataset - a CVPR 2007 paper [1] by Leibe et al. for detec...
urban, stereo, depth, reconstruction, leuven, segmentation, 3d, semantic, sfmThe Google Street View dataset contains 62,058 high quality Google Street View images. The images cover the downtown and neighboring areas of Pittsburgh...
pittsburgh, urban, manhattan, sphere, address, panorama, google, streetview, gps, retrieval, localizationThe CMP Facade dataset consists of facade images assembled at the Center for Machine Perception, which includes 600 rectified images of facades from var...
urban, similarity, facade, recognition, segmentation, structure, classification, rectification, semanticWe present a new large-scale dataset that contains a diverse set of stereo video sequences recorded in street scenes from 50 different cities, with high...
urban, stereo, cities, person, video, weakly, segmentation, pedestrian, detection, car, semanticThe Daimler Urban Segmentation Dataset consists of video sequences recorded in urban traffic. The dataset consists of 5000 rectified stereo image pairs ...
segmentation, urban, motion, stereo, semantic, outdoorhttp://www.cvlibs.net/datasets/kitti/eval_odometry.php Related Datasets TUM RGB-D Dataset: Indoor dataset captured with Microsoft Kinect and high-ac...
urban path 3d reconstruction, registration, navigation, localization, matching, slam, odometryThe Geosemantic is a dataset of object locations from GIS and a query image with metadata. It is used to project the buildings and streets that are in t...
geography, gps, segmentation, gis, supervised, semanticThe Oxford RobotCar Dataset contains over 100 repetitions of a consistent route through Oxford, UK, captured over a period of over a year. The dataset c...
driving, street, urban, time, recognition, autonomous, video, segmentation, robot, classification, detection, car, yearThe ECP New York dataset contains 10 manually segmented buildings from New York City, USA. Segmentation evaluating using Dice coefficient is calculated ...
urban, newyork, semantic segmentation, semantic, procedural reconstructionThe UrbanStreet dataset used in the paper can be downloaded here [188M] . It contains 18 stereo sequences of pedestrians taken from a stereo rig mounted...
urban, human, recognition, video, pedestrian, segmentation, tracking, multitarget, detectionAt Udacity, we believe in democratizing education. How can we provide opportunity to everyone on the planet? We also believe in teaching really amazing ...
driving, street, urban, time, recognition, autonomous, video, segmentation, robot, classification, detection, car, syntheticThe .enpeda.. Image Sequence Analysis Test Site (EISATS) offers sets of long bi- or trinocular image sequences recorded in the context of vision-based d...
motion, stereo, analysis, flow, segmentation, optical, semantic, visionZuBuD+, created in February 2017 by Federico Magliani (University of Parma), introduces many query images balancing the class evaluated from the previou...
building, image retrieval, urban, landmarkThe goal of LabelMe is to provide an online annotation tool to build image databases for computer vision research. You can contribute to the database by...
urban, semantic segmentation, software, semantic, outdoor, object detectionThe Swedish Traffic Sign Recognition provides Matlab code for parsing the annotation files and displaying the results. Part0 for each set contains the a...
urban, traffic, detection, city, sign, recognitionCOCO-Stuff augments the COCO dataset with pixel-level stuff annotations for 10,000 images. These annotations can be used for scene understanding tasks l...
annotation, benchmark, coco, segmentation, things, captioning, stuff, groundtruth, semanticAbstract Scene understanding has (again) become a focus of computer vision research, leveraging advances in detection, context modeling, and tracking. ...
scene, segmentation, pedestrian, 3d, classification, understanding, car, semanticParis-rue-Madame dataset contains 3D Mobile Laser Scanning (MLS) data from rue Madame, a street in the 6th Parisian district (France). The test zone con...
segmentation, 3d, semantic, classification, pointcloud, laserWe would like to announce the release of PASCAL-Context dataset. We augmented PASCAL VOC 2010 dataset with annotations for 400+ additional categories. I...
segmentation, benchmark, shape, recognition, pascal, category, semantic, denseThe ICG Graz240 dataset consists of 240 buildings with 5400 redundant images with a total of 5542 window instances. Window detection itself is difficult...
urban, semantic segmentation, semantic, object detection, grazThe PASCAL VOC is augmented with segmentation annotation for semantic parts of objects. For example, for the person category, we provide segmentation ma...
part, human, recognition, object, pedestrian, segmentation, pascal, detection, semanticThe Microsoft COCO (mscoco) is an image recognition and segmentation dataset which contains more 300k images for more than 70 categories. Other featur...
object, segmentation, benchmark, semantic, context, recognition, detectionThe ECP Paris 2011 dataset consists of 104 images taken from rue Monge in the fifth district of Paris, we kept only 20 for training and 10 for testing. ...
urban, semantic segmentation, semantic, paris, procedural reconstructionThe Symmetry Facades dataset contains 9 building facades with multiple images. It used for coupled symmetry and structure from motion detection. Coup...
urban, reconstruction, facade, building, 3d, repetition, symmetry, sfmThis site is dedicated to provide datasets for the Robotics community with the aim to facilitate result evaluations and comparisons. The datasets presen...
urban, laser, 3d, city, natureThe Robotic 3D Scan Repository from Osnabrueck contains 23 different datasets showing a veriaty of 3D scans for objects, humans, cities, university camp...
lidar, scan, urban, reconstruction, human, laser, heat, aerial, germany, 3d, bremen, city, osnabrueckISPRS / EuroSDR Benchmark for Multi-Platform Photogrammetry In these pages you can get information about the BENCHMARK FOR MULTI-PLATFORM PHOTOGRAMMET...
urban, benchmark, reconstruction, aerial, photogrammetry, germany, 3d, multiview, switzerland, cityThe Symmetry set dataset is a collection of images at different illuminations for the purpose of image matching using local symmetry features. Image M...
urban, matching, lighting, image, illumination, building, feature, symmetryThe city planar and non-planar datset consists of urban scenes accompanied by text files describing the plane/non-plane locations. Training Set (Univ...
building, urban, detection, 3d, estimation, planeISPRS and EuroSDR - Benchmark on High Density Aerial Image Matching Background and Scope of the project Innovations in matching algorithms as well a...
urban, benchmark, reconstruction, aerial, photogrammetry, germany, 3d, multiview, switzerland, citySVHN is a real-world image dataset for developing machine learning and object recognition algorithms with minimal requirement on data preprocessing and ...
urban, real, recognition, text, streetside, world, streetview, classification, detection, numberThe Ecole Centrale Paris 2010 (Paris 2010) dataset consists of 30 images of densely annotated building facades in seven classes - wall, window, sky, sho...
urban, semantic segmentation, semantic, paris, procedural reconstructionThe 1DSfM Landmarks is a collection of community-based image reconstruction by Kyle Wilson and is comprised of 14 datasets with comparison to bundler gr...
urban, 3d, benchmark, city, reconstruction, landmark, groundtruthThe automotive multi-sensor (AMUSE) dataset consists of inertial and other complementary sensor data combined with monocular, omnidirectional, high fram...
urban, api, image, video, inertial, streetside, traffic, cityScene Parsing Benchmark Scene parsing data and part segmentation data derived from ADE20K dataset could be download from MIT Scene Parsing Benchmark. ...
segmentation, annotation, benchmark, semantic, scene, recognitionThe TUD Crossing dataset from Micha Andriluka, Stefan Roth and Bernt Schiele consists of 201 images with 1008 highly overlapping pedestrians with signif...
urban, sideview, overlap, segmentation, pedestrian, tracking, multitarget, detectionThe Caltech Buildings dataset consists of images taken for 50 buildings around the Caltech campus. Five different images were taken for each building fr...
building, caltech, urban, retrieval, taxonomy, hierarchyThe Cambridge-driving Labeled Video Database (CamVid) dataset from Gabriel Brostow [?] contains ten minutes of video footage and corresponding semantica...
urban, 3d reconstruction, semantic segmentation, semantic, sfm, depthThis is a dataset of rectified facade images and semantic labels. The goal of the annotation is to study the layout of the facades. It contains 50 im...
urban, semantic segmentation, semantic, procedural reconstruction, grazAWS hosts a variety of public datasets that anyone can access for free. Previously, large datasets such as satellite imagery or genomic data have requi...
space, human, recognition, image, amazon, satellite, segmentation, learning, deep, classification, biology, resolutionPedestrian Color Naming (PCN) dataset contains 14,213 images, each of which hand-labeled with color label for each pixel.
segmentation, pedestrian, color namingThe Comprehensive Cars (CompCars) dataset contains data from two scenarios, including images from web-nature and surveillance-nature. The web-nature dat...
object, urban, fine-grained, classification, recognition, vehicle, car, attributeSheffield Building Image Dataset consists of over 3,000 low-resolution images of forty different buildings typically between 70 and 120 images per buil...
image retrieval, image classification, urban, sheffieldThe Quad 6K dataset is a Structure-from-Motion dataset taken at Arts Quad at Cornell University campus and consists of 6514 images with ground truth pos...
urban, 3d reconstruction, groundtruth, sfm, landmark, 3d gpsThis data set comprises 144 images of an edge profile cutting head of a milling machine. The head tool contains a total of 30 cutting inserts. The cutti...
profile, head, cutting, edge, tools, inserts, object, tool, milling, localization, wear, monitoringThe ICG Lab 6 (Multi-Camera Multi-Object Tracking) dataset contains 6 indoor people tracking scenarios recorded at our laboratory using 4 static Axis P1...
evaluation, graz, object, laboratory, pedestrian, segmentation, multiview, tracking, camera, detection, calibrationThe Outex dataset is part of a framework for empirical evaluation of texture classification and segmentation algorithms. The framework is being constru...
segmentation, benchmark, classification, synthetic, textureThe Video Segmentation Benchmark (VSB100) provides ground truth annotations for the Berkeley Video Dataset, which consists of 100 HD quality videos divi...
video, object, segmentation, motion, pedestrian, benchmark, tracking, groundtruthThe Zurich Building dataset (ZuBud) from Hao Shao, Tomas Svoboda and Luc Van Gool [?] contains 1005 images with 201 buildings each in five views. There ...
image retrieval, urban, procedural, rectificationThe CHALEARN Multi-modal Gesture Challenge is a dataset +700 sequences for gesture recognition using images, kinect depth, segmentation and skeleton dat...
gesture, skeleton, kinect, depth, human, recognition, action, illumination, segmentationThe Airport MotionSeg dataset contains 12 sequences of videos of an aiprort scenario with small and large moving objects and various speeds. It is chall...
video, segmentation, motion, airport, clustering, camera, zoomThis is a subset of the dataset introduced in the SIGGRAPH Asia 2009 paper, Webcam Clip Art: Appearance and Illuminant Transfer from Time-lapse Sequence...
urban, nature, time, webcam, video, illumination, change, static, camera, lightThe Daimler Mono Pedestrian Detection Benchmark dataset contains a large training and test set. The training set contains 15.560 pedestrian samples (ima...
object, mono, urban, pedestrian, outdoor, scale, detectionThe HandNet dataset contains depth images of 10 participants hands non-rigidly deforming infront of a RealSense RGB-D camera. This dataset includes 2...
rgbd, hand, articulation, video, segmentation, classification, pose, fingertip, detectionThe YouTube-Objects dataset is composed of videos collected from YouTube by querying for the names of 10 object classes. It contains between 9 and 24 vi...
video, object, flow, segmentation, detection, opticalA New Color Image Database for Benchmarking of Face Detection Techniques and Human Skin Segmentation Techniques. A new color face image database for ...
face, segmentation, skin, detection, benchmarkingThe SegTrack dataset consists of six videos (five are used) with ground truth pixelwise segmentation (6th penguin is not usable). The dataset is used fo...
motion, video, object, proposal, flow, segmentation, stationary, model, camera, optical, groundtruthThe Google Street View Pittsburgh Research dataset is a street-level image collection provided by Google for research purposes. The dataset provided ...
panorama, pittsburgh, urban, 3d reconstruction, sfmPermanently growing database on lung tuberculosis patients. The data include radiological images (CT+XRay) plus social, clinical, and lab data as well a...
medical, segmentation, xray, chest, genome, none, tuberculosis, ctThe ETHZ Shape classes dataset from Vittorio Ferrari [?] consists of five object classes and a total of 255 images. All classes contain significant intr...
clutter, swan, bottle, matching, nature, object detection by shape, mug, giraffe, segmentation, applelogoThe Textures volume currently contains 154 images, all monochrome, 129 512x512 and 25 1024x1024. For the Brodatz texture images, the number in parenth...
segmentation, benchmark, evaluation, classification, synthetic, textureThese sequences were used for our video interpolation work described in High-quality video view interpolation using a layered representation, C.L. Zi...
segmentation, 3d reconstruction, camera, depthThe Freiburg-Berkeley Motion Segmentation Dataset (FBMS-59) is an extension of the BMS dataset with 33 additional video sequences. A total of 720 frames...
motion, benchmark, video, object, pedestrian, segmentation, tracking, groundtruthThe CVC Partial Occlusion Virtual Pedestrian datasets (CVC-01 to CVC-06) cover a range of scenarios of occluded pedestrians generated in a virtual and r...
urban, pedestrian, classification, synthetic, occlusion, tracking, detectionAesthetic Visual Analysis (AVA) dataset studies the organization of content by aesthetic preference. It contains over 250,000 images along with a rich v...
memorability, image quality, semantic, aestheticsThe TRaffic ANd COngestionS (TRANCOS) dataset, a novel benchmark for (extremely overlapping) vehicle counting in traffic congestion situations. It consi...
urban, highway, spain, object, traffic, transportation, vehicle, detection, car10 videos as inputs, and segmented image sequences as ground-truth
segmentationThis repository contains labeled 3-D point cloud laser data collected from a moving platform in a urban environment. Data are provided for research purp...
urban, 3d reconstruction, laser, semantic segmentation, sfmBelgiumTSC dataset is built for traffic sign classification purposes. Is is a subset of BelgiumTS dataset and contains cropped images around annotations...
urban, traffic, road, classification, sign, belgiumThe Caltech Lanes dataset includes four clips taken around streets in Pasadena, CA at different times of day. The archive below includes 1225 individu...
caltech, urban, road, pasadena, detection, laneMultispectral Imaging (MSI) datasets were acquired using IRIS II which is a lightweight portable system comprising of a high resolution camera, a novel ...
illumination, wavelength, registration, alignment, matching, groundtruth, multi-spectralImage segmentation and boundary detection. Grayscale and color segmentations for 300 images, the images are divided into a training set of 200 images, a...
segmentationThe MSRC v2 dataset is an extension of the MSRC v1 dataset from Microsoft Research in Cambridge. It contains 591 images and 23 object classes with accur...
semantic segmentation, semantic, outdoorThe Berkeley Video Segmentation Dataset (BVSD) contains videos for segmentation (boundary?) Dataset train Dataset test
video, segmentation, benchmarkThe Caltech Pedestrian Dataset consists of approximately 10 hours of 640x480 30Hz video taken from a vehicle driving through regular traffic in an urban...
urban, pedestrian, object detectionThe MSRC v1 dataset from Microsoft Research in Cambridge contains 240 images and 9 object classes with coarse pixel-wise labeled images. The dataset is...
semantic segmentation, semantic, outdoor200 gray level images along with ground truth segmentations
segmentationThe Western GCO Segmentation problem instances are provided to compare effects of graph size, neighborhood size, length of s to t paths, regional arc co...
face, adhead, abdomen, liver, binary, medical, segmentation, optimization, bone, babyfaceThe Buffy dataset contains images selected from the TV series, Buffy: the Vampire Slayer. We select a set of 452 images from the first two episodes for ...
segmentation, human, buffy, movie, object detectionThe ETHZ Extended Shape classes dataset from Konrad Schindler is larger dataset of shape categories, created by merging ETHZ shape classes with Konrad S...
segmentation, clutter, object detection by shapeA benchmark dataset for the evaluation of retinal image registration methods is introduced. The dataset consists on 134 image pairs and is annotated wit...
image, retinal, fundus, retina, registration, eyeThis work attempts to provide two Hand Images Databases for hand biometrics: one is created using a mobile phone camera of modest quality, which we ca...
segmentation, person, identification, authentication, mobile, shape, biometric/ hand geometry, webcamSome datasets and evaluation tools are provided on this page for four different computer vision and computer graphics problems. Population counting L...
urban, surface, reconstruction, pointcloud, object, road, pedestrian, network, line, 3d, crowd, counting, detection, groundtruthZurich City Hall dataset (also CIPA dataset) nformation: Place: City Hall, Zurich, Switzerland Number of Images: 15, 1280 x 1000 pixels Camera: Fuj...
urban, 3d reconstruction, photogrammetry, sfm, zurichImage memorability dataset contains target and filler images, precomputed features and annotations, and memorability. It gives features and annotation...
memorability, image quality, semantic, aestheticsSince its launch in September 1999, Space Imaging IKONOS earth imaging satellite has provided a reliable stream of image data that has become the standa...
urban, 3d reconstruction, photogrammetry, aerial, sfmThe Symmetric Bundle Adjustment dataset contains four sequences of the CAB building, Barcelona, Redmond and Capitole for 3D reconstruction considering s...
urban, 3d reconstruction, symmetry, sfm, bundle adjustmentThe multiple foreground video co-segmentation dataset, consisting of four sets, each with a video pair and two foreground objects in common. The datase...
video, segmentation, co-segmentationThe Paris dataset consists of 6412 images. Images have high resolution and are in JPEG format. http://www.robots.ox.ac.uk/~vgg/data/parisbuildings/pa...
image retrieval, urban, landmark, parisThe FAce Semantic SEGmentation (FASSEG) repository contains datasets for multi-class semantic face segmentation. The FASSEG repository is composed by ...
face, segmentationWe present the 2017 DAVIS Challenge, a public competition specifically designed for the task of video object segmentation. Following the footsteps of ot...
code, quality, benchmark, video segmentation, object, segmentation, hd, tracking, resolutionThe Lane Level Localization dataset was collected on a highway in San Francisco with the following properties: * Reasonable traffic * Multiple lane h...
driving, benchmark, autonomous, video, road, gps, map, 3d, localization, carThe York Urban Line Segment Database is a compilation of 102 images (45 indoor, 57 outdoor) of urban environments consisting mostly of scenes from the c...
vanishing point, urban, reconstruction, outdoor, pose estimation, manhattan, geometryThe dataset is composed of 150 synthetic scenes, captured with a (perspective) virtual camera, and each scene contains 3 to 5 objects. The model set is ...
mesh, segmentation, recognition, syntheticThe 3DVis dataset includes a set of 12 heterogeneous scenes for testing 3D scene registration and analysis methods. Models include homogeneous shapes, r...
3d, registration, reconstruction, shape, matching, symmetryThis UIUC Cars dataset by Shivani Agarwal, Aatif Awan and Dan Roth contains images of side views of cars for use in evaluating object detection algorith...
urban, sideview, detection, car, recognition, scaleThe test sequences provide interested researchers a real-world multi-view test data set captured in the blue-c portals. The data is meant to be used for...
tracking, segmentation, camera, action, multiviewThe THUS10000 benchmark dataset comprises of 10,000 images, each of which has an unambiguous salient object and the object region is accurately annotate...
saliency, segmentation, salient object detection, attention, visualBelgiumTS is a large dataset with 10000+ traffic sign annotations, thousands of physically distinct traffic signs. 4 video sequences recorded with 8 hig...
urban, sign, belgium, road, traffic, classification, camera, calibrationThe TUD Pedestrians training dataset from Micha Andriluka, Stefan Roth and Bernt Schiele consists of 210 and 400 training images with X pedestrians with...
segmentation, pedestrian, sideview, object detectionThe GaTech VideoSeg dataset consists of two (waterski and yunakim?) video sequences for object segmentation. There exists no groundtruth segmentation ...
video, object, segmentation, motion, model, cameraA large dataset of geotagged face images collected from Flickr. The zip file contains text files containing urls of the images. Face2GPS: Estimating G...
gender, face, geotagged, classification, age, localization, humanThe 3D Mask Attack Database (3DMAD) is a biometric (face) spoofing database. It currently contains 76500 frames of 17 persons, recorded using Kinect for...
face, emotion, segmentation, 3d, recognition, biometry, frontviewThe eTrims dataset is comprised of two datasets, the 4-Class eTRIMS Dataset with 4 annotated object classes and the 8-Class eTRIMS Dataset with 8 annota...
urban, semantic segmentation, procedural reconstructionWe take advantage of our autonomous driving platform Annieway to develop novel challenging real-world computer vision benchmarks. Our tasks of interest ...
stereo, object tracking, depth, reconstruction, detection tracking, object detection, segmentation, odometry, optical flow, semantic car depth, sfmThis dataset contains videos of crowds and other high density moving objects. The videos are collected mainly from the BBC Motion Gallery and Getty Imag...
segmentationPlaces205 dataase contains 2.5 million images from 205 scene categories for the academic public. The image dataset contains 2,448,873 images from 205 ...
urban, learning, scene, feature, place, recognitionThe Eurasian Cities dataset contains 103 images of outdoor urban scenes taken in Eurasian cities. It is annotated with horizontal and vertical vanishing...
urban, vanishing, reconstruction, manhattan, outdoor, line, pose, point, geometryClassification/Detection Competitions, Segmentation Competition, Person Layout Taster Competition datasets
segmentationThe dataset consist of the about 50 hours obtained from kindergarten surveillance videos. Dataset, totally approximately 100 videos sequences (1000GB, 5...
segmentation, action, behavior, video surveillance, human, backgroundThe New College Data Set contains 30GB of data intended for use by the mobile robotics and vision research communities. Our anticipated users are partie...
urban, stereo, reconstruction, path, panorama, 3d, odometry, navigationPenn-Fudan Pedestrian Detection and Segmentation
segmentation, motion, background, pedestrian, detectionThe Pedestrian Parsing dataset contains 3,673 images from 171 videos of different Surveillance Scenes (PPSS), where 2,064 images are occluded and 1,609 ...
segmentation, pedestrian, parsingThe 3D shape description dataset consists of multiple sub-datasets Descriptor Matching - Dataset 1 & 2 (Stanford) These datasets, created from some o...
description, 3d, benchmark, registration, reconstruction, shape, matchingThe Colosseum and San Marco are two image datasets for dense multiview stereo reconstructions used for evaluating the visual photo realism. The datase...
urban, aerial, streetside, 3d reconstruction, photo-realism, flickr, landmark, sfmThe VOT2016 pixel-wise annotations dataset contains pixel-wise per-frame annotations for sequences from VOT2016 dataset. The annotation is in a form of ...
object, segmentation, annotation, mask, visual, trackingHollywood-2 datset contains 12 classes of human actions and 10 classes of scenes distributed over 3669 video clips and approximately 20.1 hours of video...
video, segmentation, action classificationThe PASCAL VOC Challenge datasets by Mark Everingham is a yearly dataset which has a central evaluation server and the final test data is not released. ...
chair, object detection, building, object segmentation, pedestrian, object pose, animal, car, airplaneThe Deformed Lattice Detection In Real-World Images dataset is used for regular grid detection. The authors have developed a robust and fast lattice det...
urban, symmetry, lattice detection, texture segmentationSceneNet RGB-D is dataset comprised of 5 million Photorealistic Images of Synthetic Indoor Trajectories with Ground Truth. It expands the previous work ...
trajectory, reconstruction, scene, slam, lighting, indoor, segmentation, robot, rendering, 3d, synthetic, navigationThe Weather and Illumination Database (WILD) is an extensive database of high quality images of an outdoor urban scene, acquired every hour over all sea...
urban, estimation, depth, weather, time, newyork, webcam, video, illumination, change, static, camera, lightGaze data on video stimuli for computer vision and visual analytics. Converted 318 video sequences from several different gaze tracking data sets with...
video, metadata, segmentation, gaze data, polygon annotationThe UCF Person and Car VideoSeg dataset consists of six videos with groundtruth for video object segmentation. Surfing, jumping, skiing, sliding, big ...
video, object, segmentation, motion, model, camera, groundtruthThe Berkeley DeepDrive Video Dataset contains 2x order of magnitude more video training data.
driving, urban, learning, endtoend, deep, autonomousThe Oxford Buildings dataset by James Philbin and Andrew Zisserman consists of 5062 images collected from Flickr by searching for particular Oxford land...
image retrieval, urban, oxford, landmarkThe Traffic Video dataset consists of X video of an overhead camera showing a street crossing with multiple traffic scenarios. The dataset can be down...
video, urban, traffic, road, overhead, tracking, view, detectionThe Brodatz dataset consists of 112 textures in grayscale images of various texture types. http://www.ee.oulu.fi/research/imag/texture/image_data/Brod...
segmentation, benchmark, classification, synthetic, textureThe current video database containing six types of human actions (walking, jogging, running, boxing, hand waving and hand clapping) performed several ti...
video, segmentation, action classificationScanNet is an RGB-D video dataset containing 2.5 million views in more than 1500 scans, annotated with 3D camera poses, surface reconstructions, and ins...
scene, layout, recognition, indoor, object, cad, segmentation, rendering, 3d, realism, room, syntheticThe TUD Campus dataset from Micha Andriluka, Stefan Roth and Bernt Schiele consists of 71 images and 303 highly overlapping pedestrians with large scale...
segmentation, pedestrian, sideview, object tracking, object detection, overlapCows for object segmentation, Five video sequences for motion segmentation
segmentationUnlike the previous SHREC contests, the objective of this SHREC 2012 contest is to evaluate the performance of 3D-mesh segmentation techniques instead o...
mesh, segmentation, 3d, partThe INRIA Horses dataset from Frederic Jurie and Vittorio Ferrari consists of 170 images with one or more horses in side-view at several scales and clut...
segmentation, clutter, horse, object detection by shape, natureDaimler Stereo Pedestrian Detection Benchmark C. Keller, M. Enzweiler, and D. M. Gavrila, A New Benchmark for Stereo-based Pedestrian Detection, Proc...
urban, pedestrian, object detectionUBC3V is a synthetic dataset for training and evaluation of single or multiview depth-based pose estimation techniques. The nature of the data is simila...
segmentation, multiview depth based pose estimationThe Multicamera Human Action Video Data (MuHAVi) Manually Annotated Silhouette Data (MAS) are two datasets consisting of selected action sequences for ...
video, segmentation, action, behavior, human, backgroundThe KU Leuven Facade dataset is used for architectural styles classification. M. Mathias, A. Martinovic, J. Weissenberg, S. Haegler, L. Van Gool: Auto...
image classification, urban, architecture, procedural reconstruction328 side-view color images of horses that were manually segmented. The images were randomly collected from the WWW.
segmentationThe video co-segmentation dataset contains 4 video sets which totally has 11 videos with 5 frames of each video labeled with the pixel-level ground-tr...
video, segmentation, co-segmentation, datasetCMP Dataset by Ondra Chum contains 5 million images collected from the internet.
image retrieval, urban, large scaleDaimler Multi-Cue, Occluded Pedestrian Classification Benchmark Training and test samples have a resolution of 48 x 96 pixels with a 12-pixel border a...
image classification, urban, pedestrian, object detectionThe SUNCG dataset is a Large 3D Model Repository for Indoor Scenes. SUNCG is an ongoing effort to establish a richly-annotated, large-scale dataset ...
scene, layout, recognition, indoor, object, segmentation, rendering, 3d, realism, room, syntheticThe Leeds Cows dataset by Derek Magee consists of 14 different video sequences showing a total of 18 cows walking from right to left in front of differe...
video, segmentation, detection, cow, animal, backgroundThe German Traffic Sign Recognition Benchmark is a dataset for multi-class detection problem in natural images and do cordially invite you to participat...
urban, traffic, recognition, detection, traffic signOur repetitive pattern dataset with 106 images of app. 30 buildings from Pankrac, Prague and Marseille appearing in more than one image, number of appea...
image retrieval, urban, symmetry, repetition, image classificationThe Dubrovnik6K and Rome16K datasets are image collections for SfM reconstruction, where the suffix refers to the number of images in the dataset. Dub...
urban, 3d reconstruction, dubrovnik, sfm, landmark, romeCMU/VMR Urban Image+Laser dataset contains 372 images linked with 3D laser points projections. There are additional images (due to the laser scanner bei...
urban, 3d reconstruction, laser, semantic segmentation, sfmThe Hopkins 155 Dataset has been created with the goal of providing an extensive benchmark for testing feature based motion segmentation algorithms. It ...
urban, optical flow, stereo estimation, motion segmentationMIT Pedestrian dataset from Papageorgiou and Poggio [IJCV2000] contains 509 training and 200 test images of pedestrians in city scenes (plus left-right ...
urban, pedestrian, boundingbox, frontview, people, object detectionThe DynTex dataset consists of a comprehensive set of Dynamic Textures. Dynamic, or temporal, texture is a spatially repetitive, time-varying visual pat...
segmentation, dynamic, video repetition, synthetic, textureThe MSRC vNIPS dataset is the MSRC v2 dataset with new annotations for much more accurate segmentations for 93 images. Efficient Inference in Fully Co...
semantic segmentation, semantic, outdoorThe Daimler Mono Pedestrian Classification Benchmark dataset consists of two parts: a base data set. The base data set contains a total of 4000 pedest...
illumination, object, urban, pedestrian, classification, outdoor, scaleContains hand-labelled pixel annotations for 38 groups of images, each group containing a common foreground. Approximately 17 images per group, 643 imag...
segmentationThe TUD Pedestrians dataset from Micha Andriluka, Stefan Roth and Bernt Schiele [AndrilukaCVPR2008] consists of 250 images with 311 fully visible people...
segmentation, pedestrian, sideview, object detectionBackground Models Challenge (BMC) is a complete dataset and competition for the comparison of background subtraction algorithms. The main topics concer...
motion, background, video, modeling, segmentation, change, surveillance, detectionThe contour patches dataset is a large dataset of images patch matches used for contour detection. References: C. L. Zitnick and D. Parikh The Role...
lowlevel, match, edge, image, contour, segmentation, patch, detectionGround truth database of 50 images with: Data, Segmentation, Labelling - Lasso, Labelling - Rectangle
segmentationThe Weizmann actions dataset by Blank, Gorelick, Shechtman, Irani, and Basri consists of ten different types of actions: bending, jumping jack, jumping,...
video, segmentation, action, action classificationGeometric Context Dataset: pixel labels for seven geometric classes for 300 images
segmentationPOS Labeled Faces in the Wild, a collection of face which is proposed for studying face identification in unconstrained environment, its purpose is serv...
face, recognition, wild, identification, registrationThe Street View Text (SVT) dataset contains 647 words and 3796 letters in 249 images harvested from Google Street View. The dataset is more challengin...
urban, text recognition, text detection, classification, outdoor