The UK Bench dataset from Henrik Stewenius and David Nister contains 10200 images of N=2550 groups with each four images at size 640x480. The images are rotated, blurred and have a tendancy for computer science motives. The original paper is based on two subsets of 1400 and 6376, due to lack of images and efficient implementations. The dataset is typically used for image retrieval, where one image of a group is used as query and the score how many of the other images are in the top-4 rank.
The CALTECH 101 dataset by Li Fei-Fei contains images for 101 categories with about 40 to 800 images per category. Most categories have about 50 images ...
object, natural-image, centered, scene, image classificationThe CALTECH 256 dataset by Li Fei-Fei contains 30607 images for 256 categories.
object, detection, image, centered, classification, sceneThe ICG Multi-Camera datasets consist of Easy Data Set (just one person) Medium Data Set (3-5 persons, used for the experiments) Hard Data Set (cro...
graz, indoor, video, object, pedestrian, multiview, tracking, camera, multitarget, detection, calibrationThe SegTrack dataset consists of six videos (five are used) with ground truth pixelwise segmentation (6th penguin is not usable). The dataset is used fo...
motion, video, object, proposal, flow, segmentation, stationary, model, camera, optical, groundtruthThe BEOID dataset includes object interactions ranging from preparing a coffee to operating a weight lifting machine and opening a door. The dataset is ...
video, object, egocentric, 3d, interaction, pose, trackingThe Microsoft COCO (mscoco) is an image recognition and segmentation dataset which contains more 300k images for more than 70 categories. Other featur...
object, segmentation, benchmark, semantic, context, recognition, detectionOur repetitive pattern dataset with 106 images of app. 30 buildings from Pankrac, Prague and Marseille appearing in more than one image, number of appea...
image retrieval, urban, symmetry, repetition, image classificationMany different labeled video datasets have been collected over the past few years, but it is hard to compare them at a glance. So we have created a hand...
video, object, benchmark, classification, recognition, detection, actionFor the first few decades of the fields existence, computer vision has been focused on algorithmic, logical approaches to perception. But it was only wi...
object, 3d, kinect, reconstruction, depth, recognition, indoorThe Freiburg-Berkeley Motion Segmentation Dataset (FBMS-59) is an extension of the BMS dataset with 33 additional video sequences. A total of 720 frames...
motion, benchmark, video, object, pedestrian, segmentation, tracking, groundtruthThis data set comprises 144 images of an edge profile cutting head of a milling machine. The head tool contains a total of 30 cutting inserts. The cutti...
profile, head, cutting, edge, tools, inserts, object, tool, milling, localization, wear, monitoringSheffield Building Image Dataset consists of over 3,000 low-resolution images of forty different buildings typically between 70 and 120 images per buil...
image retrieval, image classification, urban, sheffieldThis material is supplementary to Michael Stark, Bernt Schiele. How Good are Local Features for Classes of Geometric Objects. Eleventh IEEE Internat...
object, binary, tool, classification, shapeHumans have used sketching to depict our visual world since prehistoric times. Even today, sketching is possibly the only rendering technique readily av...
image retrieval, shape retrieval, partial, matching, sketchThe Tiny Images dataset consists of 79,302,017 images, each being a 32x32 color image. This data is stored in the form of large binary files which can b...
image retrieval, image classification, color, tinyThe COIL-100 (Columbia University Image Library) consists of 100 objects. For formal documentation look at the corresponding compressed technical report...
image retrieval, image classificationThe ICG Lab 6 (Multi-Camera Multi-Object Tracking) dataset contains 6 indoor people tracking scenarios recorded at our laboratory using 4 static Axis P1...
evaluation, graz, object, laboratory, pedestrian, segmentation, multiview, tracking, camera, detection, calibrationThe Video Segmentation Benchmark (VSB100) provides ground truth annotations for the Berkeley Video Dataset, which consists of 100 HD quality videos divi...
video, object, segmentation, motion, pedestrian, benchmark, tracking, groundtruthThe Comprehensive Cars (CompCars) dataset contains data from two scenarios, including images from web-nature and surveillance-nature. The web-nature dat...
object, urban, fine-grained, classification, recognition, vehicle, car, attributeThe Aachen dataset consists of 4479 images taken with multiple cameras (3GB), 369 query images taken with the camera of a mobile phone together with the...
image retrieval, 3d reconstruction, aachen, sfm, landmarkLASIESTA is composed by many real indoor and outdoor sequences organized in different categories, each of one covering a specific challenge in moving ob...
motion, subtraction, dataset, background, object, stationary, foreground, camera, challenge, detection, groundtruthThe Daimler Mono Pedestrian Detection Benchmark dataset contains a large training and test set. The training set contains 15.560 pedestrian samples (ima...
object, mono, urban, pedestrian, outdoor, scale, detectionThe Aspect Layout dataset is designed to allow evaluation of object detection for aspect ratios in perspective images. Author text: In this project ...
object, detection, aspect, perspective, ratio, layoutThe Oxford Buildings dataset by James Philbin and Andrew Zisserman consists of 5062 images collected from Flickr by searching for particular Oxford land...
image retrieval, urban, oxford, landmarkSome datasets and evaluation tools are provided on this page for four different computer vision and computer graphics problems. Population counting L...
urban, surface, reconstruction, pointcloud, object, road, pedestrian, network, line, 3d, crowd, counting, detection, groundtruthWe introduce a benchmark for evaluating the performance of large scale sketch-based image retrieval systems. The necessary data is acquired in a control...
image retrieval, shape retrieval, partial, matching, sketchThe Zurich Building dataset (ZuBud) from Hao Shao, Tomas Svoboda and Luc Van Gool [?] contains 1005 images with 201 buildings each in five views. There ...
image retrieval, urban, procedural, rectificationThe CIFAR-10 dataset consists of 60000 32x32 colour images in 10 classes, with 6000 images per class. There are 50000 training images and 10000 test ima...
object, color, patch, scene, tiny, image classificationTh EPFL Multi-View Car dataset contains 20 sequences of cars as they rotate by 360 degrees. There is one image approximately every 3-4 degrees. Using th...
detection, estimation, car, pose, multiview, rotationThe SUNCG dataset is a Large 3D Model Repository for Indoor Scenes. SUNCG is an ongoing effort to establish a richly-annotated, large-scale dataset ...
scene, layout, recognition, indoor, object, segmentation, rendering, 3d, realism, room, syntheticThe Paris500k dataset consists of 501,356 geotagged images collected from Flickr and Panoramio. The dataset was collected from a geographic bounding box...
panoramio, paris, image retrieval, 3d reconstruction, geotag, flickr, landmark, sfmThe Paris dataset consists of 6412 images. Images have high resolution and are in JPEG format. http://www.robots.ox.ac.uk/~vgg/data/parisbuildings/pa...
image retrieval, urban, landmark, parisThe Multi-illuminant Image Sequences dataset contains 16 video sequences (13 with single light source and 3 with two global light sources), recorded wi...
constancy, color, white, chromaticity, physics, nature, dichromatic, illumination, object, balance, lightScanNet is an RGB-D video dataset containing 2.5 million views in more than 1500 scans, annotated with 3D camera poses, surface reconstructions, and ins...
scene, layout, recognition, indoor, object, cad, segmentation, rendering, 3d, realism, room, syntheticThe PASCAL VOC is augmented with segmentation annotation for semantic parts of objects. For example, for the person category, we provide segmentation ma...
part, human, recognition, object, pedestrian, segmentation, pascal, detection, semanticThe UCF Person and Car VideoSeg dataset consists of six videos with groundtruth for video object segmentation. Surfing, jumping, skiing, sliding, big ...
video, object, segmentation, motion, model, camera, groundtruthWe present the 2017 DAVIS Challenge, a public competition specifically designed for the task of video object segmentation. Following the footsteps of ot...
code, quality, benchmark, video segmentation, object, segmentation, hd, tracking, resolutionThe Farman Institute 3D Point Sets dataset contains 11 objects by a 3D laser scanner. This dataset was peer-reviewed by Image Processing On Line: Farman...
object, scanner, 3d, reconstruction, point, model, laserFERG-DB is a database of stylized characters with annotated facial expressions. The database contains multiple face images of six stylized characters. T...
facial expressions, joy, cardinal classification, deep learning, stylization, animation, fear, human transfer, face, disgust, neutral, anger, annotation emotion, surprise, sad, image retrieval, facial expressionThis dataset package contains the software and data used for Detection-based Object Labeling on the RGB-D Scenes Dataset as implemented in the paper: ...
object, 3d, kinect, reconstruction, depth, recognition, indoorThe GaTech VideoSeg dataset consists of two (waterski and yunakim?) video sequences for object segmentation. There exists no groundtruth segmentation ...
video, object, segmentation, motion, model, cameraCMP Dataset by Ondra Chum contains 5 million images collected from the internet.
image retrieval, urban, large scaleA dataset acquired with 3 synchronized sensors (Primesense Carmine 1.09, Microsoft Kinect v2, Canon IXUS 950 IS), featuring: * 30 industry-relevant ob...
object, rgbd, 3d, estimation, pose, texture-lessThe TRaffic ANd COngestionS (TRANCOS) dataset, a novel benchmark for (extremely overlapping) vehicle counting in traffic congestion situations. It consi...
urban, highway, spain, object, traffic, transportation, vehicle, detection, carThe VOT2016 pixel-wise annotations dataset contains pixel-wise per-frame annotations for sequences from VOT2016 dataset. The annotation is in a form of ...
object, segmentation, annotation, mask, visual, trackingZuBuD+, created in February 2017 by Federico Magliani (University of Parma), introduces many query images balancing the class evaluated from the previou...
building, image retrieval, urban, landmarkThe ICG Multi-Camera and Virtual PTZ dataset contains the video streams and calibrations of several static Axis P1347 cameras and one panoramic video fr...
graz, outdoor, video, object, panorama, pedestrian, network, crowd, multiview, tracking, camera, multitarget, detection, calibrationThe Visual Attributes dataset contains visual attribute annotations for over 500 object classes (animate and inanimate) which are all represented in Ima...
object, recognition, attribute, classification, imagenetThe Daimler Mono Pedestrian Classification Benchmark dataset consists of two parts: a base data set. The base data set contains a total of 4000 pedest...
illumination, object, urban, pedestrian, classification, outdoor, scaleThe YouTube-Objects dataset is composed of videos collected from YouTube by querying for the names of 10 object classes. It contains between 9 and 24 vi...
video, object, flow, segmentation, detection, opticalThe dataset contains 15 documentary films that are downloaded from YouTube, whose durations vary from 9 minutes to as long as 50 minutes, and the total ...
video, object, detectionThe KTH Multiview Football dataset contains 771 images of football players includes images taken from 3 views at 257 time instances 14 annotated body jo...
recognition, soccer, outdoor, object, pedestrian, game, pose, multiview, tracking, camera, multitarget, detection