Humans have used sketching to depict our visual world since prehistoric times. Even today, sketching is possibly the only rendering technique readily available to all humans. This paper is the first large scale exploration of human sketches. We analyze the distribution of non-expert sketches of everyday objects such as teapot or car. We ask humans to sketch objects of a given category and gather 20,000 unique sketches evenly distributed over 250 object categories. With this dataset we perform a perceptual study and find that humans can correctly identify the object category of a sketch 73% of the time. We compare human performance against computational recognition methods. We develop a bag-of-features sketch representation and use multi-class support vector machines, trained on our sketch dataset, to classify sketches. The resulting recognition method is able to identify unknown sketches with 56% accuracy (chance is 0.4%). Based on the computational model, we demonstrate an interactive sketch recognition system. We release the complete crowd-sourced dataset of sketches to the community.
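The figures quoted above can be checked with a little arithmetic, and the bag-of-features idea can be illustrated with a toy example. The following is a minimal sketch, not the authors' implementation: the hard nearest-codeword quantization, the 2-D toy descriptors, the codebook, and the helper names `chance_accuracy` and `bag_of_features` are all assumptions made for illustration, and the SVM classification step is omitted.

```python
from math import dist

# Figures taken from the description above.
NUM_SKETCHES = 20_000
NUM_CATEGORIES = 250


def chance_accuracy(num_categories: int) -> float:
    """Accuracy of uniform random guessing over equally likely categories."""
    return 1.0 / num_categories


def bag_of_features(descriptors, codebook):
    """Normalized histogram of nearest-codeword assignments.

    Hard quantization is an assumption here; the description above does not
    say how local features are assigned to visual words.
    """
    counts = [0] * len(codebook)
    for d in descriptors:
        nearest = min(range(len(codebook)), key=lambda i: dist(d, codebook[i]))
        counts[nearest] += 1
    total = sum(counts)
    return [c / total for c in counts] if total else [0.0] * len(codebook)


# Evenly distributed sketches: 20,000 / 250 = 80 per category.
sketches_per_category = NUM_SKETCHES // NUM_CATEGORIES
# Chance level: 1 / 250 = 0.004, i.e. the 0.4% quoted above.
chance = chance_accuracy(NUM_CATEGORIES)

# Hypothetical toy data: three visual words and four local descriptors.
codebook = [(0.0, 0.0), (1.0, 1.0), (0.0, 1.0)]
descriptors = [(0.1, 0.0), (0.9, 1.1), (0.2, 0.1), (0.0, 0.9)]
histogram = bag_of_features(descriptors, codebook)
```

In a full pipeline, one such histogram per sketch would serve as the feature vector passed to a multi-class classifier.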
We introduce a benchmark for evaluating the performance of large scale sketch-based image retrieval systems. The necessary data is acquired in a control...
image retrieval, shape retrieval, partial, matching, sketch
The Tools 2D dataset from Bronstein, Bronstein, Bruckstein, and Kimmel [?] is intended for partial similarity experiments and consists of 15 shapes: 5 humans, 5 hor...
matching, binary, shape retrieval, partial
The Mythological Creatures dataset consists of articulated shapes (silhouettes) for partial similarity experiments and contains 15 shapes: 5 humans, 5 horses an...
binary, shape retrieval, partial, matching, animal
The Kimia 216 dataset has 18 classes, each consisting of 12 images. It contains shape silhouettes for birds, bones, brick, camels, car, children, classic cards,...
binary, shape retrieval, matching, animal, kimia
The SIID silhouette dataset contains... and is from the Shape Indexing of Image Database (SIID). Download SIID silhouette dataset http://www.lems.bro...
matching, binary, shape retrieval
The Kimia 25 consists of 6 classes and 25 images. They are part of the Shape Indexing of Image Database (SIID) project, which also contains the SIID sil...
matching, binary, kimia, shape retrieval
MPEG-7 Core Experiment CE-Shape-1 [?] is a popular database for shape matching evaluation consisting of 70 shape categories, where each category is repr...
matching, binary, shape retrieval, bullseye
The Kimia 99 has 9 classes, each consisting of 11 images. They are part of the Shape Indexing of Image Database (SIID) project, which also contains ...
matching, binary, kimia, shape retrieval
Detail 2D Projection DataSet is a database of 2D projections of mechanical details with holes. The dataset consists of 13 shape categories where each ca...
binary, shape retrieval, holes, detail, matching
The Symmetry set dataset is a collection of images at different illuminations for the purpose of image matching using local symmetry features. Image M...
urban, matching, lighting, image, illumination, building, feature, symmetry
Generalized Dual Bootstrap-ICP Algorithm
matching, panorama, image registration
ZuBuD+, created in February 2017 by Federico Magliani (University of Parma), introduces many query images balancing the class evaluated from the previou...
building, image retrieval, urban, landmark
The VidPairs dataset contains 133 pairs of images, taken from 1080p HD (~2 megapixel) official movie trailers. Each pair consists of images of the same ...
matching, dense, video, flow, description, patch, pair, optical
The Sheffield Building Image Dataset consists of over 3,000 low-resolution images of forty different buildings, typically between 70 and 120 images per buil...
image retrieval, image classification, urban, sheffield
The Tiny Images dataset consists of 79,302,017 images, each being a 32x32 color image. This data is stored in the form of large binary files which can b...
image retrieval, image classification, color, tiny
Multispectral Imaging (MSI) datasets were acquired using IRIS II, which is a lightweight portable system comprising a high resolution camera, a novel ...
illumination, wavelength, registration, alignment, matching, groundtruth, multi-spectral
The Extreme Zoom Dataset (EZD) comprises 6 image sets with increasing zoom factor, from a general scene view to a focus on a single detail. MODS: Fast and Robus...
description, detection, zoom, viewpoint, matching, feature
The COIL-100 (Columbia University Image Library) consists of 100 objects. For formal documentation look at the corresponding compressed technical report...
image retrieval, image classification
The Aachen dataset consists of 4479 images taken with multiple cameras (3GB), 369 query images taken with the camera of a mobile phone together with the...
image retrieval, 3d reconstruction, aachen, sfm, landmark
The Oxford Buildings dataset by James Philbin and Andrew Zisserman consists of 5062 images collected from Flickr by searching for particular Oxford land...
image retrieval, urban, oxford, landmark
The Compact Descriptors for Visual Search Patches Dataset (CDVS) is a dataset comprised of pairwise image patches. MPEG is a standard titled Compact De...
descriptor, mpeg, patch, retrieval, matching, feature
The Zurich Building dataset (ZuBuD) from Hao Shao, Tomas Svoboda and Luc Van Gool [?] contains 1005 images of 201 buildings, each in five views. There ...
image retrieval, urban, procedural, rectification
The Paris500k dataset consists of 501,356 geotagged images collected from Flickr and Panoramio. The dataset was collected from a geographic bounding box...
panoramio, paris, image retrieval, 3d reconstruction, geotag, flickr, landmark, sfm
The Paris dataset consists of 6412 images. Images have high resolution and are in JPEG format. http://www.robots.ox.ac.uk/~vgg/data/parisbuildings/pa...
image retrieval, urban, landmark, paris
http://www.cvlibs.net/datasets/kitti/eval_odometry.php Related Datasets: TUM RGB-D Dataset: Indoor dataset captured with Microsoft Kinect and high-ac...
urban path 3d reconstruction, registration, navigation, localization, matching, slam, odometry
FERG-DB is a database of stylized characters with annotated facial expressions. The database contains multiple face images of six stylized characters. T...
facial expressions, joy, cardinal classification, deep learning, stylization, animation, fear, human transfer, face, disgust, neutral, anger, annotation emotion, surprise, sad, image retrieval, facial expression
The 3DVis dataset includes a set of 12 heterogeneous scenes for testing 3D scene registration and analysis methods. Models include homogeneous shapes, r...
3d, registration, reconstruction, shape, matching, symmetry
The CMP map2photo dataset consists of 6 pairs, where one image is a satellite photo and the other is a map of the same area. The task is to match thes...
sensing, baseline, matching, description, map, feature, remote, detection, wide
This dataset provides a collection of web images and 3D models for research on landmark recognition (especially for methods based on 3D models). We hope...
codebook, reconstruction, matching, recognition, retrieval, 3d, classification, feature, flickr, landmark
CMP Dataset by Ondra Chum contains 5 million images collected from the internet.
image retrieval, urban, large scale
The UK Bench dataset from Henrik Stewenius and David Nister contains 10200 images in N=2550 groups of four images each, at size 640x480. The images are...
image retrieval, object, rotation, centered
We introduce a labeled dataset of categorized images for evaluating sketch-based image retrieval. Using Flickr, we downloaded about 3000 images for each...
saliency, internet, shape, sketch, visual, attention, group, retrieval, salient object detection
The ETHZ Shape classes dataset from Vittorio Ferrari [?] consists of five object classes and a total of 255 images. All classes contain significant intr...
clutter, swan, bottle, matching, nature, object detection by shape, mug, giraffe, segmentation, applelogo
Our repetitive pattern dataset contains 106 images of approx. 30 buildings from Pankrac, Prague and Marseille appearing in more than one image, number of appea...
image retrieval, urban, symmetry, repetition, image classification
The Wide (multiple) Baseline Dataset. 31 image pairs, simultaneously combining several nuisance factors: geometry, illumination, IR-visible, etc. WxBS...
description, night, viewpoint, matching, feature, detection, day, ir
15 wide baseline stereo image pairs with large viewpoint change and provided ground truth homographies. Image size: ~1000x700 pixels, RGB. D. Mishkin a...
description, wide baseline stereo, detection, viewpoint, matching, feature
The 3D shape description dataset consists of multiple sub-datasets. Descriptor Matching - Dataset 1 & 2 (Stanford): These datasets, created from some o...
description, 3d, benchmark, registration, reconstruction, shape, matching