EITZ Sketch Quality

Vision Dataset

Homepage

http://cybertron.cg.tu-berlin.de/eitz/projects/classifysketch/

Description

Humans have used sketching to depict our visual world since prehistoric times. Even today, sketching is possibly the only rendering technique readily available to all humans. This paper is the first large-scale exploration of human sketches. We analyze the distribution of non-expert sketches of everyday objects such as "teapot" or "car". We ask humans to sketch objects of a given category and gather 20,000 unique sketches evenly distributed over 250 object categories. With this dataset we perform a perceptual study and find that humans can correctly identify the object category of a sketch 73% of the time. We compare human performance against computational recognition methods. We develop a bag-of-features sketch representation and use multi-class support vector machines, trained on our sketch dataset, to classify sketches. The resulting recognition method is able to identify unknown sketches with 56% accuracy (chance is 0.4%). Based on the computational model, we demonstrate an interactive sketch recognition system. We release the complete crowd-sourced dataset of sketches to the community.
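The figures quoted in the description are mutually consistent, and a few lines of arithmetic make that explicit. The snippet below is only an illustrative check using the numbers stated above; the variable names are chosen here for readability and do not come from the dataset or the paper's code.

```python
# Figures taken from the description above; names are illustrative only.
NUM_SKETCHES = 20_000
NUM_CATEGORIES = 250
SVM_ACCURACY = 0.56

# Sketches are evenly distributed, so each category gets the same share.
sketches_per_category = NUM_SKETCHES // NUM_CATEGORIES

# Guessing uniformly at random among the categories gives the chance level.
chance_accuracy = 1 / NUM_CATEGORIES

print(f"sketches per category: {sketches_per_category}")
print(f"chance accuracy: {chance_accuracy:.1%}")
print(f"SVM accuracy relative to chance: {SVM_ACCURACY / chance_accuracy:.0f}x")
```

With an even split, each of the 250 categories holds 80 sketches, and uniform guessing yields the 0.4% chance level cited in the description.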

Related datasets

EITZ Sketch-Based Image Retri…

We introduce a benchmark for evaluating the performance of large scale sketch-based image retrieval systems. The necessary data is acquired in a controlle…

image retrieval, matching, partial, shape retrieval, sketch

Vision

Tools2D

The Tools 2D dataset from Bronstein, Bronstein, Bruckstein, and Kimmel [?] is intended for partial similarity experiments and consists of 15 shapes: 5 humans, 5 horse…

binary, matching, partial, shape retrieval

Vision

Mythological Creatures

The Mythological Creatures dataset consists of articulated shapes (silhouettes) for partial similarity experiments and contains 15 shapes: 5 humans, 5 horses and …

animal, binary, matching, partial, shape retrieval

Vision

SIID

The SIID silhouette dataset contains... and is from the Shape Indexing of Image Database (SIID). Download SIID silhouette dataset http://www.lems.brown…

binary, matching, shape retrieval

Vision

MPEG-7 Core Experiment CE-Sha…

MPEG-7 Core Experiment CE-Shape-1 [?] is a popular database for shape matching evaluation consisting of 70 shape categories, where each category is repres…

binary, bullseye, matching, shape retrieval

Vision

KIMIA25

The Kimia 25 dataset consists of 6 classes and 25 images. It is part of the Shape Indexing of Image Database (SIID) project, which also contains the SIID silho…

binary, kimia, matching, shape retrieval

Vision

KIMIA99

The Kimia 99 dataset has 9 classes, each consisting of 11 images. It is part of the Shape Indexing of Image Database (SIID) project, which also contains th…

binary, kimia, matching, shape retrieval

Vision

Detail 2D Projection DataSet

Detail 2D Projection DataSet is a database of 2d projections of mechanical details with holes. The dataset consists of 13 shape categories where each cate…

binary, detail, holes, matching, shape retrieval

Vision

KIMIA216

The Kimia 216 dataset has 18 classes, each consisting of 12 images. It contains shape silhouettes for birds, bones, brick, camels, car, children, classic cards, e…

animal, binary, kimia, matching, shape retrieval

Vision

COIL-100

The COIL-100 (Columbia University Image Library) consists of 100 objects. For formal documentation look at the corresponding compressed technical report, …

image classification, image retrieval

Vision

ETHZ Shape

The ETHZ Shape classes dataset from Vittorio Ferrari [?] consists of five object classes and a total of 255 images. All classes contain significant intra-…

applelogo, bottle, clutter, giraffe, matching, mug, nature, object detection by shape, segmentation, swan

Vision

3DVis

The 3DVis dataset includes a set of 12 heterogeneous scenes for testing 3D scene registration and analysis methods. Models include homogeneous shapes, rep…

3d, matching, reconstruction, registration, shape, symmetry

Vision

ZuBud

The Zurich Building dataset (ZuBud) from Hao Shao, Tomas Svoboda and Luc Van Gool [?] contains 1005 images of 201 buildings, each in five views. There is…

image retrieval, procedural, rectification, urban

Vision

Visual Search Patches

The Compact Descriptors for Visual Search Patches Dataset (CDVS) is a dataset comprised of pairwise image patches. MPEG is a standard titled Compact Desc…

descriptor, feature, matching, mpeg, patch, retrieval

Vision

THUR15000

We introduce a labeled dataset of categorized images for evaluating sketch based image retrieval. Using Flickr, we downloaded about 3000 images for each o…

attention, group, internet, retrieval, saliency, salient object detection, shape, sketch, visual

Vision

CMP Extreme View Dataset

15 wide baseline stereo image pairs with large viewpoint change, provided ground truth homographies. Image size (~1000x700 pixels, RGB) D. Mishkin and…

description, detection, feature, matching, viewpoint, wide baseline stereo

Vision

Tiny Images

The Tiny Images dataset consists of 79,302,017 images, each being a 32x32 color image. This data is stored in the form of large binary files which can be …

color, image classification, image retrieval, tiny

Vision

image panorama gdbicp

Generalized Dual Bootstrap-ICP Algorithm

image registration, matching, panorama

Vision

Pankrac Marseille

Our repetitive pattern dataset with 106 images of approx. 30 buildings from Pankrac, Prague and Marseille appearing in more than one image, number of appeara…

image classification, image retrieval, repetition, symmetry, urban

Vision

Multispectral Imaging (MSI)

Multispectral Imaging (MSI) datasets were acquired using IRIS II, which is a lightweight portable system comprising a high resolution camera, a novel fi…

alignment, groundtruth, illumination, matching, multi-spectral, registration, wavelength

Vision

CMP map2photo

The CMP map2photo dataset consists of 6 pairs, where one image is a satellite photo and the other is a map of the same area. The task is to match these …

baseline, description, detection, feature, map, matching, remote, sensing, wide

Vision

Facial Expression Research Gr…

FERG-DB is a database of stylized characters with annotated facial expressions. The database contains multiple face images of six stylized characters. The…

anger, animation, annotation emotion, cardinal classification, deep learning, disgust, face, facial expression, facial expressions, fear, human transfer, image retrieval, joy, neutral, sad, stylization, surprise

Vision

Aachen Retrieval

The Aachen dataset consists of 4479 images taken with multiple cameras (3GB), 369 query images taken with the camera of a mobile phone together with their…

3d reconstruction, aachen, image retrieval, landmark, sfm

Vision

SHOT 3D shape description

The 3D shape description dataset consists of multiple sub-datasets Descriptor Matching - Dataset 1 & 2 (Stanford) These datasets, created from some of …

3d, benchmark, description, matching, reconstruction, registration, shape

Vision

Paris Retrieval

The Paris dataset consists of 6412 images. Images have high resolution and are in JPEG format. http://www.robots.ox.ac.uk/~vgg/data/parisbuildings/pari…

image retrieval, landmark, paris, urban

Vision

CMP WxBS dataset

The Wide (multiple) Baseline Dataset. 31 image pairs, simultaneously combining several nuisance factors: geometry, illumination, IR-visible, etc. WxBS: …

day, description, detection, feature, ir, matching, night, viewpoint

Vision

CMP Retrieval

CMP Dataset by Ondra Chum contains 5 million images collected from the internet.

image retrieval, large scale, urban

Vision

Oxford Buildings

The Oxford Buildings dataset by James Philbin and Andrew Zisserman consists of 5062 images collected from Flickr by searching for particular Oxford landma…

image retrieval, landmark, oxford, urban

Vision

Sheffield Building

The Sheffield Building Image Dataset consists of over 3,000 low-resolution images of forty different buildings, typically between 70 and 120 images per buildi…

image classification, image retrieval, sheffield, urban

Vision

UK Bench

The UK Bench dataset from Henrik Stewenius and David Nister contains 10200 images in N=2550 groups of four images each, at size 640x480. The images are r…

centered, image retrieval, object, rotation

Vision

VidPairs

The VidPairs dataset contains 133 pairs of images, taken from 1080p HD (~2 megapixel) official movie trailers. Each pair consists of images of the same sc…

dense, description, flow, matching, optical, pair, patch, video

Vision

KITTI Odometry

http://www.cvlibs.net/datasets/kitti/eval_odometry.php Related Datasets TUM RGB-D Dataset: Indoor dataset captured with Microsoft Kinect and high-accu…

localization, matching, navigation, odometry, registration, slam, urban path 3d reconstruction

Vision

CMP Extreme Zoom Dataset

The Extreme Zoom Dataset. EZD consists of 6 image sets with increasing zoom factor, from a general scene view to a focus on a single detail. MODS: Fast and Robust …

description, detection, feature, matching, viewpoint, zoom

Vision

Paris500k

The Paris500k dataset consists of 501,356 geotagged images collected from Flickr and Panoramio. The dataset was collected from a geographic bounding box r…

3d reconstruction, flickr, geotag, image retrieval, landmark, panoramio, paris, sfm

Vision

Symmetry Set

The Symmetry set dataset is a collection of images at different illuminations for the purpose of image matching using local symmetry features. Image Mat…

building, feature, illumination, image, lighting, matching, symmetry, urban

Vision

Landmark 3D

This dataset provides a collection of web images and 3D models for research on landmark recognition (especially for methods based on 3D models). We hope i…

3d, classification, codebook, feature, flickr, landmark, matching, recognition, reconstruction, retrieval

Vision

ZuBuD+

ZuBuD+, created in February 2017 by Federico Magliani (University of Parma), introduces many query images balancing the class evaluated from the previous …

building, image retrieval, landmark, urban

Vision