We introduce a labeled dataset of categorized images for evaluating sketch based image retrieval. Using Flickr, we downloaded about 3000 images for each of the 5 keywords: butterfly, coffee mug, dog jump, giraffe, and plane, together comprising of about 15000 images. For each image, if there is a non-ambiguous object with correct content matching with the query keyword and most part of the object is visible, we mark such an object region. The salient regions are marked at a pixel level. We only label salient object region for objects with almost fully visible since partially occluded objects are is less useful for shape matching. The THUR15000 dataset do not contain a salient region labeled for every image in the dataset, i.e., some images may not have any salient region. This dataset is used to evaluate shape based image retrieval performance.
The THUS10000 benchmark dataset comprises of 10,000 images, each of which has an unambiguous salient object and the object region is accurately annotate...
saliency, segmentation, salient object detection, attention, visualAn Annotated Dataset For Near-Duplicate Detection In Personal Photo Collections Managing photo collections involves a variety of image quality assessm...
copyright, duplicate, detection, groundtruth, retrievalThis material is supplementary to Michael Stark, Bernt Schiele. How Good are Local Features for Classes of Geometric Objects. Eleventh IEEE Internat...
object, binary, tool, classification, shapeHumans have used sketching to depict our visual world since prehistoric times. Even today, sketching is possibly the only rendering technique readily av...
image retrieval, shape retrieval, partial, matching, sketchThe ImageNET dataset is the latest dataset by Li Fei-Fei containing various dataset ranging from 1000 to 10000 categories.
image classification, object segmentation, retrievalThe FlickrLogos-32 dataset contains photos showing brand logos and is meant for the evaluation of multi-class logo recognition as well as logo retrieval...
classification brand boundingbox, retrieval, object recognition, machine learning, logo, detection, image, flickrThe Multi-FoV synthetic datasets are two synthetic scenes (vehicle moving in a city, and flying robot hovering in a confined room). For each scene, thre...
synthetic, visual, odometry, fov, blender, camera, groundtruthGroup emotion recognition in images - Happiness Intensity labels for group of people in images. The images have been collected from Flickr using keyword...
emotion, wild, flickr, behavior, group, human, facial expressionThis work attempts to provide two Hand Images Databases for hand biometrics: one is created using a mobile phone camera of modest quality, which we ca...
segmentation, person, identification, authentication, mobile, shape, biometric/ hand geometry, webcamThe Compact Descriptors for Visual Search Patches Dataset (CDVS) is a dataset comprised of pairwise image patches. MPEG is a standard titled Compact De...
descriptor, mpeg, patch, retrieval, matching, featureThe Caltech Buildings dataset consists of images taken for 50 buildings around the Caltech campus. Five different images were taken for each building fr...
building, caltech, urban, retrieval, taxonomy, hierarchyThe domain-specific personal videos highlight dataset from the paper [1] describes a fully automatic method to train domain-specific highlight ranker f...
saliency, domain, wearable, human, recognition, action, video, summarizationThe 3DVis dataset includes a set of 12 heterogeneous scenes for testing 3D scene registration and analysis methods. Models include homogeneous shapes, r...
3d, registration, reconstruction, shape, matching, symmetryThe RGB-D Person Re-identification dataset is for person re-identification using depth information. The main motivation is that the standard techniques ...
pedestrian, 3d, identification, classification, depth, shapeThis dataset provides a collection of web images and 3D models for research on landmark recognition (especially for methods based on 3D models). We hope...
codebook, reconstruction, matching, recognition, retrieval, 3d, classification, feature, flickr, landmarkThe Webcam Interestingness dataset consists of 20 different webcam streams, with 159 images each. It is annotated with interestingness ground truth, acq...
video, interest, retrieval, classification, weather, ranking, webcamThe Google Street View dataset contains 62,058 high quality Google Street View images. The images cover the downtown and neighboring areas of Pittsburgh...
pittsburgh, urban, manhattan, sphere, address, panorama, google, streetview, gps, retrieval, localizationThe 3D shape description dataset consists of multiple sub-datasets Descriptor Matching - Dataset 1 & 2 (Stanford) These datasets, created from some o...
description, 3d, benchmark, registration, reconstruction, shape, matchingThe VOT2016 pixel-wise annotations dataset contains pixel-wise per-frame annotations for sequences from VOT2016 dataset. The annotation is in a form of ...
object, segmentation, annotation, mask, visual, trackingThe Salient Montages is a human-centric video summarization dataset from the paper [1]. In [1], we present a novel method to generate salient montages...
video, saliency, wearable, montage, summarization, humanThe Caltech Game Covers dataset consists of CD/DVD covers of video games. The set was downloaded from freecovers.net during the summer of 2008. The set ...
caltech, retrieval, game, cover, classification, hierarchy, taxonomyThe San Francisco Landmark Dataset for Mobile Landmark Recognition is a set of images and query images for localization. We present the San Francisco ...
urban, mobile, sanfrancisco, gps, retrieval, localization, landmark, city, calibrationWe introduce a benchmark for evaluating the performance of large scale sketch-based image retrieval systems. The necessary data is acquired in a control...
image retrieval, shape retrieval, partial, matching, sketchYahoo Flickr Creative Commons 100M (YFCC100M) dataset contains a list of photos and videos. This list is compiled from data available on Yahoo! Flickr. ...
internet, reconstruction, recognition, image, community, social, 3d, clustering, detection, flickr, landmarkWe would like to announce the release of PASCAL-Context dataset. We augmented PASCAL VOC 2010 dataset with annotations for 400+ additional categories. I...
segmentation, benchmark, shape, recognition, pascal, category, semantic, dense