MICCAI 2015 Challenge on Liver Ultrasound Tracking Munich, October 9, 2015 (Full Day) Outline Ultrasound (US) imaging is a widely used medical imaging technique. As US has high temporal resolution and is non-invasive, it is an appealing choice for applications which require tracking and tissue motion analysis, such as motion compensation in image-guided intervention and therapy. Specifically, we want to address the issue of respiratory motion in the liver. While there is a large number of relevant works in motion tracking, it is difficult to compare the reported tracking strategies. Critical factors are the lack of a public dataset, the variation in tracking objective and validation strategies. The aim of the challenge is to present the current state-of-the-art in automated tracking of anatomical landmarks in the liver and compare different methods
The High Definition Analytics (HDA) dataset is a multi-camera High-Resolution image sequence dataset for research on High-Definition surveillance: Pedes...
high-definition, benchmark, human, lisbon, indoor, video, re-identification, pedestrian, network, multiview, tracking, surveillance, camera, detectionWe collected a video dataset, termed ChokePoint, designed for experiments in person identification/verification under real-world surveillance conditions...
face, real, human, recognition, world, pedestrian, identification, clustering, multiview, surveillance, detection, sequenceThe Western GCO Segmentation problem instances are provided to compare effects of graph size, neighborhood size, length of s to t paths, regional arc co...
face, adhead, abdomen, liver, binary, medical, segmentation, optimization, bone, babyfaceWelcome to the homepage of the gvvperfcapeva datasets. This site serves as a hub to access a wide range of datasets that have been created for projects ...
face, reconstruction, depth, mesh, human, action, video, pose, multiview, trackingThe Freiburg-Berkeley Motion Segmentation Dataset (FBMS-59) is an extension of the BMS dataset with 33 additional video sequences. A total of 720 frames...
motion, benchmark, video, object, pedestrian, segmentation, tracking, groundtruthThe Video Segmentation Benchmark (VSB100) provides ground truth annotations for the Berkeley Video Dataset, which consists of 100 HD quality videos divi...
video, object, segmentation, motion, pedestrian, benchmark, tracking, groundtruthThe UrbanStreet dataset used in the paper can be downloaded here [188M] . It contains 18 stereo sequences of pedestrians taken from a stereo rig mounted...
urban, human, recognition, video, pedestrian, segmentation, tracking, multitarget, detectionChairGest is an open challenge / benchmark. The task consists in spotting and recognizing gestures from multiple synchronized sensors: 1 Kinect and 4 X...
gesture, detection, benchmark, kinect, recognition, humanWe present the 2017 DAVIS Challenge, a public competition specifically designed for the task of video object segmentation. Following the footsteps of ot...
code, quality, benchmark, video segmentation, object, segmentation, hd, tracking, resolutionThe PETS 2009 dataset contains 3 parts showing multi-view sequences containing pedestrians walking in an outdoor environment. The parts are used for per...
overlap, human, frontview, occlusion multitarget, outdoor, pedestrian, tracking, detectionThe MOT Challenge is a framework for the fair evaluation of multiple people tracking algorithms. In this framework we provide: - A large collection of...
multiple, benchmark, evaluation, benhttp://motchallenge.net/chmark, dataset, target, video, pedestrian, 3d, tracking, surveillance, peopleThe Video Summarization (SumMe) dataset consists of 25 videos, each annotated with at least 15 human summaries (390 in total). The data consists of vide...
video, benchmark, summary, event, human, groundtruth, actiont is composed of food intake movements, recorded with Kinect V1 (320240 depth frame resolution), simulated by 35 volunteers for a total of 48 tests. The...
kinect, age, intake, pointcloud, human, tracking, monitoring, groundtruth, food, behaviorAWS hosts a variety of public datasets that anyone can access for free. Previously, large datasets such as satellite imagery or genomic data have requi...
space, human, recognition, image, amazon, satellite, segmentation, learning, deep, classification, biology, resolutionThe ICG Lab 6 (Multi-Camera Multi-Object Tracking) dataset contains 6 indoor people tracking scenarios recorded at our laboratory using 4 static Axis P1...
evaluation, graz, object, laboratory, pedestrian, segmentation, multiview, tracking, camera, detection, calibrationThe database contains, for each of the 100 examples: (1) the uncompressed frames, up to the 10th frame after the appearance of the 8th cell; (2) a text ...
trajectory, circle, mouse, biology, cell, trackingScene Parsing Benchmark Scene parsing data and part segmentation data derived from ADE20K dataset could be download from MIT Scene Parsing Benchmark. ...
segmentation, annotation, benchmark, semantic, scene, recognitionGroup emotion recognition in images - Happiness Intensity labels for group of people in images. The images have been collected from Flickr using keyword...
emotion, wild, flickr, behavior, group, human, facial expressionThe TUD Crossing dataset from Micha Andriluka, Stefan Roth and Bernt Schiele consists of 201 images with 1008 highly overlapping pedestrians with signif...
urban, sideview, overlap, segmentation, pedestrian, tracking, multitarget, detectionThe Buffy dataset contains images selected from the TV series, Buffy: the Vampire Slayer. We select a set of 452 images from the first two episodes for ...
segmentation, human, buffy, movie, object detectionThe multi-modal/multi-view datasets are created in a cooperation between University of Surrey and Double Negative within the EU FP7 IMPART project. Th...
rgbd, color, dynamic, multi-view, action, outdoor, video, 3d, face, emotion, lidar, human, indoor, multi-mode, modelThe dataset captures 25 people preparing 2 mixed salads each and contains over 4h of annotated accelerometer and RGB-D video data. Annotated activities ...
video, activity, classification, tracking, recognition, detection, actionThe procedural texture perceptual similarity dataset contains a list of procedural textures along with their pairwise distances, as defined by a percept...
study, benchmark, procedural, textureThe domain-specific personal videos highlight dataset from the paper [1] describes a fully automatic method to train domain-specific highlight ranker f...
saliency, domain, wearable, human, recognition, action, video, summarizationScene Background Initialization (SBI) dataset The SBI dataset has been assembled in order to evaluate and compare the results of background initializa...
change, detection, benchmark, background, foreground, initializationThe Traffic Video dataset consists of X video of an overhead camera showing a street crossing with multiple traffic scenarios. The dataset can be down...
video, urban, traffic, road, overhead, tracking, view, detectionThe Multicamera Human Action Video Data (MuHAVi) Manually Annotated Silhouette Data (MAS) are two datasets consisting of selected action sequences for ...
video, segmentation, action, behavior, human, backgroundThe Stanford 40 Actions dataset contains images of humans performing 40 actions. In each image, we provide a bounding box of the person who is performin...
recognition, human, detection, action, boundingboxThe dataset consists of eight unique scenes in crowded spaces such as a university campus or the sidewalks of a busy street.
trackingDB Contains 100 examples with the uncompressed frames, up to the 10th frame after the appearance of the 8th cell; a text file with the trajectories of a...
medicalCollection of endoscopic and laparoscopic (mono/stereo) videos and images
medicalIt is composed of ADL (activity daily living) and fall actions simulated by 11 volunteers. The people involved in the test are aged between 22 and 39, w...
wearable, kinect, fall detection - adl, depth, human, recognition, action, accelerometer, videoMIT traffic data set is for research on activity analysis and crowded scenes. It includes a traffic video sequence of 90 minutes long. It is recorded by...
trackingThe Pittsburgh Fast-food Image dataset (PFID) consists of 4545 still images, 606 stereo pairs, 3033600 videos for structure from motion, and 27 privacy-...
video, laboratory, classification, reconstruction, real, food, recognitionThe object is a plaster dinosaur (stegosaurus). Click on thumbnail for a full-sized (640x480) image. Resolution of ground truth model: 0.00025m (you may...
3d reconstruction, 3d, benchmark, sfm, multiviewToday, we introduce Open Images, a dataset consisting of ~9 million URLs to images that have been annotated with labels spanning over 6000 categories. W...
annotation, deep, classification, real, large-scale, image, category, automatic3 datasets: PTZ Tracking, Thermal-visible registration, Single object tracking
tracking, pedestrian, thermal, ptzThe TUG (Timed Up and Go test) dataset consists of actions performed three times by 20 volunteers. The people involved in the test are aged between 22 a...
wearable, kinect, time, human, recognition, action, depth image processing - tug, accelerometer, videoThe ICG Multi-Camera and Virtual PTZ dataset contains the video streams and calibrations of several static Axis P1347 cameras and one panoramic video fr...
graz, outdoor, video, object, panorama, pedestrian, network, crowd, multiview, tracking, camera, multitarget, detection, calibrationThe Salient Montages is a human-centric video summarization dataset from the paper [1]. In [1], we present a novel method to generate salient montages...
video, saliency, wearable, montage, summarization, humanWe share our omnidirectional and panoramic image dataset (with annotations) to be used for human and car detection. Please reach through: http://cvrg.i...
panorama, detection, car, omnidirection, recognition, humanThe TU Berlin Multi-Object and Multi-Camera Tracking Dataset (MOCAT) is a synthetic dataset to train and test tracking and detection systems in a virtua...
evaluation, multi-view, pedestrian, animal, tracking, multi-class, vehicle, detection, syntheticISPRS and EuroSDR - Benchmark on High Density Aerial Image Matching Background and Scope of the project Innovations in matching algorithms as well a...
urban, benchmark, reconstruction, aerial, photogrammetry, germany, 3d, multiview, switzerland, cityJPL First-Person Interaction dataset (JPL-Interaction dataset) is composed of human activity videos taken from a first-person viewpoint. The dataset par...
video, motion, action, interactive, recognition, humanThe Our Database of Faces (ORL) dataset contains ten different images of each of 40 distinct subjects. For some subjects, the images were taken at diffe...
illumination, face, recognition, human, expressionCOCO-Stuff augments the COCO dataset with pixel-level stuff annotations for 10,000 images. These annotations can be used for scene understanding tasks l...
annotation, benchmark, coco, segmentation, things, captioning, stuff, groundtruth, semanticThe Microsoft Research Cambridge-12 Kinect gesture dataset consists of sequences of human movements, represented as body-part locations, and the associa...
gesture, recognition, human, action, kinectWe wanted to have a collection of action recognition papers and results that everybody can use for reference. The site will work by the community princi...
recognition, benchmark, action, datasetSVHN is a real-world image dataset for developing machine learning and object recognition algorithms with minimal requirement on data preprocessing and ...
urban, real, recognition, text, streetside, world, streetview, classification, detection, numberRobust Multi-Person Tracking from Mobile Platforms In all cases, data was recorded using a pair of AVT Marlins F033C mounted on a chariot respectively...
tracking, pedestrian, color, sequenceThe Outex dataset is part of a framework for empirical evaluation of texture classification and segmentation algorithms. The framework is being constru...
segmentation, benchmark, classification, synthetic, textureLow-resolution RGB videos + ground truth trajectories from multiple fixed and moving cameras monitoring the same scenes (indoor and outdoor) to improve ...
trackingShakeFive2 A collection of 8 dyadic human interactions with accompanying skeleton metadata. The metadata is frame based xml data containing the skelet...
video, human, kinect, interactionDataset A (former NLPR Gait Database) was created on Dec. 10, 2001, including 20 persons. Each person has 12 image sequences, 4 sequences for each of th...
motion, foot, human, recognition, gait, action, classification, biometry, pressureThe Shefeld Kinect Gesture (SKIG) dataset contains 2160 hand gesture sequences (1080 RGB sequences and 1080 depth sequences) collected from 6 subjects. ...
illumination, gesture, kinect, depth, recognition, human, actionThe Brodatz dataset consists of 112 textures in grayscale images of various texture types. http://www.ee.oulu.fi/research/imag/texture/image_data/Brod...
segmentation, benchmark, classification, synthetic, textureThe 1DSfM Landmarks is a collection of community-based image reconstruction by Kyle Wilson and is comprised of 14 datasets with comparison to bundler gr...
urban, 3d, benchmark, city, reconstruction, landmark, groundtruthThe MSR RGB-D Dataset 7-Scenes dataset is a collection of tracked RGB-D camera frames. The dataset may be used for evaluation of methods for different a...
video, kinect, location, reconstruction, depth, trackingThe CHALEARN Multi-modal Gesture Challenge is a dataset +700 sequences for gesture recognition using images, kinect depth, segmentation and skeleton dat...
gesture, skeleton, kinect, depth, human, recognition, action, illumination, segmentationWe would like to announce the release of PASCAL-Context dataset. We augmented PASCAL VOC 2010 dataset with annotations for 400+ additional categories. I...
segmentation, benchmark, shape, recognition, pascal, category, semantic, denseThe Graffiti dataset by Krystian Mikolajczyk and Cordelia Schmid contains 48 images split into 8 sequences with 6 images each showing different structur...
benchmark, image rectification, feature detection, feature descriptionThe PASCAL VOC is augmented with segmentation annotation for semantic parts of objects. For example, for the person category, we provide segmentation ma...
part, human, recognition, object, pedestrian, segmentation, pascal, detection, semanticAn evaluation benchmark for dense MVS for these datasets fountain-P11, Herz-Jesu-P8, entry-P10, castle-P19, Herz-Jesu-P25, castle-P30 . Images (correc...
3d reconstruction, benchmark, sfm, depth, dense, meshThe KTH Multiview Football dataset contains 771 images of football players includes images taken from 3 views at 257 time instances 14 annotated body jo...
recognition, soccer, outdoor, object, pedestrian, game, pose, multiview, tracking, camera, multitarget, detectionThe HCI 4D Lightfields dataset contains 11 objects with corresponding lightfields for depth estimation. Datasets can be downloaded individually below....
3d, benchmark, evaluation, reconstruction, depth, 4d, lightfieldThe crowd datasets are collected from a variety of sources, such as UCF and data-driven crowd datasets. The sequences are diverse, representing dense cr...
video, pedestrian, scene, crowd, human, understanding, anomaly, detectionPermanently growing database on lung tuberculosis patients. The data include radiological images (CT+XRay) plus social, clinical, and lab data as well a...
medical, segmentation, xray, chest, genome, none, tuberculosis, ctThe Microsoft COCO (mscoco) is an image recognition and segmentation dataset which contains more 300k images for more than 70 categories. Other featur...
object, segmentation, benchmark, semantic, context, recognition, detectionThe Textures volume currently contains 154 images, all monochrome, 129 512x512 and 25 1024x1024. For the Brodatz texture images, the number in parenth...
segmentation, benchmark, evaluation, classification, synthetic, textureThe YACCLAB dataset includes both synthetic and real binary images and is suitable for a wide range of applications, ranging from document processing to...
fingerprints, videosurveillance, text, binary, medical, natural, labeling, randomnoiseThe Mall dataset was collected from a publicly accessible webcam for crowd counting and profiling research. Ground truth: Over 60,000 pedestrians wer...
video, pedestrian, crowd, counting, tracking, detection, indoor, webcamMany different labeled video datasets have been collected over the past few years, but it is hard to compare them at a glance. So we have created a hand...
video, object, benchmark, classification, recognition, detection, actionThe CVC Partial Occlusion Virtual Pedestrian datasets (CVC-01 to CVC-06) cover a range of scenarios of occluded pedestrians generated in a virtual and r...
urban, pedestrian, classification, synthetic, occlusion, tracking, detectionThe PETS 2016 IPATCH dataset contains a set of fourteen multi camera recordings (visible, themal) collected off the coast of Brest, France, in collabora...
visible, thermal, multimodal, vessel, maritime, boat, gps, tracking, detection, radarDataset contains 1000 images of 100 persons, with 10 images per person and is freely available. All images were acquired by cropping ears from images fr...
person, pedestrian, ear, recognition, human, lighting, biometryThe set was recorded in Zurich, using a pair of cameras mounted on a mobile platform. It contains 12'298 annotated pedestrians in roughly 2'000 frames.
trackingThe Berkeley Video Segmentation Dataset (BVSD) contains videos for segmentation (boundary?) Dataset train Dataset test
video, segmentation, benchmarkTo encourage the open comparison of single image shadow removal in community, we provide an online benchmark site and a dataset. Our quantitatively veri...
illumination, shadow, benchmark, singleview, removalDatabase contains 798 images of 114 persons, with 7 images per person and is freely available for research purposes. All images were taken in supervised...
face, person, human, lighting, recognition, illumination, pedestrian, biometryWe introduce the Shelf dataset for multiple human pose estimation from multiple views. In addition we annotate the body joints in the Campus dataset fro...
motion, multiple, 3d, estimation, capture, pose, human, viewThis dataset consists of more than 22,000 images of 24 people which are captured by 16 cameras installed in a shopping mall "Shinpuh-kan". All images ar...
trackingThis ETHZ CVL RueMonge 2014 dataset used for 3D reconstruction and semantic mesh labelling for urban scene understanding. It was first published in [1...
benchmark, paris, reconstruction, pointcloud, outdoor, 3d, source, architecture, semantic, code, urban, mesh, recognition, segmentation, classificationThe Extreme Classification Repository: Multi-label Datasets & Code Kush Bhatia Himanshu Jain Prateek Jain Manik Varma The objective in extreme mu...
multilabel, machine, learning, benchmark, evaluation, classificationISPRS Test Project on Urban Classification, 3D Building Reconstruction and Semantic Labeling. In this part of our working group site you will get furthe...
urban, benchmark, recognition, aerial, canada, segmentation, photogrammetry, germany, 3d, multiview, city, semanticThe Robotic 3D Scan Repository from Osnabrueck contains 23 different datasets showing a veriaty of 3D scans for objects, humans, cities, university camp...
lidar, scan, urban, reconstruction, human, laser, heat, aerial, germany, 3d, bremen, city, osnabrueckThe test sequences provide interested researchers a real-world multi-view test data set captured in the blue-c portals. The data is meant to be used for...
tracking, segmentation, camera, action, multiviewISPRS / EuroSDR Benchmark for Multi-Platform Photogrammetry In these pages you can get information about the BENCHMARK FOR MULTI-PLATFORM PHOTOGRAMMET...
urban, benchmark, reconstruction, aerial, photogrammetry, germany, 3d, multiview, switzerland, cityThe Malaya Abrupt Motion (MAMo) dataset is targeted for visual tracking, particularly for abrupt motion tracking. It was collected from publicly accessi...
abrupt motion tracking, tracking, visual trackingThe PIROPO database (People in Indoor ROoms with Perspective and Omnidirectional cameras) comprises multiple sequences recorded in two different indoor ...
perspective, human, indoor, room, surveillance, detection, fisheye, omnidirectional, peopleA large dataset of geotagged face images collected from Flickr. The zip file contains text files containing urls of the images. Face2GPS: Estimating G...
gender, face, geotagged, classification, age, localization, humanThe Prague Texture Segmentation Datagenerator and Benchmark is designed to mutually compare and rank different (dynamic/static) texture segmenters (supe...
benchmark, texture segmentation, texture classification, syntheticThe object is a plaster reproduction of Temple of the Dioskouroi in Agrigento, Sicily. Click on thumbnail for a full-sized (640x480) image. Resolution o...
3d reconstruction, 3d, benchmark, sfm, multiviewThe INRIA People dataset from Navneet Dalal and Bill Triggs [DalalCVPR2005] consists of training and testing data. The training contains 1805 images and...
pedestrian, sideview, boundingbox, frontview, object detection, humanThe dataset consist of the about 50 hours obtained from kindergarten surveillance videos. Dataset, totally approximately 100 videos sequences (1000GB, 5...
segmentation, action, behavior, video surveillance, human, backgroundThe 3D shape description dataset consists of multiple sub-datasets Descriptor Matching - Dataset 1 & 2 (Stanford) These datasets, created from some o...
description, 3d, benchmark, registration, reconstruction, shape, matchingThe VOT2016 pixel-wise annotations dataset contains pixel-wise per-frame annotations for sequences from VOT2016 dataset. The annotation is in a form of ...
object, segmentation, annotation, mask, visual, trackingHallway Corridor - Multiple Camera Tracking: An indoor camera network dataset with 6 cameras (contains ground plane homography).
trackingThe ICG Multi-Camera datasets consist of Easy Data Set (just one person) Medium Data Set (3-5 persons, used for the experiments) Hard Data Set (cro...
graz, indoor, video, object, pedestrian, multiview, tracking, camera, multitarget, detection, calibrationCollected in a clothing store. Captured with Kinect (640*480, about 30fps)
tracking, detectionThe Lane Level Localization dataset was collected on a highway in San Francisco with the following properties: * Reasonable traffic * Multiple lane h...
driving, benchmark, autonomous, video, road, gps, map, 3d, localization, carThe FaceScrub dataset comprises a total of 107818 unconstrained face images of 530 celebrities crawled from the Internet, with about 200 images per pers...
face, celebrity, detection, people, recognition, humanThe BEOID dataset includes object interactions ranging from preparing a coffee to operating a weight lifting machine and opening a door. The dataset is ...
video, object, egocentric, 3d, interaction, pose, trackingFine-Grained Visual Classification of Aircraft (FGVC-Aircraft) is a benchmark dataset for the fine grained visual categorization of aircraft. Data, an...
benchmark, evaluation, fine-grained, classification, aircraft, airplane, recognitionThe tracking environment consists of multiple 3D range sensors, covering an area of about 900 m2, in the "ATC" shopping center in Osaka, Japan.
trackingThe SPHERE human skeleton movements dataset was created using a Kinect camera, that measures distances and provides a depth map of the scene instead of ...
motion, skeleton, kinect, movement, depth, human, action, video, behaviorThe QMUL Junction dataset is a busy traffic scenario for research on activity analysis and behavior understanding. Video length: 1 hour (90000 frame...
video, motion, pedestrian, crowd, counting, tracking, detection, behavior