The All I Have Seen (AIHS) dataset was created to study the properties of total visual input in humans. For around two weeks, Nebojsa Jojic wore a camera that captured, on average, one image every 20 seconds during his waking hours. The resulting dataset contains a mix of indoor and outdoor scenes as well as numerous foreground objects. The creators' first analysis goal is to create a visual summary of the subject's two weeks of life using unsupervised algorithms that automatically discover recurrent scenes, familiar faces, or common actions. Direct application of existing algorithms, such as panoramic stitching (e.g. Photosynth) or appearance-based clustering models (e.g. the epitome), is impractical due to either the large dataset size or the dramatic variation in lighting conditions. The authors dubbed this type of data "All I Have Seen" (AIHS, meant to be pronounced similarly to "eyes"). While datasets of this type have been assembled before, it is our belief that with the proliferation of mobile devices and the availability of cloud computing, the time is now more appropriate than ever for research into this type of data acquisition, unsupervised techniques for data analysis, and applications built on top of them.
Reference: Nebojsa Jojic, Alessandro Perina, and Vittorio Murino. "Structural epitome: a way to summarize one's visual experience." NIPS 2010.
The VSUMM (Video SUMMarization) dataset consists of 50 videos from Open Video. All videos are in MPEG-1 format (30 fps, 352 x 240 pixels), in color and with s...
similarity, type, summary, user, video, static, keyframe, study
The multi-modal/multi-view datasets were created in a cooperation between the University of Surrey and Double Negative within the EU FP7 IMPART project. Th...
rgbd, color, dynamic, multi-view, action, outdoor, video, 3d, face, emotion, lidar, human, indoor, multi-mode, model
SceneNet RGB-D is a dataset comprising 5 million photorealistic images of synthetic indoor trajectories with ground truth. It expands the previous work ...
trajectory, reconstruction, scene, slam, lighting, indoor, segmentation, robot, rendering, 3d, synthetic, navigation
The SUNCG dataset is a Large 3D Model Repository for Indoor Scenes. SUNCG is an ongoing effort to establish a richly-annotated, large-scale dataset ...
scene, layout, recognition, indoor, object, segmentation, rendering, 3d, realism, room, synthetic
ScanNet is an RGB-D video dataset containing 2.5 million views in more than 1500 scans, annotated with 3D camera poses, surface reconstructions, and ins...
scene, layout, recognition, indoor, object, cad, segmentation, rendering, 3d, realism, room, synthetic
The Video Summarization (SumMe) dataset consists of 25 videos, each annotated with at least 15 human summaries (390 in total). The data consists of vide...
video, benchmark, summary, event, human, groundtruth, action
The UMD Dynamic Scene Recognition dataset consists of 13 classes and 10 videos per class and is used to classify dynamic scenes. The dataset has been ...
video, motion, dynamic, classification, scene, recognition
The BEOID dataset includes object interactions ranging from preparing a coffee to operating a weight lifting machine and opening a door. The dataset is ...
video, object, egocentric, 3d, interaction, pose, tracking
The GaTech VideoContext dataset consists of over 100 groundtruth annotated outdoor videos with over 20000 frames for the task of geometric context eval...
urban, nature, outdoor, video, segmentation, supervised, classification, context, unsupervised, geometry, semantic
The Airport MotionSeg dataset contains 12 sequences of videos of an airport scenario with small and large moving objects and various speeds. It is chall...
video, segmentation, motion, airport, clustering, camera, zoom
The MOT Challenge is a framework for the fair evaluation of multiple people tracking algorithms. In this framework we provide: - A large collection of...
multiple, benchmark, evaluation, dataset, target, video, pedestrian, 3d, tracking, surveillance, people
http://motchallenge.net/
The High Definition Analytics (HDA) dataset is a multi-camera High-Resolution image sequence dataset for research on High-Definition surveillance: Pedes...
high-definition, benchmark, human, lisbon, indoor, video, re-identification, pedestrian, network, multiview, tracking, surveillance, camera, detection
For the first few decades of the field's existence, computer vision has been focused on algorithmic, logical approaches to perception. But it was only wi...
object, 3d, kinect, reconstruction, depth, recognition, indoor
Yahoo Flickr Creative Commons 100M (YFCC100M) dataset contains a list of photos and videos. This list is compiled from data available on Yahoo! Flickr. ...
internet, reconstruction, recognition, image, community, social, 3d, clustering, detection, flickr, landmark
The Video2GIF dataset contains over 100,000 pairs of GIFs and their source videos. The GIFs were collected from two popular GIF websites (makeagif.com, ...
gif, scene, summarization, summary, video highlight detection, understanding
The Lane Level Localization dataset was collected on a highway in San Francisco with the following properties: * Reasonable traffic * Multiple lane h...
driving, benchmark, autonomous, video, road, gps, map, 3d, localization, car
The ICG Multi-Camera and Virtual PTZ dataset contains the video streams and calibrations of several static Axis P1347 cameras and one panoramic video fr...
graz, outdoor, video, object, panorama, pedestrian, network, crowd, multiview, tracking, camera, multitarget, detection, calibration
The crowd datasets are collected from a variety of sources, such as UCF and data-driven crowd datasets. The sequences are diverse, representing dense cr...
video, pedestrian, scene, crowd, human, understanding, anomaly, detection
The MSR Action datasets are a collection of various 3D datasets for action recognition. See details http://research.microsoft.com/en-us/um/people/zliu...
video, detection, 3d, action, reconstruction, recognition
The TVPR dataset includes 23 registration sessions. Each of the 23 folders contains the video of one registration session. Acquisitions have been perfor...
person, depth, recognition, indoor, top-view, video, clothing, gender, reidentification, identification, people
The ETHZ CVL RueMonge 2014 dataset is used for 3D reconstruction and semantic mesh labelling for urban scene understanding. It was first published in [1...
benchmark, paris, reconstruction, pointcloud, outdoor, 3d, source, architecture, semantic, code, urban, mesh, recognition, segmentation, classification
The Make3D Depth dataset is designed to learn features to estimate scene depth from a single image. This dataset contains aligned image and range data:...
single view, learning, indoor, outdoor, depth estimation
An indoor action recognition dataset which consists of 18 classes performed by 20 individuals. Each action is individually performed 8 times (4 dayt...
video, open-view, cross-view, recognition, indoor, action, multi-camera
This dataset package contains the software and data used for Detection-based Object Labeling on the RGB-D Scenes Dataset as implemented in the paper: ...
object, 3d, kinect, reconstruction, depth, recognition, indoor
The Mall dataset was collected from a publicly accessible webcam for crowd counting and profiling research. Ground truth: Over 60,000 pedestrians wer...
video, pedestrian, crowd, counting, tracking, detection, indoor, webcam
The Yotta dataset consists of 70 images for semantic labeling given in 11 classes. It also contains multiple videos and camera matrices for 14km or driv...
urban, reconstruction, video, segmentation, 3d, classification, camera, semantic
The 2D-3D-S dataset provides a variety of mutually registered modalities from 2D, 2.5D and 3D domains, with instance-level semantic and geometric annota...
reconstruction, depth, large-scale, indoor, normal, building, panorama, segmentation, 3d, semantic
Abstract Scene understanding has (again) become a focus of computer vision research, leveraging advances in detection, context modeling, and tracking. ...
scene, segmentation, pedestrian, 3d, classification, understanding, car, semantic
The ICG Multi-Camera datasets consist of: Easy Data Set (just one person); Medium Data Set (3-5 persons, used for the experiments); Hard Data Set (cro...
graz, indoor, video, object, pedestrian, multiview, tracking, camera, multitarget, detection, calibration
The Leeds Cows dataset by Derek Magee consists of 14 different video sequences showing a total of 18 cows walking from right to left in front of differe...
video, segmentation, detection, cow, animal, background
The CALTECH 256 dataset by Li Fei-Fei contains 30607 images for 256 categories.
object, detection, image, centered, classification, scene
The Pittsburgh Fast-food Image dataset (PFID) consists of 4545 still images, 606 stereo pairs, 303 360° videos for structure from motion, and 27 privacy-...
video, laboratory, classification, reconstruction, real, food, recognition
The object is a plaster dinosaur (stegosaurus). Resolution of ground truth model: 0.00025m (you may...
3d reconstruction, 3d, benchmark, sfm, multiview
ISPRS Test Project on Urban Classification and 3D Building Reconstruction: The ISPRS working group III/4 announces the release of the 2D semantic label...
urban, reconstruction, recognition, building, 3d, classification, city, semantic
The Daimler Mono Pedestrian Classification Benchmark dataset consists of two parts: a base data set. The base data set contains a total of 4000 pedest...
illumination, object, urban, pedestrian, classification, outdoor, scale
This dataset comprises information regarding the ADLs performed by two users on a daily basis in their own homes. This dataset is composed of two insta...
classification, clustering, sequential, time-series, multivariate
JPL First-Person Interaction dataset (JPL-Interaction dataset) is composed of human activity videos taken from a first-person viewpoint. The dataset par...
video, motion, action, interactive, recognition, human
The MSR RGB-D Dataset 7-Scenes dataset is a collection of tracked RGB-D camera frames. The dataset may be used for evaluation of methods for different a...
video, kinect, location, reconstruction, depth, tracking
Documents are first obtained via a Web search using AMIEI: an integrated platform for delivering enterprise intelligence, developed by AMI Software ([We...
multivariate, text, clustering, sequential
The YouTube-Objects dataset is composed of videos collected from YouTube by querying for the names of 10 object classes. It contains between 9 and 24 vi...
video, object, flow, segmentation, detection, optical
This is a sparse data set: less than 10% of the attributes are used for each sample. The link is to a '*.tgz' file which contains two files: [amzn-anon-...
clustering, causal-discovery, regression, time-series, domain-theory
Ongoing research on university faculty perceptions and practices of using Wikipedia as a teaching resource. Based on a Technology Acceptance Model, the ...
regression, multivariate, clustering, causal-discovery
The KTH Multiview Football dataset contains 771 images of football players, including images taken from 3 views at 257 time instances, with 14 annotated body jo...
recognition, soccer, outdoor, object, pedestrian, game, pose, multiview, tracking, camera, multitarget, detection
CSV format where each row is a paper and each column is an attribute.
multivariate, clustering
The HCI 4D Lightfields dataset contains 11 objects with corresponding lightfields for depth estimation. Datasets can be downloaded individually below....
3d, benchmark, evaluation, reconstruction, depth, 4d, lightfield
The data set was gathered while we were working on a project for Bahrain University between 2002 and 2003.
domain-theory, univariate, classification, clustering
The SegTrack dataset consists of six videos (five are used) with ground truth pixelwise segmentation (6th penguin is not usable). The dataset is used fo...
motion, video, object, proposal, flow, segmentation, stationary, model, camera, optical, groundtruth
This dataset consists of 51 oral presentations recorded with 2 ambient visual sensors (webcams), 3 First Person View (FPV) cameras (1 on presenter and 2 on ra...
video, quality, kinect, multi-sensor, presentation, analysis
The Freiburg-Berkeley Motion Segmentation Dataset (FBMS-59) is an extension of the BMS dataset with 33 additional video sequences. A total of 720 frames...
motion, benchmark, video, object, pedestrian, segmentation, tracking, groundtruth
The VidPairs dataset contains 133 pairs of images, taken from 1080p HD (~2 megapixel) official movie trailers. Each pair consists of images of the same ...
matching, dense, video, flow, description, patch, pair, optical
The CALTECH 101 dataset by Li Fei-Fei contains images for 101 categories with about 40 to 800 images per category. Most categories have about 50 images ...
object, natural-image, centered, scene, image classification
The Rent3D dataset comprises floorplans and images. The goal of this work is to enable a 3D virtual-tour of an apartment given a small set of monocular ...
building, urban, reconstruction, floorplan, layout, apartment, indoor
The GaTech VideoStab dataset consists of N videos for the task of video stabilization. This code is implemented in the YouTube video editor for stabilizatio...
video, camera, path, stabilization
The data set consists of the expression levels of 77 proteins/protein modifications that produced detectable signals in the nuclear fraction of cortex. ...
multivariate, classification, clustering
The PETS 2006 dataset contains 7 parts showing multi-sensor sequences containing left-luggage scenarios with increasing scene complexity at a train stat...
pedestrian, indoor, frontview, object tracking, object detection, multitarget
The Berkeley Video Segmentation Dataset (BVSD) contains videos for segmentation (boundary?).
video, segmentation, benchmark
The Fish4Knowledge project (groups.inf.ed.ac.uk/f4k/) is pleased to announce the availability of 2 subsets of our tropical coral reef fish video and e...
motion, nature, recognition, fish, video, water, classification, animal, camera
The dataset represents 10 years (1999-2008) of clinical care at 130 US hospitals and integrated delivery networks. It includes over 50 features represen...
multivariate, classification, clustering
The MSRC v1 dataset from Microsoft Research in Cambridge contains 240 images and 9 object classes with coarse pixel-wise labeled images. The dataset is...
semantic segmentation, semantic, outdoor
Real-time readings collected for residential, commercial, industrial, and agricultural consumers to determine accurate consumption in Tamil Nadu, around Thanjavur.
regression, multivariate, classification, clustering
Samples (instances) are stored row-wise. Variables (attributes) of each sample are RNA-Seq gene expression levels measured by the Illumina HiSeq platform.
multivariate, classification, clustering
The Symmetry Facades dataset contains 9 building facades with multiple images. It is used for coupled symmetry and structure from motion detection. Coup...
urban, reconstruction, facade, building, 3d, repetition, symmetry, sfm
This dataset was constructed by adding elevation information to a 2D road network in North Jutland, Denmark (covering a region of 185 x 135 km^2). Eleva...
regression, text, clustering, sequential
These are Atlantic-Mediterranean marine sponges that belong to O.Hadromerida (Demospongiae.Porifera).
multivariate, clustering
Some datasets and evaluation tools are provided on this page for four different computer vision and computer graphics problems. Population counting L...
urban, surface, reconstruction, pointcloud, object, road, pedestrian, network, line, 3d, crowd, counting, detection, groundtruth
This dataset was used in several classification tasks related to the challenge of anuran species recognition through their calls. It is a multilabel da...
multivariate, classification, clustering
We introduce the Shelf dataset for multiple human pose estimation from multiple views. In addition we annotate the body joints in the Campus dataset fro...
motion, multiple, 3d, estimation, capture, pose, human, view
Background information: The data set concerns the earliest history of mankind. Prehistoric men created the desired shape of a stone tool by striking on ...
multivariate, classification, clustering, causal-discovery
The multiple foreground video co-segmentation dataset consists of four sets, each with a video pair and two foreground objects in common. The datase...
video, segmentation, co-segmentation
The dataset (movement_libras) contains 15 classes of 24 instances each, where each class refers to a hand movement type in LIBRAS. In the video pre...
multivariate, classification, clustering, sequential
In predicting stock prices you collect data over some period of time - day, week, month, etc. But you cannot take advantage of data from a time period u...
time-series, classification, clustering
This archive contains 2075259 measurements gathered between December 2006 and November 2010 (47 months). Notes: 1.(global_active_power*1000/60 - sub_me...
regression, time-series, multivariate, clustering
The Daimler Urban Segmentation Dataset consists of video sequences recorded in urban traffic. The dataset consists of 5000 rectified stereo image pairs ...
segmentation, urban, motion, stereo, semantic, outdoor
The Farman Institute 3D Point Sets dataset contains 11 objects by a 3D laser scanner. This dataset was peer-reviewed by Image Processing On Line: Farman...
object, scanner, 3d, reconstruction, point, model, laser
The York Urban Line Segment Database is a compilation of 102 images (45 indoor, 57 outdoor) of urban environments consisting mostly of scenes from the c...
vanishing point, urban, reconstruction, outdoor, pose estimation, manhattan, geometry
This site is dedicated to provide datasets for the Robotics community with the aim to facilitate result evaluations and comparisons. The datasets presen...
urban, laser, 3d, city, nature
The 3DVis dataset includes a set of 12 heterogeneous scenes for testing 3D scene registration and analysis methods. Models include homogeneous shapes, r...
3d, registration, reconstruction, shape, matching, symmetry
The Robotic 3D Scan Repository from Osnabrueck contains 23 different datasets showing a variety of 3D scans for objects, humans, cities, university camp...
lidar, scan, urban, reconstruction, human, laser, heat, aerial, germany, 3d, bremen, city, osnabrueck
The data is in the transactional form. It contains the Latin names (species or genus) and state abbreviations.
multivariate, clustering
The dataset consists of a total of 3600 documents including 600 news/texts from six categories economy, culture-arts, health, politics, sports and tech...
text, classification, clustering
ISPRS / EuroSDR Benchmark for Multi-Platform Photogrammetry In these pages you can get information about the BENCHMARK FOR MULTI-PLATFORM PHOTOGRAMMET...
urban, benchmark, reconstruction, aerial, photogrammetry, germany, 3d, multiview, switzerland, city
The PIROPO database (People in Indoor ROoms with Perspective and Omnidirectional cameras) comprises multiple sequences recorded in two different indoor ...
perspective, human, indoor, room, surveillance, detection, fisheye, omnidirectional, people
The GaTech VideoSeg dataset consists of two (waterski and yunakim?) video sequences for object segmentation. There exists no groundtruth segmentation ...
video, object, segmentation, motion, model, camera
This dataset provides a collection of web images and 3D models for research on landmark recognition (especially for methods based on 3D models). We hope...
codebook, reconstruction, matching, recognition, retrieval, 3d, classification, feature, flickr, landmark
We present a new large-scale dataset that contains a diverse set of stereo video sequences recorded in street scenes from 50 different cities, with high...
urban, stereo, cities, person, video, weakly, segmentation, pedestrian, detection, car, semantic
The Notre Dame de Paris dataset is used for 3D SfM reconstruction and contains 715 images provided by Noah Snavely. There are also versions for NotreDame...
paris, pointcloud, frontview, limited, 3d reconstruction, 3d, flickr, landmark, sfm
Please see the README for the details on the data organization, and so on.
multivariate, text, classification, clustering
The database of nude and non-nude videos contains a collection of 179 video segments collected from the following movies: Alpha Dog, Basic Instinct, Bef...
video, nude detection, movie
The object is a plaster reproduction of the Temple of the Dioskouroi in Agrigento, Sicily. Resolution o...
3d reconstruction, 3d, benchmark, sfm, multiview
The Places205 database contains 2.5 million images from 205 scene categories for the academic public. The image dataset contains 2,448,873 images from 205 ...
urban, learning, scene, feature, place, recognition
The Eurasian Cities dataset contains 103 images of outdoor urban scenes taken in Eurasian cities. It is annotated with horizontal and vertical vanishing...
urban, vanishing, reconstruction, manhattan, outdoor, line, pose, point, geometry
Brief Description of the Dataset: Each of the 19 activities is performed by eight subjects (4 female, 4 male, between ...
time-series, multivariate, classification, clustering
The New College Data Set contains 30GB of data intended for use by the mobile robotics and vision research communities. Our anticipated users are partie...
urban, stereo, reconstruction, path, panorama, 3d, odometry, navigation
The Interactive Segmentation (IcgBench) dataset from Jakob Santner contains 243 images and 262 segmentations. Some images have multiple segmentations. Th...
interactive segmentation, user
This data set contains 13,910 measurements from 16 chemical sensors exposed to 6 gases at different concentration levels. This dataset is an extension o...
causal-discovery, classification, clustering, regression, time-series, multivariate
The 3D shape description dataset consists of multiple sub-datasets. Descriptor Matching - Dataset 1 & 2 (Stanford): These datasets, created from some o...
description, 3d, benchmark, registration, reconstruction, shape, matching
The Hollywood-2 dataset contains 12 classes of human actions and 10 classes of scenes distributed over 3669 video clips and approximately 20.1 hours of video...
video, segmentation, action classification
Indoor localisation is a key topic for the Ambient Intelligence (AmI) research community. In this scenario, recent advancements in wearable technolog...
classification, clustering, sequential, regression, time-series, multivariate
The TUG (Timed Up and Go test) dataset consists of actions performed three times by 20 volunteers. The people involved in the test are aged between 22 a...
wearable, kinect, time, human, recognition, action, depth image processing - tug, accelerometer, video
The city planar and non-planar dataset consists of urban scenes accompanied by text files describing the plane/non-plane locations. Training Set (Univ...
building, urban, detection, 3d, estimation, plane
The data set has no missing values. Values are in kW for each 15-minute interval. To convert the values to kWh, divide them by 4. Each column represents one client....
regression, time-series, clustering
Automatic identification of commercial blocks in news videos finds a lot of applications in the domain of television broadcast analysis and monitoring. ...
multivariate, classification, clustering
The Weather and Illumination Database (WILD) is an extensive database of high quality images of an outdoor urban scene, acquired every hour over all sea...
urban, estimation, depth, weather, time, newyork, webcam, video, illumination, change, static, camera, light
Gaze data on video stimuli for computer vision and visual analytics. Converted 318 video sequences from several different gaze tracking data sets with...
video, metadata, segmentation, gaze data, polygon annotation
The dataset contains 15 documentary films that are downloaded from YouTube, whose durations vary from 9 minutes to as long as 50 minutes, and the total ...
video, object, detection
Salient Montages is a human-centric video summarization dataset from the paper [1]. In [1], we present a novel method to generate salient montages...
video, saliency, wearable, montage, summarization, human
The Longterm Pedestrian dataset consists of images from a stationary camera running 24 hours for 7 days at about 1 fps. It is used for adaptive detection ...
coffee, graz, background, indoor, illumination, change, pedestrian, robust, multitarget, detection
The examined group comprised kernels belonging to three different varieties of wheat: Kama, Rosa and Canadian, 70 elements each, randomly selected for t...
multivariate, classification, clustering
Voxel Based Dataset for Systematic 3D reconstruction by artificial neural networks (ANNs). A synthetic scalable cube dataset for training, testing and...
deep learning, synthetic city urban, 3d, sfm, reconstruction
ISPRS and EuroSDR - Benchmark on High Density Aerial Image Matching Background and Scope of the project Innovations in matching algorithms as well a...
urban, benchmark, reconstruction, aerial, photogrammetry, germany, 3d, multiview, switzerland, city
The users' knowledge classes were assigned by the authors using an intuitive knowledge classifier (a hybrid ML technique of k-NN and meta-heurist...
multivariate, classification, clustering
KEGG Metabolic pathways can be realized as networks. Two kinds of network / graph can be formed. These include Reaction Network and Relation Network. I...
univariate, classification, clustering, regression, multivariate, text
Please find the original data at '[Web Link]'
time-series, multivariate, classification, clustering
The xawAR16 dataset is a multi-RGBD camera dataset, generated inside an operating room (IHU Strasbourg), which was designed to evaluate tracking/relocal...
video, medicine, table, depth, operation, recognition, surgery
The Pornography database contains nearly 80 hours of 400 pornographic and 400 non-pornographic videos. For the pornographic class, we have browsed websi...
video, pornography, video shots, video frames
Due to size limitations, the images of the Stable Structure from Motion datasets cannot be put online. Instead, here are the tracked image points and the final ...
church, stability, 3d reconstruction, 3d, robust, geometry, landmark, sfm
The dataset collects data from an Android smartphone positioned in the chest pocket. Accelerometer Data are collected from 22 participants walking in th...
univariate, classification, clustering, sequential, time-series
ShakeFive2: A collection of 8 dyadic human interactions with accompanying skeleton metadata. The metadata is frame based xml data containing the skelet...
video, human, kinect, interaction
The characters here were used for a PhD study on primitive extraction using HMM based models. The data consists of 2858 character samples, contained in ...
time-series, classification, clustering
This dataset contains 7 challenging volleyball activity classes annotated in 6 videos from professionals in the Austrian Volley League (season 2011/12)....
video, sport, analysis, activity recognition, volleyball, detection, action
The 1DSfM Landmarks is a collection of community-based image reconstruction by Kyle Wilson and is comprised of 14 datasets with comparison to bundler gr...
urban, 3d, benchmark, city, reconstruction, landmark, groundtruth
This web page presents visual-inertial datasets collected on-board a Micro Aerial Vehicle (MAV). The datasets contain stereo images, synchronized IMU me...
slam, global shutter, indoor, aerial vehicles
A dataset of 66 stereo pairs with subpixel ground truth. The construction and improvement of algorithms for subpixel stereovision requires very pr...
stereo, depth, pointcloud, noise, stereovision, 3d, groundtruth, subpixel
Zurich Hoengg (Switzerland) is an aerial dataset. The dataset consists of 4 aerial images in colour (Figures 2-5), scanned with 14 microns, the forma...
semantic segmentation, aerial, outdoor
The Dataset for ADL Recognition with Wrist-worn Accelerometer is a public collection of labelled accelerometer data recordings to be used for the creati...
time-series, multivariate, classification, clustering
The dataset is in the form of a 11463 x 5812 matrix of word counts, containing 11463 words and 5811 NIPS conference papers (the first column contains th...
text, clustering
The dataset is composed of features extracted from 7 videos with people gesticulating, aiming at studying Gesture Phase Segmentation. Each video is repr...
classification, clustering, sequential, time-series, multivariate
* Audio track (encoded as mp3) of each of the 106,574 tracks. It is on average 10 million samples per track.
* Nine audio features (consisting of 518 at...
time-series, multivariate, classification, clustering
The automotive multi-sensor (AMUSE) dataset consists of inertial and other complementary sensor data combined with monocular, omnidirectional, high fram...
urban, api, image, video, inertial, streetside, traffic, city
The SPHERE human skeleton movements dataset was created using a Kinect camera, that measures distances and provides a depth map of the scene instead of ...
motion, skeleton, kinect, movement, depth, human, action, video, behavior
The PD and control handwriting database consists of 62 PWP (People with Parkinson's) and 15 healthy individuals who appealed at the Department of Neurolog...
regression, multivariate, classification, clustering
This dataset comes from the daily measures of sensors in an urban waste water treatment plant. The objective is to classify the operational state of the ...
multivariate, clustering
The automated analysis of facial expressions has been widely used in different research areas, such as biometrics or emotional analysis. Special impo...
multivariate, classification, clustering, sequential
The Video Segmentation Benchmark (VSB100) provides ground truth annotations for the Berkeley Video Dataset, which consists of 100 HD quality videos divi...
video, object, segmentation, motion, pedestrian, benchmark, tracking, groundtruth
Scene Parsing Benchmark: Scene parsing data and part segmentation data derived from the ADE20K dataset can be downloaded from the MIT Scene Parsing Benchmark. ...
segmentation, annotation, benchmark, semantic, scene, recognition
The Daimler Mono Pedestrian Detection Benchmark dataset contains a large training and test set. The training set contains 15.560 pedestrian samples (ima...
object, mono, urban, pedestrian, outdoor, scale, detection
The HandNet dataset contains depth images of 10 participants' hands non-rigidly deforming in front of a RealSense RGB-D camera. This dataset includes 2...
rgbd, hand, articulation, video, segmentation, classification, pose, fingertip, detection
Style, Price, Rating, Size, Season, NeckLine, SleeveLength, waiseline, Material, FabricType, Decoration, Pattern, Type, Recommendation are Attributes in...
text, classification, clustering
This is a transnational data set which contains all the transactions occurring between 01/12/2010 and 09/12/2011 for a UK-based and registered non-store...
classification, clustering, sequential, time-series, multivariate
The UCF Person and Car VideoSeg dataset consists of six videos with groundtruth for video object segmentation. Surfing, jumping, skiing, sliding, big ...
video, object, segmentation, motion, model, camera, groundtruth
The dataset captures 25 people preparing 2 mixed salads each and contains over 4h of annotated accelerometer and RGB-D video data. Annotated activities ...
video, activity, classification, tracking, recognition, detection, action
Open University Learning Analytics Dataset (OULAD) contains data about courses, students and their interactions with Virtual Learning Environment (VLE) ...
classification, clustering, sequential, regression, time-series, multivariate
The Cholec80 dataset contains 80 videos of cholecystectomy surgeries performed by 13 surgeons. The videos are captured at 25 fps. The dataset is labeled...
video, medicine, surgery, phase, tool, recognition
The Traffic Video dataset consists of X video of an overhead camera showing a street crossing with multiple traffic scenarios. The dataset can be down...
video, urban, traffic, road, overhead, tracking, view, detection
- The leaves were placed on a white background and then photographed.
- The pictures were taken in broad daylight to ensure optimum light intensity.
multivariate, classification, clustering
A dataset acquired with 3 synchronized sensors (Primesense Carmine 1.09, Microsoft Kinect v2, Canon IXUS 950 IS), featuring: * 30 industry-relevant ob...
object, rgbd, 3d, estimation, pose, texture-less
This dataset contains 600 examples of control charts synthetically generated by the process in Alcock and Manolopoulos (1999). There are six different c...
time-series, classification, clustering
HTRU2 is a data set which describes a sample of pulsar candidates collected during the High Time Resolution Universe Survey (South) [1]. Pulsars are a...
multivariate, classification, clustering
We present a dataset to address the problem of visual privacy - where users unintentionally leak private information when sharing personal images online...
multilabel, privacy, classification, flickr, scene, regression
The Heterogeneity Dataset for Human Activity Recognition from Smartphone and Smartwatch sensors consists of two datasets devised to investigate sensor h...
time-series, multivariate, classification, clustering
It is composed of ADL (activities of daily living) and fall actions simulated by 11 volunteers. The people involved in the test are aged between 22 and 39, w...
wearable, kinect, fall detection - adl, depth, human, recognition, action, accelerometer, video
The current video database containing six types of human actions (walking, jogging, running, boxing, hand waving and hand clapping) performed several ti...
video, segmentation, action classification
The Weizmann actions dataset by Blank, Gorelick, Shechtman, Irani, and Basri consists of ten different types of actions: bending, jumping jack, jumping,...
video, segmentation, action, action classification
The Ford Car dataset is a joint effort of Pandey et al. (for collecting images, Lidar points, calibration etc.) and us (for annotation of 2D and 3D object...
lidar, detection, groundtruth, 3d, car, sfm
The measurements were created to ease the development, comparison and evaluation of fingerprinting based hybrid indoor positioning methods. The measurem...
text, classification, clustering, causal-discovery
The Babenko tracking dataset contains 12 video sequences for single object tracking. For each clip they provide (1) a directory with the original i...
face, video, single, occlusion, object tracking, animal
The Leuven Stereo Scene dataset is a scene and depth dataset. There exist two variants of this dataset - a CVPR 2007 paper [1] by Leibe et al. for detec...
urban, stereo, depth, reconstruction, leuven, segmentation, 3d, semantic, sfm
The PETS 2009 dataset contains 3 parts showing multi-view sequences containing pedestrians walking in an outdoor environment. The parts are used for per...
overlap, human, frontview, occlusion multitarget, outdoor, pedestrian, tracking, detection
Estimate robust and reliable depth or motion fields on our challenging real world videos!
optical flow, stereo depth, outdoor
The goal of LabelMe is to provide an online annotation tool to build image databases for computer vision research. You can contribute to the database by...
urban, semantic segmentation, software, semantic, outdoor, object detection
Indoor localization is a key topic for mobile computing. However, it is still very difficult for the mobile sensing community to compare state-of-the-art In...
classification, clustering, sequential, regression, time-series, multivariate
For complete information see the official challenge page: [Web Link]
clustering, sequential, causal-discovery, time-series, multivariate, domain-theory
52 columns for 52 weeks; normalised values are also provided.
time-series, multivariate, clustering
Welcome to the homepage of the gvvperfcapeva datasets. This site serves as a hub to access a wide range of datasets that have been created for projects ...
face, reconstruction, depth, mesh, human, action, video, pose, multiview, tracking
Paris-rue-Madame dataset contains 3D Mobile Laser Scanning (MLS) data from rue Madame, a street in the 6th Parisian district (France). The test zone con...
segmentation, 3d, semantic, classification, pointcloud, laser
The Landmark 1000 or 1k dataset is a collection of the top 1000 popular landmarks mined from Flickr. It is maintained by Noah Snavely and publi...
estimation, location, reconstruction, pointcloud, world, 3d, pose, landmark
This is a subset of the dataset introduced in the SIGGRAPH Asia 2009 paper, Webcam Clip Art: Appearance and Illuminant Transfer from Time-lapse Sequence...
urban, nature, time, webcam, video, illumination, change, static, camera, light
CSV format where each row is a paper and each column an attribute.
multivariate, clustering
These datasets were generated for the M2CAI challenges, a satellite event of MICCAI 2016 in Athens. Two datasets are available for two different challen...
video, medicine, workflow, surgery, recognition, challenge
News items are grouped into clusters that represent pages discussing the same news story. The dataset also includes references to web pages that, at the acce...
multivariate, classification, clustering
This corpus has been collected from free or free-for-research sources on the Internet: -> A collection of 425 SMS spam messages was manually extracted ...
text, classification, clustering, multivariate, domain-theory
We collected a video dataset, termed ChokePoint, designed for experiments in person identification/verification under real-world surveillance conditions...
face, real, human, recognition, world, pedestrian, identification, clustering, multiview, surveillance, detection, sequence
The dataset is a subset of RCV1. This corpus has already been used in author identification experiments. In the top 50 authors (with respect to total...
text, classification, clustering, multivariate, domain-theory
The Where Who Why (WWW) dataset provides 10,000 videos with over 8 million frames from 8,257 diverse scenes, therefore offering a superior comprehensive...
recognition, video, flow, pedestrian, crowd, surveillance, optical, detection
Many different labeled video datasets have been collected over the past few years, but it is hard to compare them at a glance. So we have created a hand...
video, object, benchmark, classification, recognition, detection, action
KEGG Metabolic pathways can be realized as a network. Two kinds of network (graph) can be formed. These include the Reaction Network and the Relation Network. I...
univariate, classification, clustering, regression, multivariate, text
The experiments have been carried out with a group of 115 first-year undergraduate Engineering students at the University of Genoa. We carri...
classification, clustering, sequential, regression, time-series, multivariate
ISPRS Test Project on Urban Classification, 3D Building Reconstruction and Semantic Labeling. In this part of our working group site you will get furthe...
urban, benchmark, recognition, aerial, canada, segmentation, photogrammetry, germany, 3d, multiview, city, semantic
A Vicon motion capture camera system was used to record 12 users performing 5 hand postures with markers attached to a left-handed glove. A rigid patt...
multivariate, classification, clustering
For each text collection, D is the number of documents, W is the number of words in the vocabulary, and N is the total number of words in the collection...
text, clustering
Aiguille du Midi. France showing photographs with Camera: Mamiya ZD. 55mm. - Resolution: 5Mpixels, 53 images - Photographer: B. Vallet (Imagine/EVD - 20...
3d reconstruction, large scale, sfm, outdoor, mesh
A Vicon motion capture camera system was used to record 12 users performing 5 hand postures with markers attached to a left-handed glove. A rigid patt...
multivariate, classification, clustering
The 3D Mask Attack Database (3DMAD) is a biometric (face) spoofing database. It currently contains 76500 frames of 17 persons, recorded using Kinect for...
face, emotion, segmentation, 3d, recognition, biometry, frontview
The data was collected as part of the 1990 census. There are 68 categorical attributes. This data set was derived from the USCensus1990raw data set. T...
multivariate, clustering
The MSRC v2 dataset is an extension of the MSRC v1 dataset from Microsoft Research in Cambridge. It contains 591 images and 23 object classes with accur...
semantic segmentation, semantic, outdoor
The Oxford RobotCar Dataset contains over 100 repetitions of a consistent route through Oxford, UK, captured over a period of over a year. The dataset c...
driving, street, urban, time, recognition, autonomous, video, segmentation, robot, classification, detection, car, year
The DrivFace database contains image sequences of subjects while driving in real scenarios. It is composed of 606 samples of 640x480 pixels each, acquir...
regression, multivariate, classification, clustering
The QMUL Junction dataset is a busy traffic scenario for research on activity analysis and behavior understanding. Video length: 1 hour (90000 frame...
video, motion, pedestrian, crowd, counting, tracking, detection, behavior
The Street View Text (SVT) dataset contains 647 words and 3796 letters in 249 images harvested from Google Street View. The dataset is more challengin...
urban, text recognition, text detection, classification, outdoor
Background Models Challenge (BMC) is a complete dataset and competition for the comparison of background subtraction algorithms. The main topics concer...
motion, background, video, modeling, segmentation, change, surveillance, detection
This web page contains video data and ground truth for 16 dances with two different dance patterns. The style of dancing is inspired by Scottish Ceilidh...
motion, dance, analysis, background, action, video, chemistry, pattern
The experiments have been carried out with a group of 30 volunteers within an age bracket of 19-48 years. Each person performed six activities (WALKING,...
time-series, multivariate, classification, clustering
- The dataset collects data from a wearable accelerometer mounted on the chest
- Sampling frequency of the accelerometer: 52 Hz
- Acceler...
univariate, classification, clustering, sequential, time-series
The CIFAR-10 dataset consists of 60000 32x32 colour images in 10 classes, with 6000 images per class. There are 50000 training images and 10000 test ima...
object, color, patch, scene, tiny, image classification
The MSRC vNIPS dataset is the MSRC v2 dataset with new annotations for much more accurate segmentations for 93 images. Efficient Inference in Fully Co...
semantic segmentation, semantic, outdoor
The procedural texture perceptual similarity dataset contains a list of procedural textures along with their pairwise distances, as defined by a percept...
study, benchmark, procedural, texture
The domain-specific personal videos highlight dataset from the paper [1] describes a fully automatic method to train domain-specific highlight ranker f...
saliency, domain, wearable, human, recognition, action, video, summarization
The video co-segmentation dataset contains 4 video sets with 11 videos in total, with 5 frames of each video labeled with the pixel-level ground-tr...
video, segmentation, co-segmentation, dataset
Unlike the previous SHREC contests, the objective of this SHREC 2012 contest is to evaluate the performance of 3D-mesh segmentation techniques instead o...
mesh, segmentation, 3d, part
The UrbanStreet dataset used in the paper can be downloaded here [188M]. It contains 18 stereo sequences of pedestrians taken from a stereo rig mounted...
urban, human, recognition, video, pedestrian, segmentation, tracking, multitarget, detection
Provide all relevant information about your data set.
multivariate, classification, clustering
The Multicamera Human Action Video Data (MuHAVi) Manually Annotated Silhouette Data (MAS) are two datasets consisting of selected action sequences for ...
video, segmentation, action, behavior, human, background
The RGB-D Person Re-identification dataset is for person re-identification using depth information. The main motivation is that the standard techniques ...
pedestrian, 3d, identification, classification, depth, shape
The Webcam Interestingness dataset consists of 20 different webcam streams, with 159 images each. It is annotated with interestingness ground truth, acq...
video, interest, retrieval, classification, weather, ranking, webcam
The CMP Facade dataset consists of facade images assembled at the Center for Machine Perception, which includes 600 rectified images of facades from var...
urban, similarity, facade, recognition, segmentation, structure, classification, rectification, semantic
At Udacity, we believe in democratizing education. How can we provide opportunity to everyone on the planet? We also believe in teaching really amazing ...
driving, street, urban, time, recognition, autonomous, video, segmentation, robot, classification, detection, car, synthetic