This data set is designed for testing indexing schemes in time series databases. It is a much larger dataset than has been used in any published study (That we are currently aware of). It contains one million data points. The data has been split into 10 sections to facilitate testing (see below). We recommend building the index with 9 of the 100,000-datapoint sections, and randomly extracting a query shape from the 10th section. (Some previously published work seems to have used queries that were also used to build the indexing structure. This will produce optimistic results) The data are interesting because they have structure at different resolutions. Each of the 10 sections where generated by independent invocations of the function: (see equation.gif) Where rand(x) produces a random integer between zero and x. The data appears highly periodic, but never exactly repeats itself. This feature is designed to challenge the indexing structure. The time series are ploted here: (ts1-5.gif), (ts6-10.gif)
People used for recording of the data were wearing four tags (ankle left, ankle right, belt and chest). Each instance is a localization data for one of...
time-series, univariate, classification, sequentialSequential (time-series) domain. Single-line melodies of 100 Bach chorales (originally 4 voices). The melody line can be studied independently of othe...
time-series, univariate--- The dataset collects data from a wearable accelerometer mounted on the chest --- Sampling frequency of the accelerometer: 52 Hz --- Acceler...
univariate, classification, clustering, sequential, time-seriesData is collected from imkb.gov.tr and finance.yahoo.com. Data is organized with regard to working days in Istanbul Stock Exchange.
univariate, classification, regression, time-series, multivariateThe dataset collects data from an Android smartphone positioned in the chest pocket. Accelerometer Data are collected from 22 participants walking in th...
univariate, classification, clustering, sequential, time-seriesThe dataset could contain missing values. The data was sampled every minute, computing and uploading it smoothed with 15 minute means. The header of the...
text, sequential, regression, time-series, multivariateThe dataset is composed by features extracted from 7 videos with people gesticulating, aiming at studying Gesture Phase Segmentation. Each video is repr...
classification, clustering, sequential, time-series, multivariateThree data sets are submitted, for training and testing. Ground-truth occupancy was obtained from time stamped pictures that were taken every minute. Fo...
time-series, multivariate, classificationBike sharing systems are new generation of traditional bike rentals where whole process from membership, rental and return back has become automatic. Th...
regression, univariateThis is a custom generated dataset designed for the task of action co-segmentation in pairs of action sequences. The dataset contains 101 pairs of ac...
time-series, temporal segmentation, action cosegmentation, motion-capture-dataThe donation includes 5 datasets, each of them defining a different learning problem: * LP1: failures in approach to grasp position * LP2: fail...
time-series, multivariate, classificationOur dataset is used by us to explore spammers in microblog and you can access our demo system at [Web Link]Please add :8080 after the domain name as por...
univariate, classification, sequential, causal-discovery, multivariate, text1. Protocol: Seven male and three female subjects (age 25 to 30), who have experienced aggression in scenarios such as physical fighting, took par...
time-series, classification* Audio track (encoded as mp3) of each of the 106,574 tracks. It is on average 10 millions samples per track.* Nine audio features (consisting of 518 at...
time-series, multivariate, classification, clusteringThis data arises from a large study to examine EEG correlates of genetic predisposition to alcoholism. It contains measurements from 64 electrodes place...
time-series, multivariateThe MHEALTH (Mobile HEALTH) dataset comprises body motion and vital signs recordings for ten volunteers of diverse profile while performing several phys...
time-series, multivariate, classificationThis dataset represents a real-life benchmark in the area of Activity Recognition applications, as described in [1]. The classification tasks consist ...
time-series, multivariate, classification, sequentialDiabetes patient records were obtained from two sources: an automatic electronic recording device and paper records. The automatic device had an inter...
time-series, multivariateThis loop sensor data was collected for the Glendale on ramp for the 101 North freeway in Los Angeles. It is close enough to the stadium to see unusual...
time-series, multivariateThis data set contains the acquired time series from 16 chemical sensors exposed to gas mixtures at varying concentration levels. In particular, we gene...
regression, time-series, multivariate, classification1. Protocol: Three male and one female subjects (age 25 to 30), who have experienced aggression in scenarios such as physical fighting, took part ...
time-series, classificationThe experiments have been carried out with a group of 30 volunteers within an age bracket of 19-48 years. Each person performed six activities (WALKING,...
time-series, multivariate, classification, clusteringThe Daphnet Freezing of Gait Dataset is a dataset devised to benchmark automatic methods to recognize gait freeze from wearable acceleration sensors pl...
time-series, multivariate, classificationThe OPPORTUNITY Dataset for Human Activity Recognition from Wearable, Object, and Ambient Sensors is a dataset devised to benchmark human activity recog...
time-series, multivariate, classificationThe datas time period is between Jan 1st, 2010 to Dec 31st, 2014. Missing data are denoted as NA.
regression, time-series, multivariateThe experiments were carried out with a group of 30 volunteers within an age bracket of 19-48 years. They performed a protocol of activities composed of...
time-series, multivariate, classificationThis is a transnational data set which contains all the transactions occurring between 01/12/2010 and 09/12/2011 for a UK-based and registered non-store...
classification, clustering, sequential, time-series, multivariateIn predicting stock prices you collect data over some period of time - day, week, month, etc. But you cannot take advantage of data from a time period u...
time-series, classification, clusteringAll data is from one continuous EEG measurement with the Emotiv EEG Neuroheadset. The duration of the measurement was 117 seconds. The eye state was det...
time-series, multivariate, classification, sequentialThe dataset contains 9358 instances of hourly averaged responses from an array of 5 metal oxide chemical sensors embedded in an Air Quality Chemical Mul...
regression, time-series, multivariateThis archive contains 2075259 measurements gathered between December 2006 and November 2010 (47 months). Notes: 1.(global_active_power*1000/60 - sub_me...
regression, time-series, multivariate, clusteringOpen University Learning Analytics Dataset (OULAD) contains data about courses, students and their interactions with Virtual Learning Environment (VLE) ...
classification, clustering, sequential, regression, time-series, multivariateThis dataset has recordings of a gas sensor array composed of 8 MOX gas sensors, and a temperature and humidity sensor. This sensor array was exposed to...
time-series, multivariate, classification2. Information database: 2.1. Protocol: 22 male subjects , 11 with different knee abnormalities previously diagnosed by a professional. They undergo th...
time-series, multivariateDataset from 8800(10 digits x 10 repetitions x 88 speakers) time series of 13 Frequency Cepstral Coefficients (MFCCs) had taken from 44 males and 44 fem...
time-series, multivariate, classificationNumber of instances: 18000 times-series measurements recorded from a 72 metal-oxide gas sensor array-based chemical detection platform. Number of attri...
time-series, multivariate, classificationThis dataset contains 600 examples of control charts synthetically generated by the process in Alcock and Manolopoulos (1999). There are six different c...
time-series, classification, clusteringBrief Description of the Dataset: --------------------------------- Each of the 19 activities is performed by eight subjects (4 female, 4 male, between ...
time-series, multivariate, classification, clusteringThe PAMAP2 Physical Activity Monitoring dataset contains data of 18 different physical activities (such as walking, cycling, playing soccer, etc.), perf...
time-series, multivariate, classificationA chemical detection platform composed of 8 chemo-resistive gas sensors was exposed to turbulent gas mixtures generated naturally in a wind tunnel. The ...
regression, time-series, multivariate, classificationThe Heterogeneity Dataset for Human Activity Recognition from Smartphone and Smartwatch sensors consists of two datasets devised to investigate sensor h...
time-series, multivariate, classification, clusteringThis dataset is an addition to the dataset at [Web Link] We collected more dataset to improve the accuracy of our HAR algorithms applied in ...
time-series, classificationThis data set contains 13,910 measurements from 16 chemical sensors exposed to 6 gases at different concentration levels. This dataset is an extension o...
causa, classification, clustering, regression, time-series, multivariateWe have downloaded 15 months worth of daily data from the California Department of Transportation PEMS website, [Web Link], The data describes the occup...
time-series, multivariate, classificationThe data can be used to try to predict student learning in SE teamwork based on observation of their team activity **** README FILE from the submitted...
time-series, classification, sequentialUncompressing the archive url_svmlight.tar.gz will yield a directory url_svmlight/ containing the following files: * FeatureTypes --- A text file li...
time-series, multivariate, classificationThe data was collected for examining our newly developed classifier for multidimensional curves (multidimensional time series). Nine male speakers utter...
time-series, multivariate, classificationIndoor localisation is a key topic for the Ambient Intelligence (AmI) research community. In this scenarios, recent advancements in wearable technolog...
classification, clustering, sequential, regression, time-series, multivariateData set has no missing values. Values are in kW of each 15 min. To convert values in kWh values must be divided by 4. Each column represent one client....
regression, time-series, clusteringEEG record contains many regular oscillations, which are believed to reflect synchronized rhythmic activity in a group of neurons. Most activity related...
univariate, classificationFor a list of attributes, please refer to those two .names files. They use the following naming convention: All the attribute start with T means the t...
time-series, multivariate, classification, sequentialThis dataset comprises information regarding the ADLs performed by two users on a daily basis in their own homes. This dataset is composed by two insta...
classification, clustering, sequential, time-series, multivariate1. Title of Database: Machine Learning based ZZAlpha Stock Recommendations 2. Sources: (a) Original owners of data: ZZAlpha Ltd., 4729 E. Sunrise #10...
time-series, classification, sequentialIndoor localization is a key topic for mobile computing. However, it is still very difficult for the mobile sensing community to compare state-of-art In...
classification, clustering, sequential, regression, time-series, multivariateThe measured data was collected using a chemical sensing system based on an array of 16 metal-oxide gas sensors and an external mechanical ventilator to...
regression, time-series, multivariate, classificationFor complete information see the official challenge page: [Web Link]
clustering, sequential, causal-discovery, time-series, multivariate, domain-theoryKEGG Metabolic pathways can be realized into network. Two kinds of network / graph can be formed. These include Reaction Network and Relation Network. I...
univariate, classification, clustering, regression, multivariate, textPlease find the original data at '[Web Link]'
time-series, multivariate, classification, clustering52 columns for 52 weeks; normalised values of provided too.
time-series, multivariate, clusteringPart of the problem in using an automated program to discover the unknown target function is to decide how to encode names such that the program can be ...
text, univariate, classificationThis dataset includes the recordings of five replicates of an 8-sensor array. Each unit holds 8 MOX sensors and integrates custom-designed electronics f...
classification, regression, time-series, multivariate, domain-theoryThe characters here were used for a PhD study on primitive extraction using HMM based models. The data consists of 2858 character samples, contained in ...
time-series, classification, clusteringThe measurements were created to ease the development, comparison and evaluation of fingerprinting based hybrid indoor positioning methods. The measurem...
time-series, multivariate, classification, sequentialThe source of the data is the raw measurements from a Nintendo PowerGlove. It was interfaced through a PowerGlove Serial Interface to a Silicon Graphics...
time-series, multivariate, classificationThis is a sparse data set, less than 10% of the attributes are used for each sample. The link is to a '*.tgz' file which contains two files: [amzn-anon-...
clustering, causal-discovery, regression, time-series, domain-theoryThe data set gathered when we were working at project for Bahrain university between 2002 and 2003.
domain-theory, univariate, classification, clusteringData was captured using a setup that consisted of: - Two Fifth Dimension Technologies (5DT) gloves, one right and one left - Two Ascension Flock-of-B...
time-series, multivariate, classificationThis data set contains time series of greenhouse gas (GHG) concentrations at 2921 grid cells in California created using simulations of the Weather Rese...
regression, time-series, multivariateThis dataset represents a real-life benchmark in the area of Ambient Assisted Living applications, as described in [1]. The binary classification task c...
time-series, multivariate, classification, sequentialObservations come from 2 data streams (people flow in and out of the building), over 15 weeks, 48 time slices per day (half hour count aggregates). T...
time-series, multivariateThe time period is between Jan 1st, 2010 to Dec 31st, 2015. Missing data are denoted as NA.
regression, time-series, multivariateThe experiments have been carried out with a group of 115 students of first-year, undergraduate Engineering major of the University of Genoa. We carri...
classification, clustering, sequential, regression, time-series, multivariateKEGG Metabolic pathways can be realized into network. Two kinds of network / graph can be formed. These include Reaction Network and Relation Network. I...
univariate, classification, clustering, regression, multivariate, textInstrumentation: The data were collected at a sampling rate of 500 Hz, using as a programming kernel the National Instruments (NI) Labview. The signals ...
time-series, classificationNorthix is designed to be a schema matching benchmark problem for data integration of two entity relationship databases. Northix is the resulting schema...
multivariate, text, univariate, classificationThe dataset has been enriched during the Nomao Challenge: [Web Link] organized along with the ALRA workshop (Active Learning in Real-world Applications)...
univariate, classificationThe data set is at 10 min for about 4.5 months. The house temperature and humidity conditions were monitored with a ZigBee wireless sensor network. Each...
regression, time-series, multivariateThe skin dataset is collected by randomly sampling B,G,R values from face images of various age groups (young, middle, and old), race groups (white, bla...
univariate, classificationThe Dataset for ADL Recognition with Wrist-worn Accelerometer is a public collection of labelled accelerometer data recordings to be used for the creati...
time-series, multivariate, classification, clusteringThe REALDISP (REAListic sensor DISPlacement) dataset has been originally collected to investigate the effects of sensor displacement in the activity rec...
time-series, multivariate, classificationThe data were collected over a series of specifically designed trials. Our hope was to cover most of the types of sensory interactions that a Pioneer mi...
time-series, multivariate