Pseudo Periodic Synthetic Time Series

Others Dataset

Homepage

http://archive.ics.uci.edu/ml/datasets/Pseudo+Periodic+Synthetic+Time+Series

Description

This data set is designed for testing indexing schemes in time series databases. It is a much larger dataset than has been used in any published study (That we are currently aware of). It contains one million data points. The data has been split into 10 sections to facilitate testing (see below). We recommend building the index with 9 of the 100,000-datapoint sections, and randomly extracting a query shape from the 10th section. (Some previously published work seems to have used queries that were also used to build the indexing structure. This will produce optimistic results) The data are interesting because they have structure at different resolutions. Each of the 10 sections where generated by independent invocations of the function: (see equation.gif) Where rand(x) produces a random integer between zero and x. The data appears highly periodic, but never exactly repeats itself. This feature is designed to challenge the indexing structure. The time series are ploted here: (ts1-5.gif), (ts6-10.gif)

Discussion

Related datasets

Localization Data for Perso...

People used for recording of the data were wearing four tags (ankle left, ankle right, belt and chest). Each instance is a localization data for one of...

time-series, univariate, classification, sequential

Life Sciences

Bach Chorales

Sequential (time-series) domain. Single-line melodies of 100 Bach chorales (originally 4 voices). The melody line can be studied independently of othe...

time-series, univariate

Others

Activity Recognition from S...

--- The dataset collects data from a wearable accelerometer mounted on the chest --- Sampling frequency of the accelerometer: 52 Hz --- Acceler...

univariate, classification, clustering, sequential, time-series

Others

ISTANBUL STOCK EXCHANGE

Data is collected from imkb.gov.tr and finance.yahoo.com. Data is organized with regard to working days in Istanbul Stock Exchange.

univariate, classification, regression, time-series, multivariate

Business

User Identification From Wa...

The dataset collects data from an Android smartphone positioned in the chest pocket. Accelerometer Data are collected from 22 participants walking in th...

univariate, classification, clustering, sequential, time-series

Others

SML2010

The dataset could contain missing values. The data was sampled every minute, computing and uploading it smoothed with 15 minute means. The header of the...

text, sequential, regression, time-series, multivariate

Computer Science

Gesture Phase Segmentation

The dataset is composed by features extracted from 7 videos with people gesticulating, aiming at studying Gesture Phase Segmentation. Each video is repr...

classification, clustering, sequential, time-series, multivariate

Others

Occupancy Detection

Three data sets are submitted, for training and testing. Ground-truth occupancy was obtained from time stamped pictures that were taken every minute. Fo...

time-series, multivariate, classification

Computer Science

Bike Sharing Dataset

Bike sharing systems are new generation of traditional bike rentals where whole process from membership, rental and return back has become automatic. Th...

regression, univariate

Social Sciences

ICS-FORTH MHAD101 Action Co...

This is a custom generated dataset designed for the task of action co-segmentation in pairs of action sequences. The dataset contains 101 pairs of ac...

time-series, temporal segmentation, action cosegmentation, motion-capture-data

Vision

Robot Execution Failures

The donation includes 5 datasets, each of them defining a different learning problem: * LP1: failures in approach to grasp position * LP2: fail...

time-series, multivariate, classification

Physical Systems

microblogPCU

Our dataset is used by us to explore spammers in microblog and you can access our demo system at [Web Link]Please add :8080 after the domain name as por...

univariate, classification, sequential, causal-discovery, multivariate, text

Computer Science

Vicon Physical Action Data Set

1. Protocol: Seven male and three female subjects (age 25 to 30), who have experienced aggression in scenarios such as physical fighting, took par...

time-series, classification

Physical Systems

FMA: A Dataset For Music An...

* Audio track (encoded as mp3) of each of the 106,574 tracks. It is on average 10 millions samples per track.* Nine audio features (consisting of 518 at...

time-series, multivariate, classification, clustering

Computer Science

ICU

Please see documentation

time-series, multivariate

Life Sciences

EEG Database

This data arises from a large study to examine EEG correlates of genetic predisposition to alcoholism. It contains measurements from 64 electrodes place...

time-series, multivariate

Life Sciences

MHEALTH Dataset

The MHEALTH (Mobile HEALTH) dataset comprises body motion and vital signs recordings for ten volunteers of diverse profile while performing several phys...

time-series, multivariate, classification

Computer Science

Activity Recognition system...

This dataset represents a real-life benchmark in the area of Activity Recognition applications, as described in [1]. The classification tasks consist ...

time-series, multivariate, classification, sequential

Computer Science

Diabetes

Diabetes patient records were obtained from two sources: an automatic electronic recording device and paper records. The automatic device had an inter...

time-series, multivariate

Life Sciences

Dodgers Loop Sensor

This loop sensor data was collected for the Glendale on ramp for the 101 North freeway in Los Angeles. It is close enough to the stadium to see unusual...

time-series, multivariate

Others

Gas sensor array under dyna...

This data set contains the acquired time series from 16 chemical sensors exposed to gas mixtures at varying concentration levels. In particular, we gene...

regression, time-series, multivariate, classification

Computer Science

EMG Physical Action Data Set

1. Protocol: Three male and one female subjects (age 25 to 30), who have experienced aggression in scenarios such as physical fighting, took part ...

time-series, classification

Physical Systems

Human Activity Recognition ...

The experiments have been carried out with a group of 30 volunteers within an age bracket of 19-48 years. Each person performed six activities (WALKING,...

time-series, multivariate, classification, clustering

Computer Science

Daphnet Freezing of Gait

The Daphnet Freezing of Gait Dataset is a dataset devised to benchmark automatic methods to recognize gait freeze from wearable acceleration sensors pl...

time-series, multivariate, classification

Life Sciences

OPPORTUNITY Activity Recogn...

The OPPORTUNITY Dataset for Human Activity Recognition from Wearable, Object, and Ambient Sensors is a dataset devised to benchmark human activity recog...

time-series, multivariate, classification

Computer Science

Beijing PM2.5 Data

The datas time period is between Jan 1st, 2010 to Dec 31st, 2014. Missing data are denoted as NA.

regression, time-series, multivariate

Physical Systems

Smartphone-Based Recognitio...

The experiments were carried out with a group of 30 volunteers within an age bracket of 19-48 years. They performed a protocol of activities composed of...

time-series, multivariate, classification

Life Sciences

Online Retail

This is a transnational data set which contains all the transactions occurring between 01/12/2010 and 09/12/2011 for a UK-based and registered non-store...

classification, clustering, sequential, time-series, multivariate

Business

Dow Jones Index

In predicting stock prices you collect data over some period of time - day, week, month, etc. But you cannot take advantage of data from a time period u...

time-series, classification, clustering

Business

EEG Eye State

All data is from one continuous EEG measurement with the Emotiv EEG Neuroheadset. The duration of the measurement was 117 seconds. The eye state was det...

time-series, multivariate, classification, sequential

Life Sciences

Air Quality

The dataset contains 9358 instances of hourly averaged responses from an array of 5 metal oxide chemical sensors embedded in an Air Quality Chemical Mul...

regression, time-series, multivariate

Computer Science

Buzz in social media

Please see [Web Link]

regression, time-series, multivariate, classification

Computer Science

Individual household electr...

This archive contains 2075259 measurements gathered between December 2006 and November 2010 (47 months). Notes: 1.(global_active_power*1000/60 - sub_me...

regression, time-series, multivariate, clustering

Physical Systems

Open University Learning An...

Open University Learning Analytics Dataset (OULAD) contains data about courses, students and their interactions with Virtual Learning Environment (VLE) ...

classification, clustering, sequential, regression, time-series, multivariate

Computer Science

Gas sensors for home activi...

This dataset has recordings of a gas sensor array composed of 8 MOX gas sensors, and a temperature and humidity sensor. This sensor array was exposed to...

time-series, multivariate, classification

Computer Science

EMG dataset in Lower Limb

2. Information database: 2.1. Protocol: 22 male subjects , 11 with different knee abnormalities previously diagnosed by a professional. They undergo th...

time-series, multivariate

Computer Science

Spoken Arabic Digit

Dataset from 8800(10 digits x 10 repetitions x 88 speakers) time series of 13 Frequency Cepstral Coefficients (MFCCs) had taken from 44 males and 44 fem...

time-series, multivariate, classification

Others

Gas sensor arrays in open s...

Number of instances: 18000 times-series measurements recorded from a 72 metal-oxide gas sensor array-based chemical detection platform. Number of attri...

time-series, multivariate, classification

Computer Science

Synthetic Control Chart Tim...

This dataset contains 600 examples of control charts synthetically generated by the process in Alcock and Manolopoulos (1999). There are six different c...

time-series, classification, clustering

Others

Predict keywords activities...

See files and/or [Web Link]

time-series, multivariate, sequential

Computer Science

Daily and Sports Activities

Brief Description of the Dataset: --------------------------------- Each of the 19 activities is performed by eight subjects (4 female, 4 male, between ...

time-series, multivariate, classification, clustering

Computer Science

PAMAP2 Physical Activity Mo...

The PAMAP2 Physical Activity Monitoring dataset contains data of 18 different physical activities (such as walking, cycling, playing soccer, etc.), perf...

time-series, multivariate, classification

Computer Science

Gas sensor array exposed to...

A chemical detection platform composed of 8 chemo-resistive gas sensors was exposed to turbulent gas mixtures generated naturally in a wind tunnel. The ...

regression, time-series, multivariate, classification

Computer Science

Heterogeneity Activity Reco...

The Heterogeneity Dataset for Human Activity Recognition from Smartphone and Smartwatch sensors consists of two datasets devised to investigate sensor h...

time-series, multivariate, classification, clustering

Computer Science

Smartphone Dataset for Huma...

This dataset is an addition to the dataset at [Web Link] We collected more dataset to improve the accuracy of our HAR algorithms applied in ...

time-series, classification

Computer Science

Gas Sensor Array Drift Data...

This data set contains 13,910 measurements from 16 chemical sensors exposed to 6 gases at different concentration levels. This dataset is an extension o...

causa, classification, clustering, regression, time-series, multivariate

Computer Science

PEMS-SF

We have downloaded 15 months worth of daily data from the California Department of Transportation PEMS website, [Web Link], The data describes the occup...

time-series, multivariate, classification

Computer Science

Data for Software Engineeri...

The data can be used to try to predict student learning in SE teamwork based on observation of their team activity **** README FILE from the submitted...

time-series, classification, sequential

Computer Science

URL Reputation

Uncompressing the archive url_svmlight.tar.gz will yield a directory url_svmlight/ containing the following files: * FeatureTypes --- A text file li...

time-series, multivariate, classification

Computer Science

Japanese Vowels

The data was collected for examining our newly developed classifier for multidimensional curves (multidimensional time series). Nine male speakers utter...

time-series, multivariate, classification

Others

Geo-Magnetic field and WLAN...

Indoor localisation is a key topic for the Ambient Intelligence (AmI) research community. In this scenarios, recent advancements in wearable technolog...

classification, clustering, sequential, regression, time-series, multivariate

Computer Science

ElectricityLoadDiagrams2011...

Data set has no missing values. Values are in kW of each 15 min. To convert values in kWh values must be divided by 4. Each column represent one client....

regression, time-series, clustering

Computer Science

Planning Relax

EEG record contains many regular oscillations, which are believed to reflect synchronized rhythmic activity in a group of neurons. Most activity related...

univariate, classification

Computer Science

Ozone Level Detection

For a list of attributes, please refer to those two .names files. They use the following naming convention: All the attribute start with T means the t...

time-series, multivariate, classification, sequential

Physical Systems

Activities of Daily Living ...

This dataset comprises information regarding the ADLs performed by two users on a daily basis in their own homes. This dataset is composed by two insta...

classification, clustering, sequential, time-series, multivariate

Computer Science

Machine Learning based ZZAl...

1. Title of Database: Machine Learning based ZZAlpha Stock Recommendations 2. Sources: (a) Original owners of data: ZZAlpha Ltd., 4729 E. Sunrise #10...

time-series, classification, sequential

Business

UJIIndoorLoc-Mag

Indoor localization is a key topic for mobile computing. However, it is still very difficult for the mobile sensing community to compare state-of-art In...

classification, clustering, sequential, regression, time-series, multivariate

Computer Science

Gas sensor array under flow...

The measured data was collected using a chemical sensing system based on an array of 16 metal-oxide gas sensors and an external mechanical ventilator to...

regression, time-series, multivariate, classification

Computer Science

Taxi Service Trajectory - P...

For complete information see the official challenge page: [Web Link]

clustering, sequential, causal-discovery, time-series, multivariate, domain-theory

Computer Science

KEGG Metabolic Reaction Net...

KEGG Metabolic pathways can be realized into network. Two kinds of network / graph can be formed. These include Reaction Network and Relation Network. I...

univariate, classification, clustering, regression, multivariate, text

Life Sciences

Epileptic Seizure Recognition

Please find the original data at '[Web Link]'

time-series, multivariate, classification, clustering

Life Sciences

Sales_Transactions_Dataset_...

52 columns for 52 weeks; normalised values of provided too.

time-series, multivariate, clustering

Others

Badges

Part of the problem in using an automated program to discover the unknown target function is to decide how to encode names such that the program can be ...

text, univariate, classification

Others

Twin gas sensor arrays

This dataset includes the recordings of five replicates of an 8-sensor array. Each unit holds 8 MOX sensors and integrates custom-designed electronics f...

classification, regression, time-series, multivariate, domain-theory

Computer Science

Character Trajectories

The characters here were used for a PhD study on primitive extraction using HMM based models. The data consists of 2858 character samples, contained in ...

time-series, classification, clustering

Computer Science

Hybrid Indoor Positioning D...

The measurements were created to ease the development, comparison and evaluation of fingerprinting based hybrid indoor positioning methods. The measurem...

time-series, multivariate, classification, sequential

Computer Science

Australian Sign Language signs

The source of the data is the raw measurements from a Nintendo PowerGlove. It was interfaced through a PowerGlove Serial Interface to a Silicon Graphics...

time-series, multivariate, classification

Others

Amazon Access Samples

This is a sparse data set, less than 10% of the attributes are used for each sample. The link is to a '*.tgz' file which contains two files: [amzn-anon-...

clustering, causal-discovery, regression, time-series, domain-theory

Business

Perfume Data

The data set gathered when we were working at project for Bahrain university between 2002 and 2003.

domain-theory, univariate, classification, clustering

Computer Science

Australian Sign Language si...

Data was captured using a setup that consisted of: - Two Fifth Dimension Technologies (5DT) gloves, one right and one left - Two Ascension Flock-of-B...

time-series, multivariate, classification

Others

Greenhouse Gas Observing Ne...

This data set contains time series of greenhouse gas (GHG) concentrations at 2921 grid cells in California created using simulations of the Weather Rese...

regression, time-series, multivariate

Physical Systems

Indoor User Movement Predic...

This dataset represents a real-life benchmark in the area of Ambient Assisted Living applications, as described in [1]. The binary classification task c...

time-series, multivariate, classification, sequential

Computer Science

CalIt2 Building People Counts

Observations come from 2 data streams (people flow in and out of the building), over 15 weeks, 48 time slices per day (half hour count aggregates). T...

time-series, multivariate

Others

PM2.5 Data of Five Chinese ...

The time period is between Jan 1st, 2010 to Dec 31st, 2015. Missing data are denoted as NA.

regression, time-series, multivariate

Physical Systems

Educational Process Mining ...

The experiments have been carried out with a group of 115 students of first-year, undergraduate Engineering major of the University of Genoa. We carri...

classification, clustering, sequential, regression, time-series, multivariate

Computer Science

KEGG Metabolic Relation Net...

KEGG Metabolic pathways can be realized into network. Two kinds of network / graph can be formed. These include Reaction Network and Relation Network. I...

univariate, classification, clustering, regression, multivariate, text

Life Sciences

sEMG for Basic Hand movements

Instrumentation: The data were collected at a sampling rate of 500 Hz, using as a programming kernel the National Instruments (NI) Labview. The signals ...

time-series, classification

Life Sciences

Northix

Northix is designed to be a schema matching benchmark problem for data integration of two entity relationship databases. Northix is the resulting schema...

multivariate, text, univariate, classification

Computer Science

Nomao

The dataset has been enriched during the Nomao Challenge: [Web Link] organized along with the ALRA workshop (Active Learning in Real-world Applications)...

univariate, classification

Computer Science

Appliances energy prediction

The data set is at 10 min for about 4.5 months. The house temperature and humidity conditions were monitored with a ZigBee wireless sensor network. Each...

regression, time-series, multivariate

Computer Science

Skin Segmentation

The skin dataset is collected by randomly sampling B,G,R values from face images of various age groups (young, middle, and old), race groups (white, bla...

univariate, classification

Computer Science

Dataset for ADL Recognition...

The Dataset for ADL Recognition with Wrist-worn Accelerometer is a public collection of labelled accelerometer data recordings to be used for the creati...

time-series, multivariate, classification, clustering

Computer Science

REALDISP Activity Recogniti...

The REALDISP (REAListic sensor DISPlacement) dataset has been originally collected to investigate the effects of sensor displacement in the activity rec...

time-series, multivariate, classification

Computer Science

Pioneer-1 Mobile Robot Data

The data were collected over a series of specifically designed trials. Our hope was to cover most of the types of sensory interactions that a Pioneer mi...

time-series, multivariate

Computer Science