Bike sharing systems are new generation of traditional bike rentals where whole process from membership, rental and return back has become automatic. Through these systems, user is able to easily rent a bike from a particular position and return back at another position. Currently, there are about over 500 bike-sharing programs around the world which is composed of over 500 thousands bicycles. Today, there exists great interest in these systems due to their important role in traffic, environmental and health issues. Apart from interesting real world applications of bike sharing systems, the characteristics of data being generated by these systems make them attractive for the research. Opposed to other transport services such as bus or subway, the duration of travel, departure and arrival position is explicitly recorded in these systems. This feature turns bike sharing system into a virtual sensor network that can be used for sensing mobility in the city. Hence, it is expected that most of important events in the city could be detected via monitoring these data.
Data is collected from imkb.gov.tr and finance.yahoo.com. Data is organized with regard to working days in Istanbul Stock Exchange.
univariate, classification, regression, time-series, multivariateKEGG Metabolic pathways can be realized into network. Two kinds of network / graph can be formed. These include Reaction Network and Relation Network. I...
univariate, classification, clustering, regression, multivariate, textKEGG Metabolic pathways can be realized into network. Two kinds of network / graph can be formed. These include Reaction Network and Relation Network. I...
univariate, classification, clustering, regression, multivariate, textInformation about customers consists of 86 variables and includes product usage data and socio-demographic data derived from zip area codes. The data wa...
regression, multivariate, descriptionThe dataset could contain missing values. The data was sampled every minute, computing and uploading it smoothed with 15 minute means. The header of the...
text, sequential, regression, time-series, multivariateA description of the underlying Cargo 2000 standard and the processes reflected in the data set can be found at [Web Link].
regression, multivariate, classification, sequentialOur dataset is used by us to explore spammers in microblog and you can access our demo system at [Web Link]Please add :8080 after the domain name as por...
univariate, classification, sequential, causal-discovery, multivariate, textYou should respect the following train / test split: train: first 463,715 examples test: last 51,630 examples It avoids the 'producer effect' by making ...
regression, multivariateThe dataset contains 9568 data points collected from a Combined Cycle Power Plant over 6 years (2006-2011), when the power plant was set to work with fu...
regression, multivariateThe DrivFace database contains images sequences of subjects while driving in real scenarios. It is composed of 606 samples of 640480 pixels each, acquir...
regression, multivariate, classification, clusteringThe PD and control handwriting database consists of 62 PWP (People with parkinson) and 15 healthy individuals who appealed at the Department of Neurolog...
regression, multivariate, classification, clusteringNotes: -- The database contains 3 potential classes, one for the number of times a certain type of solar flare occured in a 24 hour period. -- Ea...
regression, multivariateThere are two databases: (both use the same set of 5 attributes): 1. Primary o-ring erosion and/or blowby 2. Primary o-ring erosion only The two databa...
regression, multivariateCollect the real time readings for residential,commercial,industrial,agriculure,to find the accuracy consumption in Tamil Nadu Around Thanajvur
regression, multivariate, classification, clusteringThis data set contains the acquired time series from 16 chemical sensors exposed to gas mixtures at varying concentration levels. In particular, we gene...
regression, time-series, multivariate, classificationThe dataset was built from a personal collection of 1059 tracks covering 33 countries/area. The music used is traditional, ethnic or `world' only, as c...
regression, multivariate, classificationThis dataset was constructed by adding elevation information to a 2D road network in North Jutland, Denmark (covering a region of 185 x 135 km^2). Eleva...
regression, text, clustering, sequentialThis data set is designed for testing indexing schemes in time series databases. It is a much larger dataset than has been used in any published study (...
time-series, univariateThe datas time period is between Jan 1st, 2010 to Dec 31st, 2014. Missing data are denoted as NA.
regression, time-series, multivariateAll the 504 reviews were collected between January and August of 2015.
regression, classification--- The dataset collects data from a wearable accelerometer mounted on the chest --- Sampling frequency of the accelerometer: 52 Hz --- Acceler...
univariate, classification, clustering, sequential, time-seriesThe main goal of this data set is providing clean and valid signals for designing cuff-less blood pressure estimation algorithms. The raw electrocardiog...
regression, multivariate, classificationThe experiments have been carried out by means of a numerical simulator of a naval vessel (Frigate) characterized by a Gas Turbine (GT) propulsion plant...
regression, multivariateThe estimated relative performance values were estimated by the authors using a linear regression method. See their article (pp 308-313) for more detai...
regression, multivariateThe source datasets needed to be combined via programming. Many variables are included so that algorithms that select or learn weights for attributes co...
regression, multivariate* The articles were published by Mashable (www.mashable.com) and their content as the rights to reproduce it belongs to them. Hence, this dataset does n...
regression, multivariate, classificationThe dataset contains 9358 instances of hourly averaged responses from an array of 5 metal oxide chemical sensors embedded in an Air Quality Chemical Mul...
regression, time-series, multivariateThis archive contains 2075259 measurements gathered between December 2006 and November 2010 (47 months). Notes: 1.(global_active_power*1000/60 - sub_me...
regression, time-series, multivariate, clusteringOpen University Learning Analytics Dataset (OULAD) contains data about courses, students and their interactions with Virtual Learning Environment (VLE) ...
classification, clustering, sequential, regression, time-series, multivariatePrediction of residuary resistance of sailing yachts at the initial design stage is of a great value for evaluating the ships performance and for estima...
regression, multivariatePeople used for recording of the data were wearing four tags (ankle left, ankle right, belt and chest). Each instance is a localization data for one of...
time-series, univariate, classification, sequentialThe Dataset is uploaded in ZIP format. The dataset contains 5 variants of the dataset, for the details about the variants and detailed analysis read and...
regression, multivariateThis data set consists of three types of entities: (a) the specification of an auto in terms of various characteristics, (b) its assigned insurance risk...
regression, multivariateThe data is related to posts' published during the year of 2014 on the Facebook's page of a renowned cosmetics brand. This dataset contains 500 of the 7...
regression, multivariateThis dataset is composed of a range of biomedical voice measurements from 42 people with early-stage Parkinson's disease recruited to a six-month trial ...
regression, multivariateWe present a dataset to address the problem of visual privacy - where users unintentionally leak private information when sharing personal images online...
multilabel, privacy, classification, flickr, scene, regressionA chemical detection platform composed of 8 chemo-resistive gas sensors was exposed to turbulent gas mixtures generated naturally in a wind tunnel. The ...
regression, time-series, multivariate, classificationThe dataset is composed by two tables. The first table go_track_tracks presents general attributes and each instance has one trajectory that is represen...
regression, multivariate, classificationThis data set contains 13,910 measurements from 16 chemical sensors exposed to 6 gases at different concentration levels. This dataset is an extension o...
causa, classification, clustering, regression, time-series, multivariateSequential (time-series) domain. Single-line melodies of 100 Bach chorales (originally 4 voices). The melody line can be studied independently of othe...
time-series, univariateThis dataset is a slightly modified version of the dataset provided in the StatLib library. In line with the use by Ross Quinlan (1993) in predicting t...
regression, multivariateIndoor localisation is a key topic for the Ambient Intelligence (AmI) research community. In this scenarios, recent advancements in wearable technolog...
classification, clustering, sequential, regression, time-series, multivariateNumber of instances 1030 Number of Attributes 9 Attribute breakdown 8 quantitative input variables, and 1 quantitative output variable Missing Attribut...
regression, multivariateAIMS AND PURPOSES This corpus is intended to do cleaning (or binarization) and enhancement of noisy grayscale printed text images using supervised lear...
regression, multivariate, classificationData set has no missing values. Values are in kW of each 15 min. To convert values in kWh values must be divided by 4. Each column represent one client....
regression, time-series, clusteringThis data approach student achievement in secondary education of two Portuguese schools. The data attributes include student grades, demographic, social...
regression, multivariate, classificationEEG record contains many regular oscillations, which are believed to reflect synchronized rhythmic activity in a group of neurons. Most activity related...
univariate, classificationIndoor localization is a key topic for mobile computing. However, it is still very difficult for the mobile sensing community to compare state-of-art In...
classification, clustering, sequential, regression, time-series, multivariateIn [Cortez and Morais, 2007], the output 'area' was first transformed with a ln(x+1) function. Then, several Data Mining methods were applied. After ...
regression, multivariateThe measured data was collected using a chemical sensing system based on an array of 16 metal-oxide gas sensors and an external mechanical ventilator to...
regression, time-series, multivariate, classificationRoss Quinlan: This data was given to me by Karl Ulrich at MIT in 1986. I didn't record his description at the time, but here's his subsequent (1992) r...
regression, multivariateProvide all relevant information about your data set.
regression, multivariateProvide all relevant information about your data set.
regression, multivariate, classificationPart of the problem in using an automated program to discover the unknown target function is to decide how to encode names such that the program can be ...
text, univariate, classificationThe dataset collects data from an Android smartphone positioned in the chest pocket. Accelerometer Data are collected from 22 participants walking in th...
univariate, classification, clustering, sequential, time-seriesThis dataset includes the recordings of five replicates of an 8-sensor array. Each unit holds 8 MOX sensors and integrates custom-designed electronics f...
classification, regression, time-series, multivariate, domain-theoryThe most important feature of this dataset is its simplicity to use and its being well-documented, which can be widely used in various studies of text a...
regression, multivariate, text, classificationEach record represents follow-up data for one breast cancer case. These are consecutive patients seen by Dr. Wolberg since 1984, and include only those...
regression, multivariate, classificationThe PD database consists of training and test files. The training data belongs to 20 PWP (6 female, 14 male) and 20 healthy individuals (10 female, 10 m...
regression, multivariate, classificationThis data originates from blog posts. The raw HTML-documents of the blog posts were crawled and processed. The prediction task associated with the dat...
regression, multivariateMany real world applications need to know the localization of a user in the world to provide their services. Therefore, automatic user localization has ...
regression, multivariate, classificationThe data set includes 103 data points. There are 7 input variables, and 3 output variables in the data set. The initial data set included 78 data. After...
regression, multivariateThis is a sparse data set, less than 10% of the attributes are used for each sample. The link is to a '*.tgz' file which contains two files: [amzn-anon-...
clustering, causal-discovery, regression, time-series, domain-theoryThe data was retrieved from a set of 53500 CT images from 74 different patients (43 male, 31 female). Each CT slice is described by two histograms in p...
regression, domain-theoryThe NASA data set comprises different size NACA 0012 airfoils at various wind tunnel speeds and angles of attack. The span of the airfoil and the observ...
regression, multivariateOngoing research on university faculty perceptions and practices of using Wikipedia as a teaching resource. Based on a Technology Acceptance Model, the ...
regression, multivariate, clustering, causal-discoveryThe data set gathered when we were working at project for Bahrain university between 2002 and 2003.
domain-theory, univariate, classification, clusteringThis data set contains time series of greenhouse gas (GHG) concentrations at 2921 grid cells in California created using simulations of the Weather Rese...
regression, time-series, multivariateThere are three disadvantages of weighted scoring stock selection models. First, they cannot identify the relations between weights of stock-picking con...
regression, multivariateMany variables are included so that algorithms that select or learn weights for attributes could be tested. However, clearly unrelated attributes were...
regression, multivariate-- We aggregated screen movements into screen-fixations using a Salvucci & Goldberg (2000) dispersion-threshold algorithm, and defined Perception Act...
regression, multivariateThe two datasets are related to red and white variants of the Portuguese "Vinho Verde" wine. For more details, consult: [Web Link] or the reference [Cor...
regression, multivariate, classificationThe presented dataset is composed of two tsv files named 'youtube_videos.tsv' and 'transcoding_mesurment.tsv'. The first contains 10 columns of fundame...
regression, multivariateThe time period is between Jan 1st, 2010 to Dec 31st, 2015. Missing data are denoted as NA.
regression, time-series, multivariateThe experiments have been carried out with a group of 115 students of first-year, undergraduate Engineering major of the University of Genoa. We carri...
classification, clustering, sequential, regression, time-series, multivariateNorthix is designed to be a schema matching benchmark problem for data integration of two entity relationship databases. Northix is the resulting schema...
multivariate, text, univariate, classificationThe dataset has been enriched during the Nomao Challenge: [Web Link] organized along with the ALRA workshop (Active Learning in Real-world Applications)...
univariate, classificationThe data set is at 10 min for about 4.5 months. The house temperature and humidity conditions were monitored with a ZigBee wireless sensor network. Each...
regression, time-series, multivariateWe perform energy analysis using 12 different building shapes simulated in Ecotect. The buildings differ with respect to the glazing area, the glazing a...
regression, multivariate, classificationThe skin dataset is collected by randomly sampling B,G,R values from face images of various age groups (young, middle, and old), race groups (white, bla...
univariate, classification