Dataset Category

YearPredictionMSD

You should respect the following train / test split: train: first 463,715 examples test: last 51,630 examples It avoids the 'producer effect' by making su…

multivariate, regression

Others

Record Linkage Comparison Pat…

The records represent individual data including first and family name, sex, date of birth and postal code, which were collected through iterative insertio…

classification, multivariate

Others

Vertebral Column

Biomedical data set built by Dr. Henrique da Mota during a medical residence period in the Group of Applied Research in Orthopaedics (GARO) of the Centre …

classification, multivariate

Others

QtyT40I10D100K

This data set is generated from the original T40I10D100K data set, to mine fuzzy sequential patterns over quantitative streams. While the original T40I10D…

sequential

Others

Legal Case Reports

This dataset contains Australian legal cases from the Federal Court of Australia (FCA). The cases were downloaded from AustLII ([Web Link]). We included a…

classification, text

Others

QSAR biodegradation

The QSAR biodegradation dataset was built in the Milano Chemometrics and QSAR Research Group (Universit degli Studi Milano Bicocca, Milano, Italy). The r…

classification, multivariate

Others

Turkiye Student Evaluation

N/A

classification, clustering, multivariate

Others

seismic-bumps

Mining activity was and is always connected with the occurrence of dangers which are commonly called mining hazards. A special case of such threat is a s…

classification, multivariate

Others

User Identification From Walk…

The dataset collects data from an Android smartphone positioned in the chest pocket. Accelerometer Data are collected from 22 participants walking in the …

classification, clustering, sequential, time-series, univariate

Others

Activity Recognition from Sin…

--- The dataset collects data from a wearable accelerometer mounted on the chest --- Sampling frequency of the accelerometer: 52 Hz --- Accelerom…

classification, clustering, sequential, time-series, univariate

Others

Others

85 Datasets

Datasets

YearPredictionMSD

Record Linkage Comparison Pat…

Vertebral Column

QtyT40I10D100K

Legal Case Reports

QSAR biodegradation

Turkiye Student Evaluation

seismic-bumps

User Identification From Walk…

Activity Recognition from Sin…