Reuters Transcribed Subset

Data Characteristics: This data was created by selecting 20 files each from the 10 largest classes in the Reuters-21578 collection...

text, classification

Blood Transfusion Service C...

To demonstrate the RFMTC marketing model (a modified version of RFM), this study adopted the donor database of Blood Transfusion Service Center in Hsin-...

multivariate, classification

Wine Quality

The two datasets are related to red and white variants of the Portuguese "Vinho Verde" wine. For more details, consult: [Web Link] or the reference [Cor...

regression, multivariate, classification

Amazon Access Samples

This is a sparse data set; fewer than 10% of the attributes are used for each sample. The link is to a '*.tgz' file which contains two files: [amzn-anon-...

clustering, causal-discovery, regression, time-series, domain-theory

Farm Ads

This data was collected from text ads found on twelve websites that deal with various farm animal related topics. Information from the ad creative and ...

text, classification

Bank Marketing

The data is related to direct marketing campaigns of a Portuguese banking institution. The marketing campaigns were based on phone calls. Often, more ...

multivariate, classification


This is a data set containing 1080 documents of free text business descriptions of Brazilian companies categorized into a subset of 9 categories catalog...

multivariate, text, classification


Data is organized with regard to working days in the Istanbul Stock Exchange.

univariate, classification, regression, time-series, multivariate

Wholesale customers

multivariate, classification, clustering

Dow Jones Index

In predicting stock prices you collect data over some period of time - day, week, month, etc. But you cannot take advantage of data from a time period u...

time-series, classification, clustering