Life Sciences

100 Datasets

Datasets


Zoo

A simple database containing 17 Boolean-valued attributes. The "type" attribute appears to be the class attribute. Here is a breakdown of which animal...

multivariate, classification

E. Coli Genes

The data was collected from several sources, including GenProtEC ([Web Link]) and SWISSPROT ([Web Link]). Structure prediction was made by PROF ([Web Li...

relational

EEG Database

This data arises from a large study to examine EEG correlates of genetic predisposition to alcoholism. It contains measurements from 64 electrodes place...

time-series, multivariate

M. Tuberculosis Genes

The data was collected from several sources, including the Sanger Centre ([Web Link]) and SWISSPROT ([Web Link]). Structure prediction was made by PROF ...

relational

Dorothea

Drugs are typically small organic molecules that achieve their desired activity by binding to a target site on a receptor. The first step in the discove...

multivariate, classification

Statlog (Heart)

Cost Matrix _______ abse pres absence 0 1 presence 5 0 where the rows represent the true values and the columns the predicted.

multivariate, classification

Mammographic Mass

Mammography is the most effective method for breast cancer screening available today. However, the low positive predictive value of breast biopsy result...

multivariate, classification

Arcene

ARCENE was obtained by merging three mass-spectrometry datasets to obtain enough training and test data for a benchmark. The original features indicate ...

multivariate, classification

Abscisic Acid Signaling Net...

The objective is to determine the set of boolean rules that describe the interactions of the nodes within this plant signaling network. The dataset incl...

multivariate, causal-discovery