2 data files: -- horse-colic.data: 300 training instances -- horse-colic.test: 68 test instances Possible class attributes: 24 (whether le...
multivariate, classificationThis is perhaps the best known database to be found in the pattern recognition literature. Fisher's paper is a classic in the field and is referenced f...
multivariate, classificationThe first 5 variables are all blood tests which are thought to be sensitive to liver disorders that might arise from excessive alcohol consumption. Each...
multivariateThis data was used by Hong and Young to illustrate the power of the optimal discriminant plane even in ill-posed settings. Applying the KNN method in th...
multivariate, classificationThis is one of three domains provided by the Oncology Institute that has repeatedly appeared in the machine learning literature. (See also breast-cancer...
multivariate, classificationThis dataset has been developed to help evaluate a "hybrid" learning algorithm ("KBANN") that uses examples to inductively refine preexisting knowledge....
domain-theory, classification, sequentialThis is a data set used by Ning Qian and Terry Sejnowski in their study using a neural net to predict the secondary structure of certain globular protei...
classification, sequentialProblem Description: Splice junctions are points on a DNA sequence at which `superfluous' DNA is removed during the process of protein creation in hig...
domain-theory, classification, sequentialSeveral constraints were placed on the selection of these instances from a larger database. In particular, all patients here are females at least 21 ye...
multivariate, classification