This is perhaps the best known database to be found in the pattern recognition literature. Fisher's paper is a classic in the field and is referenced fre…
classification, multivariate

The first 5 variables are all blood tests which are thought to be sensitive to liver disorders that might arise from excessive alcohol consumption. Each l…
multivariate

This data was used by Hong and Young to illustrate the power of the optimal discriminant plane even in ill-posed settings. Applying the KNN method in the …
classification, multivariate

This is one of three domains provided by the Oncology Institute that has repeatedly appeared in the machine learning literature. (See also breast-cancer a…
classification, multivariate

This dataset has been developed to help evaluate a "hybrid" learning algorithm ("KBANN") that uses examples to inductively refine preexisting knowledge. …
classification, domain-theory, sequential

This is a data set used by Ning Qian and Terry Sejnowski in their study using a neural net to predict the secondary structure of certain globular proteins…
classification, sequential

Problem Description: Splice junctions are points on a DNA sequence at which `superfluous' DNA is removed during the process of protein creation in highe…
classification, domain-theory, sequential

Several constraints were placed on the selection of these instances from a larger database. In particular, all patients here are females at least 21 year…
classification, multivariate

The classification task of this database is to determine where patients in a postoperative recovery area should be sent to next. Because hypothermia is a…
classification, multivariate

This is one of three domains provided by the Oncology Institute that has repeatedly appeared in the machine learning literature. (See also breast-cancer …
classification, multivariate