Description

The file animals.c is a data generator of structured instances representing quadruped animals as used by Gennari, Langley, and Fisher (1989) to evaluate the CLASSIT unsupervised learning algorithm. Instances have 8 components: neck, four legs, torso, head, and tail. Each component is represented as a simplified/generalized cylinder (i.e., inspired by David Marr's work in "Vision: A Computational Investigation Into the Human Representation and Processing of Visual Information", published by Freeman in 1982). Each cylinder is itself described by 9 attributes: location x 3, axis x 3, height, radius, and texture. This code generates instances in one of four classes: dogs, cats, horses, and giraffes. The program generates instances by selecting a class according to a distribution determined by function rand4(). Each class has a prototype; the prototype of the selected class is perturbed according to a distribution described in the code for the four classes (i.e., parameterized means with Guassian distributions are used to represent prototypes and perturbation distributions, where the means are used to distinguish the four classes). From John Gennari: (1990) The only notes I have about it is that I don't use the data format it creates any more. To change this, modify "printpart()". Also, it uses a very rough approximation for a bell-shaped distribution. Currently, I use a much more sophisticated random number generator. To fix this, just replace "bellrand()" with a real bell shaped distribution.