Cluster generators: synthetic data for the evaluation of clustering algorithms
Downloads:
Description of the two generators (.pdf file)
160 sample data sets. These data sets are in space-separated row-column format, with the last column containing the class label.
Ellipsoid generator (C source code)
Gaussian generator (C++ source code)
Hand-crafted data sets
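
Reading the sample data sets: the format described above (space-separated rows, class label in the last column) can be parsed with a few lines of code. The Python sketch below is only an illustration and is not part of the download. The function name parse_labelled_rows and the sample values are made up for this example, and it assumes every column except the last is numeric. Labels are kept as text because their encoding is not specified here.

```python
from typing import List, Tuple


def parse_labelled_rows(text: str) -> Tuple[List[List[float]], List[str]]:
    """Split space-separated rows into feature vectors and class labels.

    Hypothetical helper: assumes all columns but the last are numeric
    and that the last column holds the class label.
    """
    features: List[List[float]] = []
    labels: List[str] = []
    for line in text.splitlines():
        fields = line.split()  # splits on any run of whitespace
        if not fields:
            continue  # skip blank lines
        features.append([float(value) for value in fields[:-1]])
        labels.append(fields[-1])  # label kept as text, not converted
    return features, labels


# Made-up sample in the described layout (not taken from the real data sets).
sample = "0.5 1.25 0\n2.0 3.5 1\n"
points, classes = parse_labelled_rows(sample)
```

To load one of the downloaded files, read its contents into a string (for example with open(path).read(), where path is wherever the file was saved) and pass that string to the function.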