Cluster generators: synthetic data for the evaluation of clustering algorithms

J. Handl and J. Knowles


2-dimensional data set with 40 clusters 100-dimensional data set with 10 clusters

Downloads:
Description of the two generators (.pdf file)
160 sample data sets. These data sets are in space separated row-column format, with the last colum containg the class label.
Ellipsoid generator (C source code)
Gaussian generator (C++ source code)
Hand-crafted data sets