查看: 4327|回复: 5

ant cluster tool box

[复制链接]

字体大小: 正常放大

szfjnu

16 主题	10 听众	1527 积分

升级 52.7%

TA的每日心情

	无聊 2023-3-3 16:34

签到天数: 101 天

[LV.6]常住居民II

电梯直达

1^#

发表于 2010-12-4 21:58 |只看该作者 |倒序浏览

|招呼Ta 关注Ta

This is a Matlab toolbox for investigating the application of cluster ensembles to data classification, with the objective of improving the accuracy and/or speed of clustering. The toolbox divides the cluster ensemble problem into four areas, providing functionality for each. These include, (1) synthetic data generation, (2) clustering to generate individual data partitions and similarity matrices, (3) consensus function generation and final clustering to generate ensemble data partitioning, and (4) implementation of accuracy metrics.
With regard to data generation, Gaussian data of arbitrary dimension can be generated. The kcenters algorithm can then be used to generate individual data partitions by either, (a) subsampling the data and clustering each subsample, or by (b) randomly initializing the algorithm and generating a clustering for each initialization. In either case an overall similarity matrix can be computed using a consensus function operating on the individual similarity matrices. A final clustering can be performed and performance metrics are provided for evaluation purposes.

mce.zip

7.44 KB, 下载次数: 1, 下载积分: 体力 -2 点

zan

转播0 淘帖0 分享0 收藏0 支持1 反对0 微信

使用道具举报

szfjnu

16 主题	10 听众	1527 积分

升级 52.7%

TA的每日心情

	无聊 2023-3-3 16:34

签到天数: 101 天

[LV.6]常住居民II

2^#

发表于 2010-12-4 21:58 |只看该作者 |招呼Ta 关注Ta

This is a Matlab toolbox for investigating the application of cluster ensembles to data classification, with the objective of improving the accuracy and/or speed of clustering. The toolbox divides the cluster ensemble problem into four areas, providing functionality for each. These include, (1) synthetic data generation, (2) clustering to generate individual data partitions and similarity matrices, (3) consensus function generation and final clustering to generate ensemble data partitioning, and (4) implementation of accuracy metrics.

With regard to data generation, Gaussian data of arbitrary dimension can be generated. The kcenters algorithm can then be used to generate individual data partitions by either, (a) subsampling the data and clustering each subsample, or by (b) randomly initializing the algorithm and generating a clustering for each initialization. In either case an overall similarity matrix can be computed using a consensus function operating on the individual similarity matrices. A final clustering can be performed and performance metrics are provided for evaluation purposes.