Wang Pingxin, Liu Qiang, Yang Xibei, Mi Jusheng. Three-way Clustering Analysis Based on Dynamic Neighborhood[J]. Computer Science, 2018, 45(1): 62-66, 89
Three-way Clustering Analysis Based on Dynamic Neighborhood
Submitted: 2017-03-03  Revised: 2017-05-07
DOI:10.11896/j.issn.1002-137X.2018.01.009
Keywords: three-way clustering, neighborhood, K-means clustering, spectral clustering
Funding: This work was supported by the National Natural Science Foundation of China.
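Authors, affiliations, and e-mail:
Wang Pingxin, School of Science, Jiangsu University of Science and Technology, Zhenjiang, Jiangsu 212003; College of Mathematics and Information Science, Hebei Normal University, Shijiazhuang 050024; pingxin_wang@hotmail.com
Liu Qiang, School of Computer Science, Jiangsu University of Science and Technology, Zhenjiang, Jiangsu 212003; qliu05@sina.com
Yang Xibei, School of Computer Science, Jiangsu University of Science and Technology, Zhenjiang, Jiangsu 212003; zhenjiangyangxibei@163.com
Mi Jusheng, College of Mathematics and Information Science, Hebei Normal University, Shijiazhuang 050024; mijsh@263.net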
Abstract:
      Most existing clustering methods are two-way: an object either belongs to a cluster or does not, so every cluster must have a crisp boundary. However, forcing uncertain objects into a cluster degrades the structure and accuracy of the clustering result. Three-way clustering is an overlapping clustering that represents each cluster by a core region and a fringe region, and so handles uncertain objects better. This paper proposes a method that converts a two-way clustering into a three-way clustering using the neighborhoods of the samples. Starting from the two-way result, each cluster is shrunk by keeping only those elements whose neighborhood is entirely contained in the cluster, and it is stretched by adding those elements outside the cluster whose neighborhood intersects it. The shrunk set is called the core region, and the difference between the stretched set and the core region is called the fringe region. Experiments on UCI data sets show that the method improves the structure and F1 values of the clustering results.
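The shrink and stretch steps described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not define how the (dynamic) neighborhood is built, so the code assumes a fixed k-nearest-neighbor neighborhood under Euclidean distance. The names `k_nearest` and `three_way_from_two_way`, the parameter `k`, and the toy data are all hypothetical.

```python
from math import dist


def k_nearest(points, i, k):
    """Indices of the k points closest to points[i], excluding i itself.

    Assumption: a fixed k-NN neighborhood stands in for the paper's
    neighborhood, which the abstract does not specify.
    """
    others = [j for j in range(len(points)) if j != i]
    others.sort(key=lambda j: dist(points[i], points[j]))
    return set(others[:k])


def three_way_from_two_way(points, labels, k=2):
    """Map each cluster label to a (core, fringe) pair of index sets."""
    n = len(points)
    neighborhoods = [k_nearest(points, i, k) for i in range(n)]
    regions = {}
    for c in sorted(set(labels)):
        members = {i for i, lab in enumerate(labels) if lab == c}
        # Shrink: keep members whose whole neighborhood lies inside the cluster.
        core = {i for i in members if neighborhoods[i] <= members}
        # Stretch: add outsiders whose neighborhood intersects the cluster.
        stretched = members | {
            i for i in range(n)
            if i not in members and neighborhoods[i] & members
        }
        # Fringe: the stretched set minus the core.
        regions[c] = (core, stretched - core)
    return regions


# Hypothetical toy data: two groups on a line with one point in between.
points = [(0.0,), (1.0,), (2.0,), (4.4,), (7.0,), (8.0,), (9.0,)]
labels = [0, 0, 0, 0, 1, 1, 1]  # a two-way result, e.g. from K-means
regions = three_way_from_two_way(points, labels, k=2)
```

In this toy run the in-between point (index 3) is dropped from the core of cluster 0, because one of its two nearest neighbors belongs to cluster 1, and it ends up in the fringe of both clusters. The two-way labels could come from any hard clustering, such as the K-means or spectral clustering named in the keywords.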