Improving document clustering using Okapi BM25 feature weighting

您所在的位置:网站首页 税务计算器在线转换 Improving document clustering using Okapi BM25 feature weighting

Improving document clustering using Okapi BM25 feature weighting

2023-03-13 17:40| 来源: 网络整理| 查看: 265

来自 Springer  喜欢 0

阅读量:

50

作者:

John S. Whissell,Charles L. A. Clarke

展开

摘要:

weighting is heavily dependent on both the dataset being clustered and the algorithm used. In addition, binary weighting is shown to be consistently inferior to both weighting. We investigate clustering using both BM25 term saturation in isolation and BM25 term saturation with , confirming that both are superior to their non-BM25 counterparts under several common clustering quality measures. Finally, we investigate estimation of the 1 BM25 parameter when clustering. Our results indicate that typical values of 1 from other IR tasks are not appropriate for clustering; 1 needs to be higher.

展开

关键词:

document clustering feature weighting okapi BM25

DOI:

10.1007/s10791-011-9163-y

被引量:

29

年份:

2011



【本文地址】


今日新闻


推荐新闻


CopyRight 2018-2019 办公设备维修网 版权所有 豫ICP备15022753号-3