BanditPAM-BanditPAM:一种最先进的高性能k-medoids算法。-James Mayclin and Eric Frankel, Mo Tiwari
发布:2020-12-22 19:40:35.469782
作者:James Mayclin and Eric Frankel, Mo Tiwari
### 作者邮箱:ericsf@stanford.edu
### 首页:https://github.com/ThrunGroup/BanditPAM
### 文档:None
### 下载链接
BanditPAM: A state-of-the-art, high-performance k-medoids algorithm
Introduction
Clustering algorithms such as k-means are ubiquitous in modern data science applications. Despite its popularity, however, k-means has several drawbacks. K-means only supports certain distance metrics, and its cluster centers can lack interpretability. Alternatively, k-medoids uses the points in the dataset itself -- the medoids -- as cluster centers. This enables interpretability of the cluster centers. Furthermore, k-medoids supports arbitrary dissimilarity functions in place of a distance metric.
Details
In this package, we provide a high-performance implementation of BanditPAM , a state-of-the-art k-medoids algorithm. BanditPAM matches state-of-the-art in clustering quality but improves the runtime of previous approaches from O(n^2) to O(nlogn) in each iteration. Furthermore, BanditPAM supports arbitrarily dissimilarity functions between points -- these functions need not even be metrics. This implementation is written in C++ for performance, but is callable from Python via Python bindings. For further details, please see the paper at https://arxiv.org/abs/2006.06856 for the full paper and the code at https://github.com/ThrunGroup/BanditPAM/blob/main/README.md
If you use this software, please cite: Mo Tiwari, Martin Jinye Zhang, James Mayclin, Sebastian Thrun, Chris Piech, Ilan Shomorony. "Bandit-PAM: Almost Linear Time k-medoids Clustering via Multi-Armed Bandits" Advances in Neural Information Processing Systems (NeurIPS) 2020.
Copy from pypi.org
查询时间:5.704ms
渲染时间:5.827ms
本文同步分享在 博客“zhenruyan”(other)。
如有侵权,请联系 support@oschina.cn 删除。
本文参与“OSC源创计划”,欢迎正在阅读的你也加入,一起分享。