Package elki.clustering.silhouette
Class FastMSC<O>
- java.lang.Object
-
- elki.clustering.kmedoids.PAM<O>
-
- elki.clustering.silhouette.PAMSIL<O>
-
- elki.clustering.silhouette.PAMMEDSIL<O>
-
- elki.clustering.silhouette.FastMSC<O>
-
- Type Parameters:
O-
- All Implemented Interfaces:
Algorithm,ClusteringAlgorithm<Clustering<MedoidModel>>,KMedoidsClustering<O>
- Direct Known Subclasses:
FasterMSC
@Reference(authors="Lars Lenssen and Erich Schubert", title="Clustering by Direct Optimization of the Medoid Silhouette", booktitle="Int. Conf. on Similarity Search and Applications, SISAP 2022", url="https://doi.org/10.1007/978-3-031-17849-8_15", bibkey="DBLP:conf/sisap/LenssenS22") public class FastMSC<O> extends PAMMEDSIL<O>
Fast Medoid Silhouette Clustering.This clustering algorithm tries to find an optimal silhouette clustering for an approximation to the silhouette called "medoid silhouette" using a swap-based heuristic similar to PAM. By also caching the distance to the third nearest center (compare to FastPAM, which only used the second nearest), we are able to reduce the runtime per iteration to just O(n²), which yields an acceptable run time for many use cases, while often finding a solution with better silhouette than other clustering methods.
Reference:
Lars Lenssen and Erich Schubert
Clustering by Direct Optimization of the Medoid Silhouette
Int. Conf. on Similarity Search and Applications, SISAP 2022- Author:
- Erich Schubert
-
-
Nested Class Summary
Nested Classes Modifier and Type Class Description protected classFastMSC.InstanceFastMSC clustering instance for a particular data set.protected classFastMSC.Instance2Simplified FastMSC clustering instance for k=2.static classFastMSC.Par<O>Parameterization class.protected static classFastMSC.RecordData stored per point.-
Nested classes/interfaces inherited from interface elki.Algorithm
Algorithm.Utils
-
-
Constructor Summary
Constructors Constructor Description FastMSC(Distance<? super O> distance, int k, int maxiter, KMedoidsInitialization<O> initializer)Constructor.
-
Method Summary
All Methods Static Methods Instance Methods Concrete Methods Modifier and Type Method Description protected LogginggetLogger()Get the static class logger.protected static doubleloss(double a, double b)Loss function used - here simply a/b, 0 if a=b=0.Clustering<MedoidModel>run(Relation<O> relation, int k, DistanceQuery<? super O> distQ)Run k-medoids clustering with a given distance query.
Not a very elegant API, but needed for some types of nested k-medoids.-
Methods inherited from class elki.clustering.kmedoids.PAM
getInputTypeRestriction, initialMedoids, run, wrapResult
-
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
-
Methods inherited from interface elki.clustering.ClusteringAlgorithm
autorun
-
-
-
-
Field Detail
-
LOG
private static final Logging LOG
The logger for this class.
-
-
Constructor Detail
-
FastMSC
public FastMSC(Distance<? super O> distance, int k, int maxiter, KMedoidsInitialization<O> initializer)
Constructor.- Parameters:
distance- Distance functionk- Number of clustermaxiter- Maximum number of iterationsinitializer- Initialization
-
-
Method Detail
-
run
public Clustering<MedoidModel> run(Relation<O> relation, int k, DistanceQuery<? super O> distQ)
Description copied from interface:KMedoidsClusteringRun k-medoids clustering with a given distance query.
Not a very elegant API, but needed for some types of nested k-medoids.
-
loss
protected static final double loss(double a, double b)Loss function used - here simply a/b, 0 if a=b=0.- Parameters:
a- distance to nearestb- distance to second- Returns:
- loss, a/b or 0.
-
-