O - Object type

@Reference(authors="A. McCallum, K. Nigam, L. H. Ungar", title="Efficient Clustering of High Dimensional Data Sets with Application to Reference Matching", booktitle="Proc. 6th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining", url="https://doi.org/10.1145/347090.347123", bibkey="DBLP:conf/kdd/McCallumNU00")
public class CanopyPreClustering<O> extends AbstractDistanceBasedAlgorithm<O,Clustering<PrototypeModel<O>>> implements ClusteringAlgorithm<Clustering<PrototypeModel<O>>>

Reference:
 A. McCallum, K. Nigam, L. H. Ungar
 Efficient Clustering of High Dimensional Data Sets with Application to
 Reference Matching
 Proc. 6th ACM SIGKDD Int. Conf. on Knowledge Discovery and Data Mining
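
The cited paper describes canopy clustering with two distance thresholds, which correspond to the t1 (inclusion) and t2 (removal) fields of this class. As a rough illustration only, the following standalone Java sketch shows one common formulation of that idea on one-dimensional points. It does not use ELKI and is not ELKI's implementation: the class name CanopySketch, the method canopies, the absolute-difference distance, the choice of the first remaining point as the canopy center, and the boundary handling (d <= t1, d <= t2) are all assumptions made for this example.

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.LinkedList;
import java.util.List;

// Hypothetical illustration; not part of ELKI.
public class CanopySketch {
  /**
   * Builds canopies over one-dimensional points.
   * Each canopy is a list of indices into points; its first entry is the center.
   * Assumption: t1 is the looser (inclusion) threshold, t2 the tighter (removal) one.
   */
  static List<List<Integer>> canopies(double[] points, double t1, double t2) {
    if (t1 < t2) {
      throw new IllegalArgumentException("t1 must be at least as large as t2");
    }
    // Indices of points that may still start or join a canopy.
    List<Integer> remaining = new LinkedList<>();
    for (int i = 0; i < points.length; i++) {
      remaining.add(i);
    }
    List<List<Integer>> result = new ArrayList<>();
    while (!remaining.isEmpty()) {
      // Take the first remaining point as the center of a new canopy.
      int center = remaining.remove(0);
      List<Integer> canopy = new ArrayList<>();
      canopy.add(center);
      for (Iterator<Integer> it = remaining.iterator(); it.hasNext();) {
        int j = it.next();
        double d = Math.abs(points[center] - points[j]);
        if (d > t1) {
          continue; // too far away to belong to this canopy
        }
        canopy.add(j); // within the inclusion threshold
        if (d <= t2) {
          it.remove(); // within the removal threshold: no longer a candidate
        }
      }
      result.add(canopy);
    }
    return result;
  }

  public static void main(String[] args) {
    double[] points = {0.0, 1.0, 2.5, 10.0};
    System.out.println(canopies(points, 3.0, 1.5)); // prints [[0, 1, 2], [2], [3]]
  }
}
```

In this sketch a point that lies between the two thresholds (index 2 in the example) joins the canopy but stays a candidate, so canopies may overlap. Only points within t2 of a center are taken out of consideration.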

| Modifier and Type | Class and Description |
|---|---|
| static class | CanopyPreClustering.Parameterizer<O>: Parameterization class |

| Modifier and Type | Field and Description |
|---|---|
| private static Logging | LOG: Class logger. |
| private double | t1: Threshold for inclusion |
| private double | t2: Threshold for removal |

Inherited fields: ALGORITHM_ID, DISTANCE_FUNCTION_ID

| Constructor and Description |
|---|
| CanopyPreClustering(DistanceFunction<? super O> distanceFunction, double t1, double t2): Constructor. |

| Modifier and Type | Method and Description |
|---|---|
| TypeInformation[] | getInputTypeRestriction(): Get the input type restriction used for negotiating the data query. |
| protected Logging | getLogger(): Get the (STATIC) logger for this class. |
| Clustering<PrototypeModel<O>> | run(Database database, Relation<O> relation): Run the algorithm |

Inherited methods: getDistanceFunction, run, clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

private static final Logging LOG
private double t1
private double t2
public CanopyPreClustering(DistanceFunction<? super O> distanceFunction, double t1, double t2)
distanceFunction - Distance function
t1 - Inclusion threshold
t2 - Exclusion threshold

public Clustering<PrototypeModel<O>> run(Database database, Relation<O> relation)
database - Database
relation - Relation to process

public TypeInformation[] getInputTypeRestriction()
Description copied from class: AbstractAlgorithm
Specified by: getInputTypeRestriction in interface Algorithm
Specified by: getInputTypeRestriction in class AbstractAlgorithm<Clustering<PrototypeModel<O>>>

protected Logging getLogger()
Description copied from class: AbstractAlgorithm
Specified by: getLogger in class AbstractAlgorithm<Clustering<PrototypeModel<O>>>

Copyright © 2019 ELKI Development Team.