Class SetMatchingPurity


  • @Reference(authors="M. Meil\u0103",
               title="Comparing clusterings",
               booktitle="University of Washington, Seattle, Technical Report 418",
               url="http://www.stat.washington.edu/mmp/Papers/compare-colt.pdf",
               bibkey="tr/washington/Meila02")
    public class SetMatchingPurity
    extends java.lang.Object
    Set matching purity measures.

    References:

    M. Meilă
    Comparing clusterings
    University of Washington, Seattle, Technical Report 418

    Y. Zhao, G. Karypis
    Criterion functions for document clustering: Experiments and analysis
    University of Minnesota, Dep. Computer Science, Technical Report 01-40

    M. Steinbach, G. Karypis, V. Kumar
    A Comparison of Document Clustering Techniques
    KDD workshop on text mining. Vol. 400. No. 1

    E. Amigó, J. Gonzalo, J. Artiles, and F. Verdejo
    A comparison of extrinsic clustering evaluation metrics based on formal constraints
    Information Retrieval 12(5)

    Since:
    0.5.0
    Author:
    Sascha Goldhofer
    • Field Summary

      Fields 
      Modifier and Type Field Description
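      protected double smFFirst
      Cached set matching F-measure of the first clustering (see fMeasureFirst())
      protected double smFSecond
      Cached set matching F-measure of the second clustering (see fMeasureSecond())
      protected double smInversePurity
      Cached inverse purity (see inversePurity())
      protected double smPurity
      Cached purity (see purity())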
    • Method Summary

      Modifier and Type Method Description
      double f1Measure()
      Get the set matching F1-Measure
      double fMeasureFirst()
      Get Van Rijsbergen’s F-measure (asymmetric) for the first clustering
      double fMeasureSecond()
      Get Van Rijsbergen’s F-measure (asymmetric) for the second clustering
      double inversePurity()
      Get the set matching inverse purity of the second clustering with respect to the first (normalized so that equal clusterings score 1)
      double purity()
      Get the set matching purity of the first clustering with respect to the second (normalized so that equal clusterings score 1)
      • Methods inherited from class java.lang.Object

        clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
    • Field Detail

      • smPurity

        protected double smPurity
        Cached purity (see purity())
      • smInversePurity

        protected double smInversePurity
        Cached inverse purity (see inversePurity())
      • smFFirst

        protected double smFFirst
        Cached set matching F-measure of the first clustering (see fMeasureFirst())
      • smFSecond

        protected double smFSecond
        Cached set matching F-measure of the second clustering (see fMeasureSecond())
    • Constructor Detail

      • SetMatchingPurity

        protected SetMatchingPurity​(ClusterContingencyTable table)
        Constructor.
        Parameters:
        table - Contingency table of the two clusterings being compared
    • Method Detail

      • purity

        @Reference(authors="Y. Zhao, G. Karypis",
                   title="Criterion functions for document clustering: Experiments and analysis",
                   booktitle="University of Minnesota, Dep. Computer Science, Technical Report 01-40",
                   url="http://www-users.cs.umn.edu/~karypis/publications/Papers/PDF/vscluster.pdf",
                   bibkey="tr/umn/ZhaoK01")
        public double purity()
        Get the set matching purity of the first clustering with respect to the second (normalized so that equal clusterings score 1)

        Y. Zhao, G. Karypis
        Criterion functions for document clustering: Experiments and analysis
        University of Minnesota, Dep. Computer Science, Technical Report 01-40

        Returns:
        Purity
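
As an illustration of the measure described by Zhao and Karypis, the following is a minimal sketch of purity computed from a plain contingency matrix. It assumes rows are clusters of the first clustering and columns are clusters of the second; the class `PuritySketch` and the `int[][]` layout are hypothetical and are not part of this API, which works on a ClusterContingencyTable instead.

```java
/** Hypothetical helper for illustration only; not part of the documented class. */
final class PuritySketch {
  /**
   * Purity of the row clustering with respect to the column clustering:
   * each row contributes its largest overlap with any column, and the
   * sum is divided by the total number of objects.
   *
   * @param counts counts[i][j] = objects in row cluster i and column cluster j (assumed layout)
   */
  static double purity(int[][] counts) {
    long total = 0;
    long matched = 0;
    for (int[] row : counts) {
      int best = 0;
      for (int c : row) {
        total += c;                // running total of all objects
        best = Math.max(best, c);  // best-matching column for this row
      }
      matched += best;
    }
    // An empty table has no defined purity; 0.0 is an arbitrary choice here.
    return total == 0 ? 0.0 : (double) matched / total;
  }
}
```

For example, the matrix `{{4, 0}, {1, 3}, {0, 2}}` has row maxima 4, 3 and 2 over 10 objects, so the sketch yields 0.9.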
      • inversePurity

        public double inversePurity()
        Get the set matching inverse purity of the second clustering with respect to the first (normalized so that equal clusterings score 1)
        Returns:
        Inverse purity
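
Inverse purity swaps the roles of the two clusterings. A minimal sketch, assuming the same hypothetical `int[][]` layout (rows for the first clustering, columns for the second), takes the maximum over each column instead of each row. The class name `InversePuritySketch` is an assumption made for this example.

```java
/** Hypothetical helper for illustration only; not part of the documented class. */
final class InversePuritySketch {
  /**
   * Inverse purity: each column cluster contributes its largest overlap
   * with any row cluster, divided by the total number of objects.
   *
   * @param counts rectangular matrix, counts[i][j] = overlap of row i and column j (assumed layout)
   */
  static double inversePurity(int[][] counts) {
    int cols = counts.length == 0 ? 0 : counts[0].length;
    long total = 0;
    long matched = 0;
    for (int j = 0; j < cols; j++) {
      int best = 0;
      for (int[] row : counts) {
        total += row[j];                // running total of all objects
        best = Math.max(best, row[j]);  // best-matching row for this column
      }
      matched += best;
    }
    // An empty table has no defined value; 0.0 is an arbitrary choice here.
    return total == 0 ? 0.0 : (double) matched / total;
  }
}
```

For the matrix `{{4, 0}, {1, 3}, {0, 2}}` the column maxima are 4 and 3 over 10 objects, so the sketch yields 0.7, lower than the row-wise purity of the same matrix.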
      • f1Measure

        @Reference(authors="M. Steinbach, G. Karypis, V. Kumar",
                   title="A Comparison of Document Clustering Techniques",
                   booktitle="KDD workshop on text mining. Vol. 400. No. 1",
                   url="http://glaros.dtc.umn.edu/gkhome/fetch/papers/docclusterKDDTMW00.pdf",
                   bibkey="conf/kdd/SteinbachKK00")
        public double f1Measure()
        Get the set matching F1-Measure

        M. Steinbach, G. Karypis, V. Kumar
        A Comparison of Document Clustering Techniques
        KDD workshop on text mining. Vol. 400. No. 1

        Returns:
        Set Matching F1-Measure
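
This page does not state how the F1 value is derived. A common reading, assumed here, is the harmonic mean of purity and inverse purity; the sketch below shows that combination only and may differ from the actual implementation. The class `SetMatchingF1Sketch` is hypothetical.

```java
/** Hypothetical helper for illustration only; not part of the documented class. */
final class SetMatchingF1Sketch {
  /**
   * Harmonic mean of purity and inverse purity (assumed definition of F1).
   */
  static double f1(double purity, double inversePurity) {
    double sum = purity + inversePurity;
    // Guard against division by zero when both inputs are 0.
    return sum == 0.0 ? 0.0 : 2.0 * purity * inversePurity / sum;
  }
}
```

With purity 0.9 and inverse purity 0.7 this gives 2 * 0.63 / 1.6 = 0.7875. The harmonic mean penalizes an imbalance between the two values more strongly than an arithmetic mean would.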
      • fMeasureFirst

        @Reference(authors="E. Amig\u00f3, J. Gonzalo, J. Artiles, F. Verdejo",
                   title="A comparison of extrinsic clustering evaluation metrics based on formal constraints",
                   booktitle="Information Retrieval 12(5)",
                   url="https://doi.org/10.1007/s10791-009-9106-z",
                   bibkey="DBLP:journals/ir/AmigoGAV09a")
        public double fMeasureFirst()
        Get Van Rijsbergen’s F-measure (asymmetric) for the first clustering

        E. Amigó, J. Gonzalo, J. Artiles, and F. Verdejo
        A comparison of extrinsic clustering evaluation metrics based on formal constraints
        Information Retrieval 12(5)

        Returns:
        Set Matching F-Measure of first clustering
      • fMeasureSecond

        @Reference(authors="E. Amig\u00f3, J. Gonzalo, J. Artiles, F. Verdejo",
                   title="A comparison of extrinsic clustering evaluation metrics based on formal constraints",
                   booktitle="Information Retrieval 12(5)",
                   url="https://doi.org/10.1007/s10791-009-9106-z",
                   bibkey="DBLP:journals/ir/AmigoGAV09a")
        public double fMeasureSecond()
        Get Van Rijsbergen’s F-measure (asymmetric) for the second clustering

        E. Amigó, J. Gonzalo, J. Artiles, and F. Verdejo
        A comparison of extrinsic clustering evaluation metrics based on formal constraints
        Information Retrieval 12(5)

        Returns:
        Set Matching F-Measure of second clustering
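
The asymmetric F-measures can be sketched after the definition in the cited paper by Amigó et al.: every cluster of one clustering is matched to the cluster of the other clustering that maximizes the pairwise F value, and the maxima are averaged with the cluster sizes as weights. The code below is a minimal sketch under that assumption; `AsymmetricFSketch` and the `int[][]` layout (rows for the first clustering, columns for the second) are hypothetical, and the actual implementation may differ in detail.

```java
/** Hypothetical helper for illustration only; not part of the documented class. */
final class AsymmetricFSketch {
  /** Size-weighted average of the best pairwise F value of each row cluster. */
  static double fMeasureFirst(int[][] counts) {
    int rows = counts.length;
    int cols = rows == 0 ? 0 : counts[0].length;
    long[] rowSum = new long[rows];
    long[] colSum = new long[cols];
    long total = 0;
    for (int i = 0; i < rows; i++) {
      for (int j = 0; j < cols; j++) {
        rowSum[i] += counts[i][j];
        colSum[j] += counts[i][j];
        total += counts[i][j];
      }
    }
    if (total == 0) {
      return 0.0; // arbitrary choice for an empty table
    }
    double agg = 0.0;
    for (int i = 0; i < rows; i++) {
      double best = 0.0;
      for (int j = 0; j < cols; j++) {
        long denom = rowSum[i] + colSum[j];
        if (denom > 0) {
          // 2PR / (P + R) with P = n_ij / colSum[j] and R = n_ij / rowSum[i]
          best = Math.max(best, 2.0 * counts[i][j] / denom);
        }
      }
      agg += best * rowSum[i]; // weight by the size of row cluster i
    }
    return agg / total;
  }

  /** Same measure with the roles of the two clusterings swapped. */
  static double fMeasureSecond(int[][] counts) {
    int rows = counts.length;
    int cols = rows == 0 ? 0 : counts[0].length;
    int[][] transposed = new int[cols][rows];
    for (int i = 0; i < rows; i++) {
      for (int j = 0; j < cols; j++) {
        transposed[j][i] = counts[i][j];
      }
    }
    return fMeasureFirst(transposed);
  }
}
```

The two results generally differ because the weighting and the direction of the best-match search depend on which clustering is treated as the reference, which is why the measure is called asymmetric.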