Class CosineHashFunctionFamily

  • All Implemented Interfaces:
    LocalitySensitiveHashFunctionFamily<NumberVector>

    @Reference(authors="M. S. Charikar",title="Similarity estimation techniques from rounding algorithms",booktitle="Proc. 34th ACM Symposium on Theory of Computing, STOC\'02",url="https://doi.org/10.1145/509907.509965",bibkey="DBLP:conf/stoc/Charikar02") @Reference(authors="M. Henzinger",title="Finding near-duplicate web pages: a large-scale evaluation of algorithms",booktitle="Proc. 29th ACM Conf. Research and Development in Information Retrieval (SIGIR 2006)",url="https://doi.org/10.1145/1148170.1148222",bibkey="DBLP:conf/sigir/Henzinger06")
    public class CosineHashFunctionFamily
    extends java.lang.Object
    implements LocalitySensitiveHashFunctionFamily<NumberVector>
    Hash function family to use with Cosine distance, using simplified hash functions where the projection is only drawn from +-1, instead of Gaussian distributions.

    References:

    M. S. Charikar
    Similarity estimation techniques from rounding algorithms
    Proc. 34th ACM Symposium on Theory of Computing, STOC'02

    M. Henzinger
    Finding near-duplicate web pages: a large-scale evaluation of algorithms
    Proc. 29th ACM Conf. Research and Development in Information Retrieval (SIGIR 2006)

    Since:
    0.7.0
    Author:
    Evgeniy Faerman