Class TermFrequencyParser<V extends SparseNumberVector>

  • All Implemented Interfaces:
    BundleStreamSource, Parser, StreamingParser

    public class TermFrequencyParser<V extends SparseNumberVector>
    extends NumberVectorLabelParser<V>
    A parser for loading term frequency data, which is essentially a set of sparse vectors with text keys.

    Parse a file containing term frequencies. The expected format is:

     rowlabel1 term1 <freq> term2 <freq> ...
     rowlabel2 term1 <freq> term3 <freq> ...
    Terms must not contain the separator character!
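
    The row format above can be illustrated with a small standalone sketch. This is not the parser's actual implementation and uses no ELKI classes: the names TermFrequencySketch, Row, and parseLine are hypothetical. It assumes whitespace as the separator, exactly one label token per row, and it sums the frequencies of a term that is repeated within a row. The real parser may differ on all three points.

```java
import java.util.LinkedHashMap;
import java.util.Map;

/** Hypothetical illustration of the "rowlabel term freq term freq ..." format. */
final class TermFrequencySketch {

  /** One parsed row: its label and a sparse term -> frequency map. */
  public static final class Row {
    public final String label;
    public final Map<String, Double> frequencies;

    public Row(String label, Map<String, Double> frequencies) {
      this.label = label;
      this.frequencies = frequencies;
    }
  }

  /** Parses a single line; assumes whitespace-separated tokens. */
  public static Row parseLine(String line) {
    String[] tokens = line.trim().split("\\s+");
    if (tokens[0].isEmpty()) {
      throw new IllegalArgumentException("Empty line");
    }
    // Label plus (term, frequency) pairs means an odd token count.
    if (tokens.length % 2 == 0) {
      throw new IllegalArgumentException("Term without a frequency in: " + line);
    }
    Map<String, Double> frequencies = new LinkedHashMap<>();
    for (int i = 1; i < tokens.length; i += 2) {
      double freq = Double.parseDouble(tokens[i + 1]);
      // Assumption of this sketch: repeated terms are summed.
      frequencies.merge(tokens[i], freq, Double::sum);
    }
    return new Row(tokens[0], frequencies);
  }
}
```

    Only the terms that occur in a row are stored, which is what makes the resulting vectors sparse. Because tokens are split on the separator, a term that itself contains the separator would be broken into two tokens and misread, which is why such terms are not allowed.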

    If your data does not contain frequencies, consider using SimpleTransactionParser instead.

    Author: Erich Schubert