public interface DfTable
Interface to an object that keeps track of the document frequency of each term in the collection. Concrete implementations may differ in their underlying data structure, e.g., hashes (faster lookup, but less memory efficient) or sorted arrays (slower binary-search lookup, but more memory efficient).
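As a rough illustration of the array-based trade-off described above, the following sketch stores terms in a sorted array with a parallel array of document frequencies and looks terms up by binary search. The class name `SortedArrayDfLookup`, its constructor, and the choice to return 0 for unknown terms are assumptions made for this example; they are not taken from the library.

```java
import java.util.Arrays;

// Hypothetical sketch of an array-backed lookup; not the library's implementation.
final class SortedArrayDfLookup {
    private final String[] terms; // must be sorted in natural (lexicographic) order
    private final int[] dfs;      // dfs[i] is the document frequency of terms[i]

    SortedArrayDfLookup(String[] sortedTerms, int[] dfs) {
        if (sortedTerms.length != dfs.length) {
            throw new IllegalArgumentException("terms and dfs must have the same length");
        }
        this.terms = sortedTerms.clone();
        this.dfs = dfs.clone();
    }

    // O(log n) binary search; two flat arrays avoid per-entry hash table overhead.
    int getDf(String term) {
        int i = Arrays.binarySearch(terms, term);
        return i >= 0 ? dfs[i] : 0; // assumption: terms not in the table report 0
    }
}
```

A hash-based variant would replace the two arrays with a `HashMap<String, Integer>`, trading extra memory per entry for expected constant-time lookup.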
Modifier and Type | Method and Description |
---|---|
int | getCountOfTermWithDfOne(): Returns the number of terms that only appear in one document. |
int | getDf(String term): Returns the document frequency of a term. |
int | getDocumentCount(): Returns the number of documents in the collection. |
int | getMaxDf(): Returns the document frequency of the term with the highest document frequency. |
String | getMaxDfTerm(): Returns the term with the highest document frequency. |
int | getVocabularySize(): Returns the number of unique terms in the collection. |
int getCountOfTermWithDfOne()
int getDf(String term)
int getDocumentCount()
int getMaxDf()
String getMaxDfTerm()
int getVocabularySize()
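
To make the method contracts above concrete, here is a minimal hash-backed sketch that implements all six methods. It is only one possible reading of the interface: the class name `HashDfTable`, the constructor that takes pre-tokenized documents, and the behavior for unknown terms (returning 0) are assumptions for illustration, and `DfTable` is redeclared package-private solely so the example compiles on its own.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.List;
import java.util.Map;

// Redeclared here (package-private) only so the sketch is self-contained.
interface DfTable {
    int getCountOfTermWithDfOne();
    int getDf(String term);
    int getDocumentCount();
    int getMaxDf();
    String getMaxDfTerm();
    int getVocabularySize();
}

// Hypothetical hash-backed implementation; not the library's own class.
final class HashDfTable implements DfTable {
    private final Map<String, Integer> df = new HashMap<>();
    private final int documentCount;
    private int maxDf = 0;
    private String maxDfTerm = null;
    private int dfOneCount = 0;

    // Assumption: each inner list holds the tokens of one document.
    HashDfTable(List<List<String>> documents) {
        documentCount = documents.size();
        for (List<String> doc : documents) {
            // Deduplicate so a term counts at most once per document.
            for (String term : new HashSet<>(doc)) {
                df.merge(term, 1, Integer::sum);
            }
        }
        // Precompute the summary statistics once.
        for (Map.Entry<String, Integer> e : df.entrySet()) {
            int value = e.getValue();
            if (value == 1) {
                dfOneCount++;
            }
            if (value > maxDf) {
                maxDf = value;
                maxDfTerm = e.getKey();
            }
        }
    }

    @Override public int getCountOfTermWithDfOne() { return dfOneCount; }
    @Override public int getDf(String term) { return df.getOrDefault(term, 0); }
    @Override public int getDocumentCount() { return documentCount; }
    @Override public int getMaxDf() { return maxDf; }
    @Override public String getMaxDfTerm() { return maxDfTerm; }
    @Override public int getVocabularySize() { return df.size(); }
}
```

For three documents tokenized as `[a, b, a]`, `[b, c]`, and `[b]`, this sketch reports a document frequency of 3 for `b` and 1 for `a` (repeated occurrences within one document count once), a vocabulary size of 3, and two terms with a document frequency of one.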