Class | Description |
---|---|
ComputeSignaturesMinhash |
A Hadoop task to compute signatures from document vectors.
|
ComputeSignaturesMinhash.MyMapper |
Signatures are created in a sequence of MyMapper calls.
|
ComputeSignaturesRandom |
A Hadoop task to compute signatures from document vectors.
|
ComputeSignaturesRandom.MyMapper |
Convert int doc vectors into NBitSignature objects using LSH.
|
ComputeSignaturesSimhash |
A Hadoop task to compute signatures from document vectors.
|
ComputeSignaturesSimhash.MyMapper |
Simhash implementation, as explained in Manku et al's Detecting near-duplicates for web
crawling (WWW07)
|
GeneralHashFunctionLibrary | |
WriteRandomVectors | |
WriteRandomVectors.MyMapper0 | |
WriteRandomVectors.MyReducer0 |