Ivory

A Hadoop toolkit for web-scale information retrieval research

Ivory is a research toolkit for exploring large-scale indexing and retrieval algorithms.

If you're looking for something that "just works", then Ivory probably isn't for you. Try Lucene instead. On the other hand, if you're interested in playing with (half-baked) implementations of state-of-the-art retrieval algorithms, this may be the system for you! Ivory includes features presented in many academic research papers.

See our publications page for the full list.