Ivory

A Hadoop toolkit for web-scale information retrieval research

The following are papers about Ivory or describing algorithms implemented in Ivory:

Exploiting Representations from Statistical Machine Translation for Cross-Language Information Retrieval.
Ferhan Ture and Jimmy Lin.
ACM Transactions on Information Systems, 32(4), Article 19, 2014.

Flat vs. Hierarchical Phrase-Based Translation Models for Cross-Language Information Retrieval.
Ferhan Ture and Jimmy Lin.
Proceedings of the 36th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2013), pages 813-816, July 2013, Dublin, Ireland.

Document Vector Representations for Feature Extraction in Multi-Stage Document Ranking.
Nima Asadi and Jimmy Lin.
Information Retrieval, 16(6):747-768, 2013.

Combining Statistical Translation Techniques for Cross-Language Information Retrieval.
Ferhan Ture, Jimmy Lin, and Douglas W. Oard.
Proceedings of the 24th International Conference on Computational Linguistics (COLING 2012), pages 2685-2702, December 2012, Mumbai, India.

Fast Candidate Generation for Two-Phase Document Ranking: Postings List Intersection with Bloom Filters.
Nima Asadi and Jimmy Lin
Proceedings of 21th International Conference on Information and Knowledge Management (CIKM 2012), pages 2419-2422, October 2012, Maui, Hawaii.

Looking Inside the Box: Context-Sensitive Translation for Cross-Language Information Retrieval
Ferhan Ture, Jimmy Lin, and Douglas W. Oard
Proceedings of the 35th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2012), pages 1105-1106
August 2012, Portland, Oregon

Why Not Grab a Free Lunch? Mining Large Corpora for Parallel Sentences to Improve Translation Modeling
Ferhan Ture and Jimmy Lin
Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL/HLT 2012), pages 626-630
June 2012, Montreal, Quebec, Canada

When Close Enough Is Good Enough: Approximate Positional Indexes for Efficient Ranked Retrieval
Tamer Elsayed, Jimmy Lin, and Don Metzler
Proceedings of 20th International Conference on Information and Knowledge Management (CIKM 2011), pages 1993-1996
October 2011, Glasgow, Scotland

No Free Lunch: Brute Force vs. Locality-Sensitive Hashing for Cross-lingual Pairwise Similarity
Ferhan Ture, Tamer Elsayed, and Jimmy Lin
Proceedings of the 34th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2011), page 943-952
July 2011, Beijing, China

A Cascade Ranking Model for Efficient Ranked Retrieval
Lidan Wang, Jimmy Lin, and Donald Metzler
Proceedings of the 34th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2011), page 105-114
July 2011, Beijing, China

Ranking Under Temporal Constraints
Lidan Wang, Donald Metzler, and Jimmy Lin
Proceedings of 19th International Conference on Information and Knowledge Management (CIKM 2010), pages 79-88
October 2010, Toronto, Canada

Learning to Efficiently Rank
Lidan Wang, Jimmy Lin, and Donald Metzler
Proceedings of the 33rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2010), pages 138-145
July 2010, Geneva, Switzerland

Learning Concept Importance Using a Weighted Dependence Model
Michael Bendersky and Donald Metzler and W. Bruce Croft
Proceedings of the Third ACM International Conference on Web Search and Data Mining (WSDM 2010), pages 31-40
February 2010, New York, New York

Of Ivory and Smurfs: Loxodontan MapReduce Experiments for Web Search
Jimmy Lin, Donald Metzler, Tamer Elsayed, and Lidan Wang
Proceedings of the Eighteenth Text REtrieval Conference (TREC 2009)
November 2009, Gaithersburg, Maryland

Automatic Feature Selection in the Markov Random Field Model for Information Retrieval
Donald Metzler
Proceedings of the Sixteenth International Conference on Information and Knowledge Management (CIKM 2007), pages 253-262
November 2007, Lisbon, Portugal

Latent Concept Expansion Using Markov Random Fields
Donald Metzler and W. Bruce Croft
Proceedings of the 30th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2007), pages 311-318
July 2007, Amsterdam, The Netherlands

Linear Feature-Based Models for Information Retrieval
Donald Metzler and W. Bruce Croft
Information Retrieval, 10(3):257-274, 2007

A Markov Random Field Model for Term Dependencies
Donald Metzler and W. Bruce Croft
Proceedings of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2005), pages 472-479
August 2005, Salvador, Brazil