Cloud9

A Hadoop toolkit for working with big data

Getting Started

Text Collections

Exercises

Reference Implementations

Cloud9 provides reference implementations of many design patterns and algorithms introduced in the book Data-Intensive Text Processing with MapReduce by Lin and Dyer. Some of these examples are also solutions to exercises included with the library, which have been previously used in MapReduce courses at the University of Maryland.