University of Maryland, College Park


CMSC 723: Computational Linguistics I (Fall 2009): Software

This course will be taught with Python and the Natural Language Toolkit (NLTK). You should install both on your machine at your earliest convenience. As of September 8, 2009, the latest version of Python is 2.6.2, and the latest version of NLTK is 2.0b5. As a note, the NLTK download page seems to only have 2.0b4 for Windows; you can get 2.0b5 from this alternate source. You should install Numpy and Matplotlib (needed for the third assignment), also available from the NLTK download page.

To find out what version of Python you have, run the command 'python -V' in a shell. To find out what version of NLTK you have, in Python, type 'import nltk' followed by 'nltk.__version__'. NLTK 2.0b5 fixes some warnings from 2.0b4. If you've installed 2.0b4 already on Windows, a reinstall of the newer NLTK version is how you "upgrade".

If your preferred development environment is Eclipse, you might be interested in Pydev, a Python plugin for Eclipse.

For the second assignment, you'll need to install nltk_contrib, which has the module for finite-state transducers. Follow these instructions:

  1. Download http://nltk.googlecode.com/files/nltk_contrib-2.0b5.zip.
  2. Unzip. Change to that directory.
  3. Run 'python setup.py install' at the command line or 'sudo python setup.py install' if you're on Linux/Mac

Back to main page


This page, first created: 14 Sep 2009; last updated: Valid XHTML 1.0! Valid CSS!