NLP220: Data Collection, Wrangling and Crowdsourcing for Natural Language Processing

This course covers a broad set of tools and core skills required for working with Natural Language Data. It covers methods for collecting, merging, cleaning, structuring and analyzing the properties of large and heterogeneous datasets of natural language, in order to address questions and support applications relying on those data. It covers both working with existing corpora as well as the challenges in collecting new corpora. Enrollment restricted to NLP graduate students.

5 Credits


While the information on this web site is usually the most up to date, in the event of a discrepancy please contact your adviser to confirm which information is correct.