Python 3 Text Processing with NLTK 3 Cookbook

Chapter 3. Creating Custom Corpora

In this chapter, we will cover the following recipes:

  • Setting up a custom corpus
  • Creating a wordlist corpus
  • Creating a part-of-speech tagged word corpus
  • Creating a chunked phrase corpus
  • Creating a categorized text corpus
  • Creating a categorized chunk corpus reader
  • Lazy corpus loading
  • Creating a custom corpus view
  • Creating a MongoDB-backed corpus reader
  • Corpus editing with file locking