Creating a categorized text corpus