Classifying/clustering the text