ML with text data – from language to features