NLP preprocessing: lemmatization

Any comments are welcome.