I haven't yet found the right way to set the language for POS tagging and lemmatization in different languages. I have seen that NLTK lets you choose the language for sentence tokenization, but I need to perform two main operations: POS tagging and lemmatization. I would like to develop an app that analyzes reviews written by travelers, so I have to handle a lot of text written in different languages. I recently started working with NLP and have tried NLTK and TextBlob for analyzing texts.

WordNet works like a ready-made corpus. You can use this corpus as it is, or adapt it to build more dynamic functionality. With the help of WordNet, you can create your own corpus for spell checking, language translation, spam detection, and more. It can be used in artificial intelligence applications for text analysis.

Going deeper, WordNet is divided into four subnets. In a nutshell, one can treat it as a dictionary or thesaurus. It also holds information about related words and is used to find similarities between any two words. WordNet also provides information on coordinate terms, derivatives, senses, and more.
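
As a rough illustration of two points above (choosing a language for NLTK sentence tokenization, and using WordNet to compare two words), here is a minimal sketch. The names PUNKT_LANGUAGES, punkt_language, split_sentences, and word_similarity are hypothetical helpers written for this example and are not part of NLTK. The sketch assumes NLTK is installed and that the Punkt and WordNet data have already been fetched with nltk.download. It does not solve per-language POS tagging or lemmatization.

```python
from typing import List, Optional

# Hypothetical lookup table (not part of NLTK): maps ISO 639-1 codes to the
# language names used by NLTK's pretrained Punkt sentence tokenizer models.
PUNKT_LANGUAGES = {
    "en": "english",
    "de": "german",
    "fr": "french",
    "it": "italian",
    "es": "spanish",
    "pt": "portuguese",
    "nl": "dutch",
}


def punkt_language(iso_code: str, default: str = "english") -> str:
    """Return the Punkt language name for an ISO code, or a default."""
    return PUNKT_LANGUAGES.get(iso_code.lower(), default)


def split_sentences(text: str, iso_code: str) -> List[str]:
    """Split text into sentences with the Punkt model for the given language."""
    # Imported lazily so the lookup helper above works without NLTK data.
    # Assumes the Punkt tokenizer data has been downloaded.
    from nltk.tokenize import sent_tokenize

    return sent_tokenize(text, language=punkt_language(iso_code))


def word_similarity(word_a: str, word_b: str) -> Optional[float]:
    """Path similarity between the first WordNet senses of two English words.

    Returns None if either word is missing from WordNet or no path exists.
    """
    # Assumes the WordNet corpus has been downloaded.
    from nltk.corpus import wordnet as wn

    synsets_a = wn.synsets(word_a)
    synsets_b = wn.synsets(word_b)
    if not synsets_a or not synsets_b:
        return None
    # Taking the first sense of each word is a simplification; real
    # applications usually need some form of word sense disambiguation.
    return synsets_a[0].path_similarity(synsets_b[0])
```

For a review written in German, split_sentences(review_text, "de") would select the German Punkt model, while unknown codes fall back to English. Falling back silently is a design choice made for this sketch; an app handling many languages might prefer to raise an error instead.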