Edit Distance and Jaccard Distance Calculation with NLTK

Edit Distance

Edit Distance (a.k.a. Levenshtein Distance) is a measure of similarity between two strings referred to as the source string and the target string.

The distance between the source string and the target string is the minimum number of edit operations (deletions, insertions, or substitutions) required to transform the source into the target. The lower the distance, the more similar the two strings. 

Continue reading “Edit Distance and Jaccard Distance Calculation with NLTK”

Chatbot Building for Facebook Messenger

In this chatbot tutorial, you will learn the basic concepts behind building a Chatbot. By the end of this tutorial, you will be able to create a simple Facebook chatbot bot.

Continue reading “Chatbot Building for Facebook Messenger”

Python Programmer, Computer Scientist, Researcher at Federal University of Maranhão, Brazil, and member of the Intelligent Distributed Systems Laboratory, who loves sharing what he knows. Among the topics Ivan is interested in are: Machine Learning (ML), Reinforcement Learning (RL), Game Theory (GT), Natural Language Processing (NLP), Computer Vision (CV), Time Series (TS), and other Artificial Intelligence (AI) related topics.

Text Classification with Pandas & Scikit

In this tutorial, we introduce one of most common NLP and Text Mining tasks, that of Document Classification. Note that while being common, it is far from useless, as the problem of classifying content is a constant hurdle we humans face every day. It is important to know basic elements of this problem since many of them can be used for other tasks such as sentiment analysis.

Continue reading “Text Classification with Pandas & Scikit”

Python Programmer, Computer Scientist, Researcher at Federal University of Maranhão, Brazil, and member of the Intelligent Distributed Systems Laboratory, who loves sharing what he knows. Among the topics Ivan is interested in are: Machine Learning (ML), Reinforcement Learning (RL), Game Theory (GT), Natural Language Processing (NLP), Computer Vision (CV), Time Series (TS), and other Artificial Intelligence (AI) related topics.