Stemming - Natural Language Processing With Python and NLTK p.3

Channel:
Subscribers:
1,410,000
Published on ● Video Link: https://www.youtube.com/watch?v=yGKTphqxR9Q



Duration: 8:16
193,489 views
2,003


Another form of data pre-processing with natural language processing is called "stemming."

This is the process where we remove word affixes from the end of words.

The reason we would do this is so that we do not need to store the meaning of every single tense of a word. For example:

Reader
Reading
Read

Aside from tense, and even one of these is a noun, they all have the same meaning for their "root" stem (read).

This way, we store one single value for the root stem of "read." Then, when we wish to learn more, we can look into the affixes that were on the end, like "ing" is an active word, or in the past, then you have reader as someone who reads... then just plain read as either past tense or current.

sample code: http://pythonprogramming.net
http://hkinsley.com
https://twitter.com/sentdex
http://sentdex.com
http://seaofbtc.com




Other Videos By sentdex


2015-05-14Naive Bayes - Natural Language Processing With Python and NLTK p.13
2015-05-13Words as Features for Learning - Natural Language Processing With Python and NLTK p.12
2015-05-12Text Classification - Natural Language Processing With Python and NLTK p.11
2015-05-11WordNet - Natural Language Processing With Python and NLTK p.10
2015-05-10NLTK Corpora - Natural Language Processing With Python and NLTK p.9
2015-05-07Named Entity Recognition - Natural Language Processing With Python and NLTK p.7
2015-05-07Lemmatizing - Natural Language Processing With Python and NLTK p.8
2015-05-06Chinking - Natural Language Processing With Python and NLTK p.6
2015-05-05Chunking - Natural Language Processing With Python and NLTK p.5
2015-05-04Part of Speech Tagging - Natural Language Processing With Python and NLTK p.4
2015-05-03Stemming - Natural Language Processing With Python and NLTK p.3
2015-05-02Stop Words - Natural Language Processing With Python and NLTK p.2
2015-05-01Natural Language Processing With Python and NLTK p.1 Tokenizing words and Sentences
2015-04-06Flask Tutorial Web Development with Python 23 - More CMS
2015-04-06Flask Tutorial Web Development with Python 24 - Crontab / Cron jobs
2015-04-06Flask Tutorial Web Development with Python 25 - Conclusion
2015-04-06Flask Tutorial Web Development with Python 22 - Content Management System
2015-04-02Flask Tutorial Web Development with Python 21 - Dynamic user Content
2015-04-02Flask Tutorial Web Development with Python 20 - Login Required Decorator Wrapper
2015-04-02Flask Tutorial Web Development with Python 19 - user login system
2015-04-02Flask Tutorial Web Development with Python 18 - Password Hashing with Passlib



Tags:
Natural Language Toolkit (Software)
Natural Language Processing (Software Genre)
Python (Programming Language)
Stemming
Outline Of Natural Language Processing
porter stemmer
PorterStemmer
nltk.stem
NLTK