Stemming - Natural Language Processing With Python and NLTK p.3

Channel:

sentdex

Subscribers:

1,410,000

Published on May 3, 2015 9:52:17 PM ● Video Link: https://www.youtube.com/watch?v=yGKTphqxR9Q

Duration: 8:16

193,489 views

2,003

Another form of data pre-processing with natural language processing is called "stemming."

This is the process where we remove word affixes from the end of words.

The reason we would do this is so that we do not need to store the meaning of every single tense of a word. For example:

Reader
Reading
Read

Aside from tense, and even one of these is a noun, they all have the same meaning for their "root" stem (read).

This way, we store one single value for the root stem of "read." Then, when we wish to learn more, we can look into the affixes that were on the end, like "ing" is an active word, or in the past, then you have reader as someone who reads... then just plain read as either past tense or current.

sample code: http://pythonprogramming.net
http://hkinsley.com
https://twitter.com/sentdex
http://sentdex.com
http://seaofbtc.com

Other Videos By sentdex

2015-05-14	Naive Bayes - Natural Language Processing With Python and NLTK p.13
2015-05-13	Words as Features for Learning - Natural Language Processing With Python and NLTK p.12
2015-05-12	Text Classification - Natural Language Processing With Python and NLTK p.11
2015-05-11	WordNet - Natural Language Processing With Python and NLTK p.10
2015-05-10	NLTK Corpora - Natural Language Processing With Python and NLTK p.9
2015-05-07	Named Entity Recognition - Natural Language Processing With Python and NLTK p.7
2015-05-07	Lemmatizing - Natural Language Processing With Python and NLTK p.8
2015-05-06	Chinking - Natural Language Processing With Python and NLTK p.6
2015-05-05	Chunking - Natural Language Processing With Python and NLTK p.5
2015-05-04	Part of Speech Tagging - Natural Language Processing With Python and NLTK p.4
2015-05-03	Stemming - Natural Language Processing With Python and NLTK p.3
2015-05-02	Stop Words - Natural Language Processing With Python and NLTK p.2
2015-05-01	Natural Language Processing With Python and NLTK p.1 Tokenizing words and Sentences
2015-04-06	Flask Tutorial Web Development with Python 23 - More CMS
2015-04-06	Flask Tutorial Web Development with Python 24 - Crontab / Cron jobs
2015-04-06	Flask Tutorial Web Development with Python 25 - Conclusion
2015-04-06	Flask Tutorial Web Development with Python 22 - Content Management System
2015-04-02	Flask Tutorial Web Development with Python 21 - Dynamic user Content
2015-04-02	Flask Tutorial Web Development with Python 20 - Login Required Decorator Wrapper
2015-04-02	Flask Tutorial Web Development with Python 19 - user login system
2015-04-02	Flask Tutorial Web Development with Python 18 - Password Hashing with Passlib

Tags:

Natural Language Toolkit (Software)

Natural Language Processing (Software Genre)

Python (Programming Language)

Stemming

Outline Of Natural Language Processing

porter stemmer

PorterStemmer

nltk.stem

NLTK

Channel	Latest
Mahaa News USA	6 hours ago
xStill_Infamous	6 hours ago
Lucas hss	6 hours ago
David Burgess Mechanical Music, MUGEN, And More	6 hours ago
CranKy Ducklings	6 hours ago
Nerdcultuur	7 hours ago
WhyJordie	7 hours ago
TheNino	7 hours ago
Factykillian	7 hours ago
dougbarrett619part2	7 hours ago
Mr. Robert	7 hours ago
TNT Sports Chile	7 hours ago
Jove	7 hours ago
Devvcario	7 hours ago
Sukru Kagnici	7 hours ago
FilmIsNow Movies	7 hours ago
Haki	7 hours ago
Wumbotize	7 hours ago
Ryuyin Ovi	7 hours ago
World of Longplays	7 hours ago
Inspiration House Network	7 hours ago
Claaaaash	7 hours ago
Pepe	8 hours ago
hmmmp	8 hours ago
Retro & Zeivu Gaming	8 hours ago