Python 3 Programming Tutorial - Parsing Websites with re and urllib

Channel: sentdex

Subscribers: 1,410,000

Published on July 22, 2014 3:00:24 AM ● Video Link: https://www.youtube.com/watch?v=GEshegZzt3M

Category: Tutorial

Duration: 7:29

196,377 views

1,746

In this video, we use two of Python 3's standard library modules, re and urllib, to parse paragraph data from a website. As we saw previously, when you fetch a page with urllib in Python 3, you get all of its HTML, the same thing you see with "view source" in a browser. That HTML renders well in a browser, but the raw source is very messy to read. For this reason, we need to build something that can sift through the mess and pull out just the article data we are interested in.
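As a rough illustration of the idea, here is a minimal sketch of pulling paragraph text out of raw HTML with re. It is not necessarily the exact code from the video: the names extract_paragraphs, fetch_html and sample_html are made up for this example, the page is assumed to be UTF-8, and the regular expression only handles simple, well-formed <p> tags. An inline sample string is used so the example runs without a network connection.

```python
import re
import urllib.request

# Non-greedy match for everything between <p ...> and </p>.
# DOTALL lets a paragraph span several lines of source.
PARAGRAPH_RE = re.compile(r"<p\b[^>]*>(.*?)</p>", re.DOTALL | re.IGNORECASE)
# Matches any remaining tag (such as <a> or <b>) inside a paragraph.
TAG_RE = re.compile(r"<[^>]+>")


def extract_paragraphs(html):
    """Return the text of each <p> element with nested tags removed."""
    paragraphs = []
    for body in PARAGRAPH_RE.findall(html):
        text = TAG_RE.sub("", body)       # drop inline tags
        text = " ".join(text.split())     # collapse whitespace and newlines
        if text:
            paragraphs.append(text)
    return paragraphs


def fetch_html(url):
    """Download a page and decode it (UTF-8 is an assumption here)."""
    with urllib.request.urlopen(url) as resp:
        return resp.read().decode("utf-8", errors="replace")


# Hypothetical sample standing in for a downloaded page.
sample_html = """
<html><body>
<div id="nav"><a href="/">Home</a></div>
<p>First paragraph with a <a href="/x">link</a>.</p>
<p class="note">Second
paragraph.</p>
</body></html>
"""

paragraphs = extract_paragraphs(sample_html)
for p in paragraphs:
    print(p)
```

To try it on a live page, pass the result of fetch_html(some_url) to extract_paragraphs instead of sample_html. Regular expressions are fine for a quick scrape like this, but they can break on nested or malformed markup, so treat the pattern as a starting point.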

Sample code for this basics series: http://pythonprogramming.net/beginner-python-programming-tutorials/

Python 3 Programming tutorial Playlist: http://www.youtube.com/watch?v=oVp1vrfL_w4&feature=share&list=PLQVvvaa0QuDe8XSftW-RAxdo6OmaeL85M

http://seaofbtc.com
http://sentdex.com
http://hkinsley.com
https://twitter.com/sentdex

Bitcoin donations: 1GV7srgR4NJx4vrk7avCmmVQQrqmv87ty6

Other Videos By sentdex

2014-07-29	Pandas with Python 2.7 Part 9 - Statistical Information
2014-07-29	Pandas with Python 2.7 Part 3 - Reading from and saving to CSV
2014-07-29	Pandas with Python 2.7 Part 6 - Data visualization with Matplotlib
2014-07-29	Pandas with Python 2.7 Part 1 - Downloading and dependencies
2014-07-29	Pandas with Python 2.7 Part 7 - 3D Matplotlib Graphs
2014-07-29	Python 3 Programming Tutorial - Tkinter adding images and text
2014-07-28	Python 3 Programming Tutorial - Tkinter menu bar
2014-07-26	Python 3 Programming Tutorial - Tkinter event handling
2014-07-24	Python 3 Programming Tutorial - Tkinter adding buttons
2014-07-22	Python 3 Programming Tutorial - tkinter module making windows
2014-07-21	Python 3 Programming Tutorial - Parsing Websites with re and urllib
2014-07-20	Python 3 Programming Tutorial - Regular Expressions / Regex with re
2014-07-19	Python 3 Programming Tutorial - urllib module
2014-07-18	Python 3 Programming Tutorial - Sys Module
2014-07-17	Python 3 Programming Tutorial - OS Module
2014-07-16	Python 3 Programming Tutorial - Built-in Functions
2014-07-13	Python 3 Programming Tutorial - Dictionaries
2014-07-13	Python 3 Programming Tutorial - Multi-line Print
2014-07-12	Python 3 Programming Tutorial - Try and Except error Handling
2014-07-11	Python 3 Programming Tutorial - Reading from a CSV spreadsheet
2014-07-10	Python 3 Programming Tutorial - Multi-dimensional List

Tags: Python (Programming Language), Regular Expression, Website Parse Template, scrape, spider, parse, urllib, regex, grab, article, paragraph, programming, tutorial, basics, python 3, python 3.3, python 3.4, python 2 and 3, python 2.7, 2.7, 3.3, 3.4, beginner, how-to, coding, easy