Python 3 Programming Tutorial - Parsing Websites with re and urllib

Channel:
Subscribers:
1,410,000
Published on ● Video Link: https://www.youtube.com/watch?v=GEshegZzt3M



Category:
Tutorial
Duration: 7:29
196,377 views
1,746


In this video, we use two of Python 3's standard library modules, re and urllib, to parse paragraph data from a website. As we saw, initially, when you use Python 3 and urllib to parse a website, you get all of the HTML data, like using "view source" on a web page. This HTML data is great if you are viewing via a browser, but is incredibly messy if you are viewing the raw source. For this reason, we need to build something that can sift through the mess and just pull the article data that we are interested in.

Sample code for this basics series: http://pythonprogramming.net/beginner-python-programming-tutorials/

Python 3 Programming tutorial Playlist: http://www.youtube.com/watch?v=oVp1vrfL_w4&feature=share&list=PLQVvvaa0QuDe8XSftW-RAxdo6OmaeL85M

http://seaofbtc.com
http://sentdex.com
http://hkinsley.com
https://twitter.com/sentdex

Bitcoin donations: 1GV7srgR4NJx4vrk7avCmmVQQrqmv87ty6







Tags:
Python (Programming Language)
Regular Expression
Website Parse Template
scrape
spider
parse
urllib
re
regex
grab
article
paragraph
programming
tutorial
basics
python 3
python 3.3
python 3.4
python 2 and 3
python 2.7
2.7
3.3
3.4
beginner
how-to
coding
easy