IO Basics - p.3 Data Analysis with Python and Pandas Tutorial

Channel:
Subscribers:
1,410,000
Published on ● Video Link: https://www.youtube.com/watch?v=9Z7wvippeko



Category:
Tutorial
Duration: 17:12
163,147 views
1,400


Welcome to Part 3 of Data Analysis with Pandas and Python. In this tutorial, we will begin discussing IO, or input/output, with Pandas, and begin with a realistic use-case. To get ample practice, a very useful website is Quandl. Quandl contains a plethora of free and paid data sources. What makes this location great is that the data is generally normalized, it's all in one place, and extracting the data is the same method. If you are using Python, and you access the Quandl data via their simple module, then the data is automatically returned to a dataframe. For the purposes of this tutorial, we're going to just manually download a CSV file instead, for learning purposes, since not every data source you find is going to have a nice and neat module for extracting the datasets.

Let's say we're interested in maybe purchasing or selling a home in Austin, Texas. The zipcode there is 77006. We could go to the local housing listings and see what the current prices are, but this doesn't really give us any real historical information, so let's just try to get some data on this. Let's query for "home value index 77006." Sure enough, we can see an index here. There's top, middle, lower tier, three bedroom, and so on. Let's say, sure, we got a a three bedroom house. Let's check that out. Turns out Quandl already provides graphs, but let's grab the dataset anyway, make our own graph, and maybe do some other analysis. Go to download, and choose CSV. Pandas is capable of IO with csv, excel data, hdf, sql, json, msgpack, html, gbq, stata, clipboard, and pickle data, and the list continues to grow. Check out the IO Tools documentation for the current list. Take that CSV and move it into the local directory (the directory that you are currently working in / where this .py script is).

sample code and text-based write up for this tutorial: http://pythonprogramming.net/input-output-data-analysis-python-pandas-tutorial/

http://pythonprogramming.net
https://twitter.com/sentdex




Other Videos By sentdex


2015-10-27Joining 30 year mortgage rate - p.13 Data Analysis with Python and Pandas Tutorial
2015-10-21Applying Comparison Operators to DataFrame - p.12 Data Analysis with Python and Pandas Tutorial
2015-10-17Rolling statistics - p.11 Data Analysis with Python and Pandas Tutorial
2015-10-12Handling Missing Data - p.10 Data Analysis with Python and Pandas Tutorial
2015-10-09Resampling - p.9 Data Analysis with Python and Pandas Tutorial
2015-10-05Percent Change and Correlation Tables - p.8 Data Analysis with Python and Pandas Tutorial
2015-10-03Pickling - p.7 Data Analysis with Python and Pandas Tutorial
2015-09-29Joining and Merging Dataframes - p.6 Data Analysis with Python and Pandas Tutorial
2015-09-25Concatenating and Appending dataframes - p.5 Data Analysis with Python and Pandas Tutorial
2015-09-23Building dataset - p.4 Data Analysis with Python and Pandas Tutorial
2015-09-20IO Basics - p.3 Data Analysis with Python and Pandas Tutorial
2015-09-16Pandas Basics - p.2 Data Analysis with Python and Pandas Tutorial
2015-09-14Data Analysis with Python and Pandas Tutorial Introduction
2015-09-11PythonProgramming.net's +=1 Subscription
2015-09-01OpenCV Face Detection with Raspberry Pi - Robotics with Python p.7
2015-08-30Programming Autonomy - Robotics with Python Raspberry Pi and GoPiGo p.6
2015-08-27Weaponizing our Robot - Robotics with Python Raspberry Pi and GoPiGo p.5
2015-08-25Programming Remote Control - Robotics with Python Raspberry Pi and GoPiGo p.4
2015-08-23Programming Robot Basics - Robotics with Python Raspberry Pi and GoPiGo p.3
2015-08-22Supplies Needed - Robotics with Python Raspberry Pi and GoPiGo p.2
2015-08-22Robotics with Python Raspberry Pi and GoPiGo Introduction



Tags:
Pandas
Python (Programming Language)
Data Analysis (Media Genre)
dataframe
read_csv
read_hdf
read_sql
read_excel
read_html
to_csv
to_excel
to_hdf
to_pickle
to_sql