Building dataset - p.4 Data Analysis with Python and Pandas Tutorial

Channel:
Subscribers:
1,410,000
Published on ● Video Link: https://www.youtube.com/watch?v=3GpvWlVinf0



Category:
Tutorial
Duration: 11:04
123,928 views
927


In this part of Data Analysis with Python and Pandas tutorial series, we're going to expand things a bit. Let's consider that we're multi-billionaires, or multi-millionaires, but it's more fun to be billionaires, and we're trying to diversify our portfolio as much as possible. We want to have all types of asset classes, so we've got stocks, bonds, maybe a money market account, and now we're looking to get into real estate to be solid. You've all seen the commercials right? You buy a CD for $60, attend some $500 seminar, and you're set to start making your 6 figure at a time investments into property, right?

Okay, maybe not, but we definitely want to do some research and have some sort of strategy for buying real estate. So, what governs the prices of homes, and do we need to do the research to find this out? Generally, no, you don't really need to do that digging, we know the factors. The factors for home prices are governed by: The economy, interest rates, and demographics. These are the three major influences in general for real estate value. Now, of course, if you're buying land, various other things matter, how level is it, are we going to need to do some work to the land before we can actually lay foundation, how is drainage etc. If there is a house, then we have even more factors, like the roof, windows, heating/AC, floors, foundation, and so on. We can begin to consider these factors later, but first we'll start at the macro level. You will see how quickly our data sets inflate here as it is, it'll blow up fast.

So, our first step is to just collect the data. Quandl still represents a great place to start, but this time let's automate the data grabbing. We're going to pull housing data for the 50 states first, but then we stand to try to gather other data as well. We definitely dont want to be manually pulling this data. First, if you do not already have an account, you need to get one. This will give you an API key and unlimited API requests to the free data, which is awesome.

Once you create an account, go to your account / me, whatever they are calling it at the time, and then find the section marked API key. That's your key, which you will need. Next, we want to grab the Quandl module. We really don't need the module to make requests at all, but it's a very small module, and the size is worth the slight ease it gives us, so might as well. Open up your terminal/cmd.exe and do pip install quandl (again, remember to specify the full path to pip if pip is not recognized).

Next, we're ready to rumble, open up a new editor.

http://pythonprogramming.net
https://twitter.com/sentdex




Other Videos By sentdex


2015-10-29Adding other economic indicators - p.14 Data Analysis with Python and Pandas Tutorial
2015-10-27Joining 30 year mortgage rate - p.13 Data Analysis with Python and Pandas Tutorial
2015-10-21Applying Comparison Operators to DataFrame - p.12 Data Analysis with Python and Pandas Tutorial
2015-10-17Rolling statistics - p.11 Data Analysis with Python and Pandas Tutorial
2015-10-12Handling Missing Data - p.10 Data Analysis with Python and Pandas Tutorial
2015-10-09Resampling - p.9 Data Analysis with Python and Pandas Tutorial
2015-10-05Percent Change and Correlation Tables - p.8 Data Analysis with Python and Pandas Tutorial
2015-10-03Pickling - p.7 Data Analysis with Python and Pandas Tutorial
2015-09-29Joining and Merging Dataframes - p.6 Data Analysis with Python and Pandas Tutorial
2015-09-25Concatenating and Appending dataframes - p.5 Data Analysis with Python and Pandas Tutorial
2015-09-23Building dataset - p.4 Data Analysis with Python and Pandas Tutorial
2015-09-20IO Basics - p.3 Data Analysis with Python and Pandas Tutorial
2015-09-16Pandas Basics - p.2 Data Analysis with Python and Pandas Tutorial
2015-09-14Data Analysis with Python and Pandas Tutorial Introduction
2015-09-11PythonProgramming.net's +=1 Subscription
2015-09-01OpenCV Face Detection with Raspberry Pi - Robotics with Python p.7
2015-08-30Programming Autonomy - Robotics with Python Raspberry Pi and GoPiGo p.6
2015-08-27Weaponizing our Robot - Robotics with Python Raspberry Pi and GoPiGo p.5
2015-08-25Programming Remote Control - Robotics with Python Raspberry Pi and GoPiGo p.4
2015-08-23Programming Robot Basics - Robotics with Python Raspberry Pi and GoPiGo p.3
2015-08-22Supplies Needed - Robotics with Python Raspberry Pi and GoPiGo p.2



Tags:
Pandas
Python (Programming Language)
Data Analysis (Media Genre)
Data (Website Category)
dataframe
Data Set