Resampling - p.9 Data Analysis with Python and Pandas Tutorial

Channel:
Subscribers:
1,410,000
Published on ● Video Link: https://www.youtube.com/watch?v=p_Fn_BksF9k



Category:
Tutorial
Duration: 11:51
48,254 views
342


Welcome to another data analysis with Python and Pandas tutorial. In this tutorial, we're going to be talking about smoothing out data by removing noise. There are two main methods to do this. The most popular method used is what is called resampling, though it might take many other names. This is where we have some data that is sampled at a certain rate. For us, we have the Housing Price Index sampled at a one-month rate, but we could sample the HPI every week, every day, every minute, or more, but we could also resample at every year, every 10 years, and so on.

Another environment where resampling almost always occurs is with stock prices, for example. Stock prices are intra-second. What winds up happening though, is usually stock prices are resampled to minute data at the lowest for free data. You can buy access to live data, however. On a long-term scale, usually the data will be sampled daily, or even every 3-5 days. This is often done to keep the size of the data being transferred low. For example, over the course of, say, one year, intra-second data is usually in the multiples of gigabytes, and transferring all of that at once is unreasonable and people would be waiting minutes or hours for pages to load.

Using our current data, which is currently sampled at once a month, how might we sample it instead to once every 6 months, or 2 years? Try to think about how you might personally write a function that might perform that task, it's a fairly challenging one, but it can be done. That said, it's a fairly computationally inefficient job, but Pandas has our backs and does it very fast.

Sample code and text tutorial for this video: http://pythonprogramming.net/resample-data-analysis-python-pandas-tutorial/

http://pythonprogramming.net
https://twitter.com/sentdex




Other Videos By sentdex


2015-11-23URL Converters - Flask Web Development with Python 28
2015-11-18Jinja Templating Cont'd - Flask Web Development with Python 27
2015-11-16Includes - Flask Web Development with Python 26
2015-11-08Scikit Learn Incorporation - p.16 Data Analysis with Python and Pandas Tutorial
2015-11-03Rolling Apply and Mapping Functions - p.15 Data Analysis with Python and Pandas Tutorial
2015-10-29Adding other economic indicators - p.14 Data Analysis with Python and Pandas Tutorial
2015-10-27Joining 30 year mortgage rate - p.13 Data Analysis with Python and Pandas Tutorial
2015-10-21Applying Comparison Operators to DataFrame - p.12 Data Analysis with Python and Pandas Tutorial
2015-10-17Rolling statistics - p.11 Data Analysis with Python and Pandas Tutorial
2015-10-12Handling Missing Data - p.10 Data Analysis with Python and Pandas Tutorial
2015-10-09Resampling - p.9 Data Analysis with Python and Pandas Tutorial
2015-10-05Percent Change and Correlation Tables - p.8 Data Analysis with Python and Pandas Tutorial
2015-10-03Pickling - p.7 Data Analysis with Python and Pandas Tutorial
2015-09-29Joining and Merging Dataframes - p.6 Data Analysis with Python and Pandas Tutorial
2015-09-25Concatenating and Appending dataframes - p.5 Data Analysis with Python and Pandas Tutorial
2015-09-23Building dataset - p.4 Data Analysis with Python and Pandas Tutorial
2015-09-20IO Basics - p.3 Data Analysis with Python and Pandas Tutorial
2015-09-16Pandas Basics - p.2 Data Analysis with Python and Pandas Tutorial
2015-09-14Data Analysis with Python and Pandas Tutorial Introduction
2015-09-11PythonProgramming.net's +=1 Subscription
2015-09-01OpenCV Face Detection with Raspberry Pi - Robotics with Python p.7



Tags:
Pandas
Python (Programming Language)
Data Analysis (Media Genre)
pandas.DataFrame.resample
DataFrame.resample