Introduction to R: Preparing Numeric Data

Channel:
Subscribers:
54,100
Published on ● Video Link: https://www.youtube.com/watch?v=o5GCVDqniIg



Duration: 14:49
639 views
20


Numeric data tends to be cleaner than text data, but there are still a variety of preprocessing steps to consider when working with numeric data to prepare it for analysis and modeling. In this lesson we consider four data preparation steps including: data normalization (centering and scaling), dealing with skewed data, dealing with highly correlated variables and imputing missing values.

This is lesson 15 of a 30-part introduction to the R programming language for data analysis and predictive modeling. Link to the code notebook below:

Introduction to R: Preparing Numeric Data https://www.kaggle.com/hamelg/intro-to-r-part-15-preparing-numeric-data

This guide does not assume any prior exposure to R, programming or data science. It is intended for beginners with an interest in data science and those who might know other programming languages and would like to learn R.

I will create the videos for this guide such that you should be able to learn a lot just watching on YouTube, but to get the most out of the guide, it is recommended that you create a Kaggle account so that you can fork and edit each lesson so that you can follow along and run code yourself.

Follow DataDaft on social media for news and updates:
Twitter: https://twitter.com/DataDaft

Introduction to R Playlist:
https://www.youtube.com/playlist?list=PLiC1doDIe9rDjk9tSOIUZJU4s5NpEyYtE







Tags:
data preprocessing
data normalization
centering and scaling data
reducing data skew
highly correlated variables
multicollinearity
numeric data preparation
data cleaning in R
R data cleaning
data manipulation
data preparation in r
r basics
r programming tutorial
r programming for beginners
introduction to R
intro to r
r programming
r tutorial
r programming language
getting started with r
r notebook tutorial
data science
data analysis