dplyr: Grouping
The group_by function in dplyr lets use other functions like summarize and mutate on subgroups within a data frame rather than operating on entire columns all at once. This is useful for generating statistics and creating new variables according to the levels of categorical variables.
Link to the Kaggle Notebook code used for this video series:
https://www.kaggle.com/hamelg/dplyr-in-r
View the whole dplyr in R playlist here:
https://www.youtube.com/watch?v=THGFXV4RW8U&list=PLiC1doDIe9rC8RgWPAWqDETE-VbKOWfWl
dplyr cheat sheet from RStudio:
https://www.rstudio.com/wp-content/uploads/2015/02/data-wrangling-cheatsheet.pdf
dplyr documentation:
https://cran.r-project.org/web/packages/dplyr/dplyr.pdf
Follow DataDaft on social media for news and updates:
Twitter: https://twitter.com/DataDaft
Join the DataDaft Discord to discuss all things data science:
https://discord.gg/ZS2yPNw
#dplyr #rprogramming #datascience
Other Videos By DataDaft
| 2019-11-01 | How to Make Violin Plots in R |
| 2019-10-31 | How to Make Pie Charts in R |
| 2019-10-29 | How to Make Boxplots in R |
| 2019-10-15 | How to Make Stacked and Grouped Bar Plots in R |
| 2019-10-14 | How To Make a Bar Plot in R |
| 2019-10-14 | How to Make a Scatterplot in R |
| 2019-10-10 | How to Make a Density Plot in R |
| 2019-10-09 | How To Make a Histogram in R |
| 2019-10-08 | U.S. Major Sports League Worldwide Web Search Volume (01/2004 - 10/2019) |
| 2019-10-07 | dplyr: Joins |
| 2019-10-06 | dplyr: Grouping |
| 2019-10-05 | dplyr: summarize |
| 2019-10-03 | MLB All-Time Team Records: Most Home Runs Hit in a Season (1871-2019) |
| 2019-09-30 | dplyr: gather and spread |
| 2019-09-28 | dplyr: separate and unite |
| 2019-09-27 | dplyr: mutate |
| 2019-09-25 | dlpyr: rename and arrange |
| 2019-09-24 | dplyr: select |
| 2019-09-17 | dplyr: filter |
| 2019-09-13 | dplyr: Pipes |
| 2019-09-13 | dplyr: Getting Started |

