dplyr: summarize
The summarize() function in dplyr (also spelled summarise()) creates summary statistics from the columns of a data frame. When run on an ungrouped data frame, each summary should reduce a column to a single value, such as its mean, median, or mode, so the result is a data frame with one row. The next lesson covers how to generate summary statistics on subsets of the data using grouping.
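As a rough illustration of summarizing an ungrouped data frame, here is a minimal sketch. It assumes the built-in mtcars data set as a stand-in (the video's Kaggle notebook may use different data), and the output column names (mean_mpg, median_mpg, max_hp, n_cars) are made up for this example.

```r
library(dplyr)

# mtcars ships with base R; it is used here only as a stand-in data set.
# Each argument to summarize() reduces a column to a single value,
# so the ungrouped result is a data frame with exactly one row.
car_summary <- mtcars %>%
  summarize(
    mean_mpg   = mean(mpg),    # average miles per gallon
    median_mpg = median(mpg),  # middle value of miles per gallon
    max_hp     = max(hp),      # largest horsepower value
    n_cars     = n()           # number of rows summarized
  )

print(car_summary)
```

Each name on the left of an = becomes a column in the result, so the summary can be piped into further dplyr verbs like any other data frame.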
Link to the Kaggle Notebook code used for this video series:
https://www.kaggle.com/hamelg/dplyr-in-r
View the whole dplyr in R playlist here:
https://www.youtube.com/watch?v=THGFXV4RW8U&list=PLiC1doDIe9rC8RgWPAWqDETE-VbKOWfWl
dplyr cheat sheet from RStudio:
https://www.rstudio.com/wp-content/uploads/2015/02/data-wrangling-cheatsheet.pdf
dplyr documentation:
https://cran.r-project.org/web/packages/dplyr/dplyr.pdf
Follow DataDaft on social media for news and updates:
Twitter: https://twitter.com/DataDaft
Join the DataDaft Discord to discuss all things data science:
https://discord.gg/ZS2yPNw
Other Videos By DataDaft
| 2019-10-31 | How to Make Pie Charts in R |
| 2019-10-29 | How to Make Boxplots in R |
| 2019-10-15 | How to Make Stacked and Grouped Bar Plots in R |
| 2019-10-14 | How To Make a Bar Plot in R |
| 2019-10-14 | How to Make a Scatterplot in R |
| 2019-10-10 | How to Make a Density Plot in R |
| 2019-10-09 | How To Make a Histogram in R |
| 2019-10-08 | U.S. Major Sports League Worldwide Web Search Volume (01/2004 - 10/2019) |
| 2019-10-07 | dplyr: Joins |
| 2019-10-06 | dplyr: Grouping |
| 2019-10-05 | dplyr: summarize |
| 2019-10-03 | MLB All-Time Team Records: Most Home Runs Hit in a Season (1871-2019) |
| 2019-09-30 | dplyr: gather and spread |
| 2019-09-28 | dplyr: separate and unite |
| 2019-09-27 | dplyr: mutate |
| 2019-09-25 | dplyr: rename and arrange |
| 2019-09-24 | dplyr: select |
| 2019-09-17 | dplyr: filter |
| 2019-09-13 | dplyr: Pipes |
| 2019-09-13 | dplyr: Getting Started |
| 2019-09-11 | Introduction to R: Descriptive Statistics |

