Using dplyr's group_by for grouping data in R (CC163)

Описание к видео Using dplyr's group_by for grouping data in R (CC163)

In this episode of Code Club, Pat uses dplyr's group_by function for grouping data in R without using the summarize function. Instead he uses slice_max to return the row or rows for each group that has the most recent date. He'll demonstrate how to use these functions with a massive data frame that we downloaded from Our World in Data that describes COVID-19 vaccination rates by country and day.

In this episode, Pat uses #group_by and #slice_max from the #dplyr #R package in #Rstudio. The accompanying blog post can be found at https://www.riffomonas.org/code_club/....

Want more practice on the concepts covered in Code Club? You can sign up for my weekly newsletter at https://shop.riffomonas.org/youtube to get practice problems, tips, and insights.

If you're interested in taking an upcoming 3 day R workshop be sure to check out our schedule at https://riffomonas.org/workshops/


You can also find complete tutorials for learning R with the tidyverse using...
Microbial ecology data: https://www.riffomonas.org/minimalR/
General data: https://www.riffomonas.org/generalR/

0:00 Using group_by with and without summarize
4:19 Using group_by without summarize
7:46 Ungrouping data with ungroup
8:24 Removing columns with select
8:57 Putting it all together to get most recent data

Комментарии

Информация по комментариям в разработке