Guide to tidy git analysis
@drsimonj here to help you embark on git repo analyses!
Ever wondered who contributes to git repos? How their contributions have changed over time? What sort of conventions different authors use in their commit messages? Maybe you were inspired by Mara Averick to contribute to tidyverse packages and wonder how you fit in?
This post – intended for intermediate R users – will help you answer these sorts of questions using tidy R tools.
Install and load these packages to follow along:
# Parts 1 and 2 library(tidyverse) library(glue) library(stringr) library(forcats) # Part 3 library(tidygraph) library(ggraph) library(tidytext)
Part 1: Git repo to a tidy data frame
Get a git repo
We’ll explore the open-source ggplot2 repo by copying it to our local machine with
git clone, typically run on a command-line like:
git clone <repository_url> <directory>
Continue reading →