14 Field specific

14.1 An introduction to quantitative analysis of political data in R

by Erik Gahner Larsen, Zoltán Fazekas

In this book, we aim to provide an easily accessible introduction to R for the collection, study and presentation of different types of political data. Specifically, the book will teach you how to get different types of political data into R and manipulate, analyze and visualize the output. In doing this, we will not only teach you how to get existing data into R, but also how to collect your own data.

Link: http://qpolr.com/

14.2 An Open-Source Active Learning Curriculum for Data Science in Engineering

by Zachary del Rosario

This work provides open-source content for an active learning curriculum in data science. The scope of the content is sufficient for a full-semester introduction to scientifically reproducible statistical computation, data wrangling, visualization, basic statistical literacy, and data-driven modeling. The content is broken into short exercises that introduce new concepts, and longer challenges that encourage students to develop those skills in an open-ended context.

Paid: Free (and open source)

Link: https://zdelrosario.github.io/data-science-curriculum/index.html

14.3 Computer-age Calculus with R

by Daniel Kaplan

R is closely associated with statistics, but not with calculus. It turns out that R is an excellent language for doing calculus.

This book shows how to do common calculus calculations using R.

Link: https://dtkaplan.github.io/RforCalculus/

14.4 Crime by the Numbers: A Criminologist’s Guide to R

by Jacob Kaplan

This book introduces the programming language R and is meant for undergraduate or graduate students studying criminology. R is a programming language that is well-suited to the type of work frequently done in criminology: taking messy data and turning it into useful information. While R is a useful tool for many fields of study, this book focuses on the skills criminologists should know and uses crime data for the example data sets.

Link: https://crimebythenumbers.com/

14.5 Cryptocurrency Research Open Source R Tutorial

by Riccardo (Ricky) Esclapon, John Chandler Johnson, Kai R. Larsen

The tutorial is in R. For those without experience programming in R we have a high-level version to help you learn before attempting the full version. Scroll down for a breakdown of the individual sections for an overview of what you will learn throughout.

You will get more familiar with tools from the tidyverse, including dplyr, ggplot2, tibble and purrr. These tools provide an excellent complete ecosystem to do data science in R.

You will learn to create machine learning models and how to fairly assess their performance.

Cryptocurrency Data: You will learn these tools analyzing the latest cryptocurrency data. The tutorial automatically refreshes every 12 hours and the data is publicly available and refreshed hourly.

Link: https://cryptocurrencyresearch.org/

14.6 Data Science in Education Using R

by Ryan A. Estrellado, Emily A. Bovee, Jesse Mostipak, Isabella C. Velásquez

Dear Data Scientists, Educators, and Data Scientists who are Educators:

This book is a warm welcome and an invitation. If you’re a data scientist in education or an educator in data science, your role isn’t exactly straightforward. This book is our contribution to a growing movement to merge the paths of data analysis and education. We wrote this book to make your first step on that path a little clearer and a little less scary.

Link: https://datascienceineducation.com/

14.7 Data Skills for Reproducible Science

by PsyTeachR team, University of Glasgow

This course provides an overview of skills needed for reproducible research and open science using the statistical programming language R. Students will learn about data visualisation, data tidying and wrangling, archiving, iteration and functions, probability and data simulations, general linear models, and reproducible workflows. Learning is reinforced through weekly assignments that involve working with different types of data.

Link: https://psyteachr.github.io/msc-data-skills/

14.8 Discrete Data Analysis with R: Visualization and Modeling Techniques for Categorical and Count Data

by Michael Friendly, David Meyer

Presents an applied treatment of modern methods for the analysis of categorical data, both discrete response data and frequency data.

It explains how to use graphical methods for exploring data, spotting unusual features, visualizing fitted models, and presenting results.

Paid: $80

Link: http://ddar.datavis.ca/

14.9 Handbook of Graphs and Networks in People Analytics: With Examples in R and Python

by Keith McNulty

The technology of graphs is all around us, and enables so many of the ways in which we live our lives today. That same technology is also available to us at no cost as an analytic tool to allow us to better understand network structures and dynamics in the fields of science, technology, economics, sociology and psychology to name just a few. It is available to academics and practitioners alike, and can be used on problems ranging from a very small network analysis which takes a few minutes on a laptop, to massive scale network mining requiring days or weeks of processing time.

But here’s the problem: few people really know how to do network analysis. It is still considered by many as a deep specialism or even a ‘dark art.’ It shouldn’t be.

This book aims to make the field of graph and network analysis more approachable to students and professionals by explaining the most important elements of theory and sharing common methodologies using open source programming languages like R and Python. It does so by explaining theory in as much detail as is necessary to support analytical curiosity and interpretation, and by using a wide array of example data sets and code snippets to demonstrate the specific implementation and interpretation of methodologies.

Link: https://ona-book.org/

14.10 Handbook of Regression Modeling in People Analytics

by Keith McNulty

It is the author’s firm belief that all people analytics professionals should have a strong understanding of regression models and how to implement and interpret them in practice, and the aim with this book is to provide those who need it with help in getting there.

For accompanying solutions to some of the questions: https://keithmcnulty.github.io/peopleanalytics-regression-book/solutions/

Link: http://peopleanalytics-regression-book.org/index.html

14.11 How to be a modern scientist

by Jeffrey Leek

A book about how to be a scientist the modern, open-source way. The face of academia is changing. It is no longer sufficient to just publish or perish. We are now in an era where Twitter, Github, Figshare, and Alt Metrics are regular parts of the scientific workflow. Here I give high level advice about which tools to use, how to use them, and what to look out for. This book is appropriate for scientists at all levels who want to stay on top of the current technological developments affecting modern scientific careers.

Paid: Free or pay what you want $10

Link: https://leanpub.com/modernscientist

14.12 Introduction to Econometrics with R

by Christoph Hanck, Martin Arnold, Alexander Gerber, Martin Schmelzer

Beginners with little background in statistics and econometrics often have a hard time understanding the benefits of having programming skills for learning and applying Econometrics. Introduction to Econometrics with R is an interactive companion to the well-received textbook Introduction to Econometrics by James H. Stock and Mark W. Watson (2015). It gives a gentle introduction to the essentials of R programming and guides students in implementing the empirical applications presented throughout the textbook using the newly acquired skills. This is supported by interactive programming exercises and integration of interactive visualizations of central concepts which are based on the flexible JavaScript library D3.js.

Link: https://www.econometrics-with-r.org/

14.13 Learning Microeconometrics with R

by Christopher P. Adams

This book provides an introduction to the field of microeconometrics through the use of R. The focus is on applying current learning from the field to real world problems. It uses R to both teach the concepts of the field and show the reader how the techniques can be used. It is aimed at the general reader with the equivalent of a bachelor’s degree in economics, statistics or some more technical field. It covers the standard tools of microeconometrics: OLS, instrumental variables, Heckman selection and difference-in-differences. In addition, it introduces bounds, factor models, mixture models and empirical Bayesian analysis.

Paid: $100

Link: https://www.routledge.com/Learning-Microeconometrics-with-R/Adams/p/book/9780367255381

14.14 Public Policy Analytics: Code & Context for Data Science in Government

by Ken Steif, Ph.D.

The goal of this book is to make data science accessible to social scientists and City Planners, in particular. I hope to convince readers that one with strong domain expertise plus intermediate data skills can have a greater impact in government than the sharpest computer scientist who has never studied economics, sociology, public health, political science, criminology etc.

Link: https://urbanspatial.github.io/PublicPolicyAnalytics/

14.15 R for Excel users

by Julie Lowndes, Allison Horst

This course is for Excel users who want to add or integrate R and RStudio into their existing data analysis toolkit. It is a friendly intro to becoming a modern R user, full of tidyverse, RMarkdown, GitHub, collaboration & reproducibility.

Link: https://rstudio-conf-2020.github.io/r-for-excel/

14.16 R for SEO

by François Joly

Even though R is a terrific option for SEO, there are simply not enough resources out there. This guide is not here to deliver a course about R; there are plenty already. This guide is meant to be as practical as possible. How things should be done in an “R-ish way” is not the purpose of this guide. Grab what you want to grab and feel free to submit your own solution.

Link: https://www.rforseo.com/

14.17 R for Water Resources Data Science

by Ryan Peek, Rich Pauloo

Consists of two courses:

Introductory: This course is most relevant and targeted at folks who work with data, from analysts and program staff to engineers and scientists. This course provides an introduction to the power and possibility of a reproducible programming language (R) by demonstrating how to import, explore, visualize, analyze, and communicate different types of data. Using water resources based examples, this course guides participants through basic data science skills and strategies for continued learning and use of R.

Intermediate: In this course, we will move more quickly, assume familiarity with basic R skills, and also assume that the participant has working experience with more complex workflows, operations, and code-bases. Each module in this course functions as a “stand-alone” lesson, and can be read linearly, or out of order according to your needs and interests. Each module doesn’t necessarily require familiarity with the previous module.

This course emphasizes intermediate scripting skills like iteration, functional programming, writing functions, and controlling project workflows for better reproducibility and efficiency. It also covers approaches to working with more complex data structures like lists and time series data, the fundamentals of building Shiny apps, pulling water resources data from APIs, intermediate mapmaking and spatial data processing, and integrating version control in projects with git.

Link: https://www.r4wrds.com/

14.18 R Programming with Minecraft

by Brooke Anderson, Karl Broman, Gergely Daróczi, Mario Inchiosa, David Smith, Ali Zaidi

Minecraft is awesome fun, especially in creative mode, where you can build all sorts of crazy stuff. But ambitious building projects can be really tedious to create by hand. With the miner R package, you can write R code to manipulate your Minecraft world and create even more awesome stuff.

Here’s an introductory talk on it from the Rstats NYC conference: https://www.youtube.com/watch?v=r_JgPF8MJpY

Link: https://kbroman.org/miner_book/?s=09

14.19 Technical Foundations of Informatics

by Michael Freeman, Joel Ross

This book covers the foundational skills necessary to start writing computer programs to work with data using modern and reproducible techniques. It requires no technical background. These materials were developed for the INFO 201: Technical Foundations of Informatics course taught at the University of Washington Information School; however, they have been structured to be an online resource for anyone hoping to learn to work with information using programmatic approaches.

Link: https://info201.github.io/

 

Created and maintained by Oscar Baruffa
