13 Field specific

13.1 An introduction to quantitative analysis of political data in R

by Erik Gahner Larsen, Zoltán Fazekas

In this book, we aim to provide an easily accessible introduction to R for the collection, study and presentation of different types of political data. Specifically, the book will teach you how to get different types of political data into R and manipulate, analyze and visualize the output. In doing this, we will not only teach you how to get existing data into R, but also how to collect your own data.

Link: http://qpolr.com/

13.2 Analyzing Financial and Economic Data with R

by Marcelo S. Perlin

Not surprisingly, in fields with abundant access to data and practical applications, such as economics and finance, a graduate student or data analyst is expected to have learned at least one programming language that allows them to do their work efficiently. Learning how to program is becoming a requisite for the job market.

Link: https://www.msperlin.com/afedR/

13.3 Computer-age Calculus with R

by Daniel Kaplan

R is closely associated with statistics, but not with calculus. It turns out that R is an excellent language for doing calculus.

This book shows how to do common calculus calculations using R.

Link: https://dtkaplan.github.io/RforCalculus/

13.4 Crime by the Numbers: A Criminologist’s Guide to R

by Jacob Kaplan

This book introduces the programming language R and is meant for undergraduate or graduate students studying criminology. R is a programming language that is well-suited to the type of work frequently done in criminology: taking messy data and turning it into useful information. While R is a useful tool for many fields of study, this book focuses on the skills criminologists should know and uses crime data for the example data sets.

Link: https://crimebythenumbers.com

13.5 Cryptocurrency Research Open Source R Tutorial

by Riccardo (Ricky) Esclapon, John Chandler Johnson, Kai R. Larsen

The tutorial is in R. For those without experience programming in R we have a high-level version to help you learn before attempting the full version. Scroll down for a breakdown of the individual sections for an overview of what you will learn throughout.

You will get more familiar with tools from the tidyverse, including dplyr, ggplot2, tibble and purrr. These tools provide an excellent complete ecosystem to do data science in R.

You will learn to create machine learning models and how to fairly assess their performance.

Cryptocurrency Data: You will learn these tools by analyzing the latest cryptocurrency data. The tutorial automatically refreshes every 12 hours, and the data is publicly available and refreshed hourly.

Link: https://cryptocurrencyresearch.org/

13.6 Data Science in Education Using R

by Ryan A. Estrellado, Emily A. Bovee, Jesse Mostipak, Isabella C. Velásquez

Dear Data Scientists, Educators, and Data Scientists who are Educators:

This book is a warm welcome and an invitation. If you’re a data scientist in education or an educator in data science, your role isn’t exactly straightforward. This book is our contribution to a growing movement to merge the paths of data analysis and education. We wrote this book to make your first step on that path a little clearer and a little less scary.

Link: https://datascienceineducation.com/

13.7 Data Skills for Reproducible Science

by PsyTeachR team, University of Glasgow

This course provides an overview of skills needed for reproducible research and open science using the statistical programming language R. Students will learn about data visualisation, data tidying and wrangling, archiving, iteration and functions, probability and data simulations, general linear models, and reproducible workflows. Learning is reinforced through weekly assignments that involve working with different types of data.

Link: https://psyteachr.github.io/msc-data-skills/

13.8 Discrete Data Analysis with R: Visualization and Modeling Techniques for Categorical and Count Data

by Michael Friendly, David Meyer

Presents an applied treatment of modern methods for the analysis of categorical data, both discrete response data and frequency data.

It explains how to use graphical methods for exploring data, spotting unusual features, visualizing fitted models, and presenting results.

Paid: $80

Link: http://ddar.datavis.ca/

13.9 Handbook of Graphs and Networks in People Analytics: With Examples in R and Python

by Keith McNulty

The technology of graphs is all around us, and enables so many of the ways in which we live our lives today. That same technology is also available to us at no cost as an analytic tool to allow us to better understand network structures and dynamics in the fields of science, technology, economics, sociology and psychology to name just a few. It is available to academics and practitioners alike, and can be used on problems ranging from a very small network analysis which takes a few minutes on a laptop, to massive scale network mining requiring days or weeks of processing time.

But here’s the problem: few people really know how to do network analysis. It is still considered by many as a deep specialism or even a ‘dark art.’ It shouldn’t be.

This book aims to make the field of graph and network analysis more approachable to students and professionals by explaining the most important elements of theory and sharing common methodologies using open source programming languages like R and Python. It does so by explaining theory in as much detail as is necessary to support analytical curiosity and interpretation, and by using a wide array of example data sets and code snippets to demonstrate the specific implementation and interpretation of methodologies.

Link: https://ona-book.org/

13.10 Handbook of Regression Modeling in People Analytics

by Keith McNulty

It is the author’s firm belief that all people analytics professionals should have a strong understanding of regression models and how to implement and interpret them in practice, and the aim with this book is to provide those who need it with help in getting there.

For accompanying solutions to some of the questions: https://keithmcnulty.github.io/peopleanalytics-regression-book/solutions/

Link: http://peopleanalytics-regression-book.org/index.html

13.11 How to be a modern scientist

by Jeffrey Leek

A book about how to be a scientist the modern, open-source way. The face of academia is changing. It is no longer sufficient to just publish or perish. We are now in an era where Twitter, Github, Figshare, and Alt Metrics are regular parts of the scientific workflow. Here I give high level advice about which tools to use, how to use them, and what to look out for. This book is appropriate for scientists at all levels who want to stay on top of the current technological developments affecting modern scientific careers.

Paid: Free or pay what you want $10

Link: https://leanpub.com/modernscientist

13.12 Introduction to Econometrics with R

by Christoph Hanck, Martin Arnold, Alexander Gerber, Martin Schmelzer

Instead of confronting students with pure coding exercises and complementary classic literature like the book by Venables & Smith (2010), we figured it would be better to provide interactive learning material that blends R code with the contents of the well-received textbook Introduction to Econometrics by Stock & Watson (2015) which serves as a basis for the lecture.

Link: https://www.econometrics-with-r.org/

13.13 Learning Microeconometrics with R

by Christopher P. Adams

This book provides an introduction to the field of microeconometrics through the use of R. The focus is on applying current learning from the field to real-world problems. It uses R both to teach the concepts of the field and to show the reader how the techniques can be used. It is aimed at the general reader with the equivalent of a bachelor’s degree in economics, statistics or some more technical field. It covers the standard tools of microeconometrics: OLS, instrumental variables, Heckman selection and difference-in-differences. In addition, it introduces bounds, factor models, mixture models and empirical Bayesian analysis.

Paid: $100

Link: https://www.routledge.com/Learning-Microeconometrics-with-R/Adams/p/book/9780367255381

13.14 Machine Learning for Factor Investing

by Guillaume Coqueret, Tony Guida

This book is intended to cover some advanced modelling techniques applied to equity investment strategies that are built on firm characteristics.

Link: http://www.mlfactor.com/

13.15 Public Policy Analytics: Code & Context for Data Science in Government

by Ken Steif, Ph.D

The goal of this book is to make data science accessible to social scientists and city planners in particular. I hope to convince readers that someone with strong domain expertise plus intermediate data skills can have a greater impact in government than the sharpest computer scientist who has never studied economics, sociology, public health, political science, criminology, etc.

Link: https://urbanspatial.github.io/PublicPolicyAnalytics/

13.16 R for Excel users

by Julie Lowndes, Allison Horst

This course is for Excel users who want to add or integrate R and RStudio into their existing data analysis toolkit. It is a friendly intro to becoming a modern R user, full of tidyverse, RMarkdown, GitHub, collaboration & reproducibility.

Link: https://rstudio-conf-2020.github.io/r-for-excel/

13.17 R for SEO

by François Joly

Even though R is a terrific option for SEO, there are simply not enough resources out there. This guide is not here to deliver a course about R; there are plenty already. This guide is meant to be as practical as possible. How things should be done in an “R-ish way” is not the purpose of this guide. Grab what you want to grab and feel free to submit your own solution.

Link: https://www.rforseo.com/

13.18 R for Water Resources Data Science

by Ryan Peek, Rich Pauloo

Consists of two courses:

Introductory: This course is most relevant and targeted at folks who work with data, from analysts and program staff to engineers and scientists. This course provides an introduction to the power and possibility of a reproducible programming language (R) by demonstrating how to import, explore, visualize, analyze, and communicate different types of data. Using water resources based examples, this course guides participants through basic data science skills and strategies for continued learning and use of R.

Intermediate: In this course, we will move more quickly, assume familiarity with basic R skills, and also assume that the participant has working experience with more complex workflows, operations, and code-bases. Each module in this course functions as a “stand-alone” lesson, and can be read linearly, or out of order according to your needs and interests. Each module doesn’t necessarily require familiarity with the previous module.

This course emphasizes intermediate scripting skills like iteration, functional programming, writing functions, and controlling project workflows for better reproducibility and efficiency. It also covers approaches to working with more complex data structures like lists and time series data, the fundamentals of building Shiny Apps, pulling water resources data from APIs, intermediate mapmaking and spatial data processing, and integrating version control in projects with git.

Link: https://www.r4wrds.com/

13.19 R Programming with Minecraft

by Brooke Anderson, Karl Broman, Gergely Daróczi, Mario Inchiosa, David Smith, Ali Zaidi

Minecraft is awesome fun, especially in creative mode, where you can build all sorts of crazy stuff. But ambitious building projects can be really tedious to create by hand. With the miner R package, you can write R code to manipulate your Minecraft world and create even more awesome stuff.

Here’s an introductory Rstats NYC conference talk on it: https://www.youtube.com/watch?v=r_JgPF8MJpY

Link: https://kbroman.org/miner_book/?s=09

13.20 Technical Foundations of Informatics

by Michael Freeman, Joel Ross

This book covers the foundational skills necessary to start writing computer programs to work with data using modern and reproducible techniques. It requires no technical background. These materials were developed for the INFO 201: Technical Foundations of Informatics course taught at the University of Washington Information School; however, they have been structured to be an online resource for anyone hoping to learn to work with information using programmatic approaches.

Link: https://info201.github.io/


Created and maintained by Oscar Baruffa
