Professional work

Various publications with the Pew Research Center (including some more on the methodological side)

While a great deal of my work at Pew has been focused on improving my team’s workflows, educating staff on how to use data science tools, and quality checking, I have also contributed to a number of reports, primarily focusing on Twitter use by U.S. adults.

Improving Workplace Safety through Proactive Inspections, with Dirección del Trabajo Chile

Many Chilean workers face significant workplace safety issues. Dirección del Trabajo, Chile’s labour agency, is responsible for reducing the risks workers face through performing inspections and remediations. However, a lot of time has been spent inspecting facilities reactively based on complaints. My team built a pipeline in Python to move data from its raw format to clean Postgres databases, run machine learning models, and evaluate the results of these models to help DT identify facilities to prioritize in future inspections. This was complicated, and messy, so feel free to reach out for more details.

As a volunteer project manager with DSSG Solve, I led a team working with ILAO to find patterns in usage of the resources on the site with the aim of providing those resources to users more proactively to ensure they have access to everything they need.

Publications

Using Administrative Records and Survey Data to Construct Samples of Tweeters and Tweets

Hughes, Adam G., Stefan D. McCabe, William R. Hobbs, Emma Remy, Sono Shah, and David M. J. Lazer. Forthcoming in Public Opinion Quarterly. “Using Administrative Records and Survey Data to Construct Samples of Tweeters and Tweets.” https://doi.org/10.1093/poq/nfab020.

Are Volcanic Eruptions Increasing?: An example of teaching data wrangling and visualization

In review: A paper co-authored with Dr. Kelly McConville advising statistics educators on how to teach basic data wrangling and visualization in R with dplyr and ggplot2 using the Global Volcanism Program’s terrific data.

Other academic work

The Reality Gap: Implementing NEAT with a Scribbler Robot

Alison Rosenzweig and I explored moving a simulated adaptive robot into reality, creating some fun abstract art in the process.

Gerrymandering and Impossibility

I wrote my senior capstone paper (a kind of expository thesis) about some mathematical aspects of drawing and evaluating districts that make gerrymandering a difficult problem to solve.

Pill Image Segmentation

Inspired by the 2016 NLM Pill Recognition Challenge, we segmented pills from backgrounds in reference images of pills.

Metaballs

We used C++ and OpenGPL to implement 3-D metaballs.

Conferences and events

Where are Volcanoes Erupting? Mapping with R and R Shiny

As an extension of the paper I wrote with Dr. McConville, I presented a virtual poster at the 2017 Electronic Undergraduate Statistics Conference about visualizing map data. This presentation demonstrates and advises undergraduates and educators of undergraduates on how to create this visualizations ranging from simple ggplot2 graphics to R Shiny apps. Code referenced in this presentation is available here.

Peripeteia weekend

Peripeteia is an annual event where people from the Swarthmore College community teach short courses on diverse topics. I helped organize this in its inaugural year, as well as the two subsequent years. I also created and maintained the website, and helped organize smaller-scale events related to Peripeteia.

Grace Hopper Celebration for Women in Computing

In 2017, I was awarded the celebrated scholarship from AnitaB.org and attended GHC as a Grace Hopper Scholar.

Tools

HealthyTexts

This project won the “Sponsored Prize” at Local Hack Day Swarthmore. It is an SMS service for sending health reminders – our goal was to make the functionality of many fitness tracking smartphone apps accessible to those who do not have or are not comfortable with smartphone technology. This was built using the Twilio API.

Bahasa Wooi language learning tool

An in-progress simple website teaching the basics of bahasa Wooi, a language spoken by around 200 people on Yapen island. This website is an extension of the research I did in 2016 in Manokwari, Papua collecting data on Wooi with the assistance of my language consultant Jimmi Kirihio and my mentor Dr. Emily Gasser.

Volcano explorer

An R Shiny app visualizing volcano locations and information using the Global Volcanism Program’s data and Leaflet. I have since explored the idea of replicating this using D3.js.

California arrest data interactive

A simple R Shiny app that allows users to explore arrest rates for different demographic groups.