Pipelines for Data Analysis

Guest Lecture


January 21, 2015
11:00am to 12:00pm
190 Doe Library
Over the last year and a half, three things have had a profound impact on how I develop tools for data analysis: Rcpp, writing the Advanced R book (http://adv-r.had.co.nz/), and the pipe operator (%>%, from magrittr). In this talk, I'll focus on the pipe operator and how it has influenced the development of tidyr, dplyr, and ggvis, the next generation of reshape2, plyr, and ggplot2. Come along to learn why I think pipelines are awesome and see how pipelines + tidyr, dplyr, and ggvis can make your data analysis fast, fluent, and fun.


Hadley Wickham

Assistant Professor, Rice University; Chief Scientist, RStudio

Hadley Wickham is a statistician from New Zealand who is currently chief scientist at RStudio and an adjunct assistant professor of statistics at Rice University. He is best known for developing open-source statistical analysis packages for the R programming language that implement logics of data visualization and data transformation. Wickham completed his undergraduate studies at the University of Auckland and his PhD at Iowa State University under the supervision of Di Cook and Heike Hofmann. In 2006, he was awarded the John Chambers Award for Statistical Computing for his work developing tools for data reshaping and visualization.

He is a prominent and active member of the R user community and has developed several notable and widely used packages including ggplot2, plyr, dplyr, and reshape2.

A webcast of this lecture is available.