"Instead of arguing about whether results hold up, let’s push to provide enough information for others to repeat the experiments." This article by BIDS Senior Fellow Philip B. Stark appeared in the journal Nature on May 31, 2018.
The Types, Roles, and Practices of Documentation in Data Analytics Open Source Software Libraries: A Collaborative Ethnography of Documentation Work
Data analytics increasingly relies on open source software (OSS) libraries that extend scripting languages like Python and R. Software documentation for these libraries is crucial for people across all experience levels, but documentation work raises many challenges, particularly in open source communities. In this collaboration between ethnographers and data scientists, we discuss the types, roles, practices, and motivations around documentation in data analytics OSS libraries.
clusterExperiment and RSEC: A Bioconductor package and framework for clustering of single-cell and other large gene expression datasets
Abstract: Colorado conducted risk-limiting tabulation audits (RLAs) across the state in 2017, including both ballot-level comparison audits and ballot-polling audits. Those audits only covered contests restricted to a single county; methods to efficiently audit contests that cross county boundaries and combine ballot polling and ballot-level comparisons have not been available.
Colorado’s current audit software (RLATool) needs to be improved to audit these contests that cross county lines and to audit small contests efficiently.