Statistics and Applied Mathematics

Project Jupyter

Project Jupyter is a community of open-source developers, scientists, educators, and data scientists. Its goal is to build open-source tools and create community that facilitates scientific research, reproducible and open workflows, education, computational narratives, and data analytics. Jupyter supports over 100 programming languages, and connects data analytics tools across a range of disciplines and communities.

There are several core projects of Jupyter that the Berkeley Institute for Data Science supports:


scikit-image was founded in 2009 by Stéfan van der Walt.  It is a community-driven Python project, consisting of a vast collection of high-quality, peer-reviewed image processing algorithms that are made available to a global community of researchers free of charge and free of restriction.  The library is widely used in many different fields, including astronomy, biomedical imaging, and environmental resource management. The library builds on NumPy and SciPy, two other projects in the scientific Python ecosystem supported by BIDS.

Cesium ML

Cesium is an end-to-end machine learning platform for time-series, from calculation of features to model-building to predictions. Cesium has two main components—a Python library, and a web application platform that allows interactive exploration of machine learning pipelines. Take control over the workflow in a Python terminal or Jupyter notebook with the Cesium library, or upload your time-series files, select your machine learning model, and watch Cesium do feature extraction and evaluation right in your browser with the web application.