BIDS TextXD 2022: Text Analysis Across Domains

BIDS TextXD: Text Analysis Across Domains


December 5, 2022 to December 7, 2022
9:00am to 6:00pm
Berkeley, CA


BIDS TextXD 2022: Text Analysis Across Domains
Dates: December 5-7, 2022
Location: Berkeley, CA

  • Registration: Tickets go on sale July 11, 2022.
  • Call for Proposals: Abstract Submission Form. Submit to propose a 15-min presentation, a 3-5min lightning talk, or a poster by September 16 October 2, 2022 (deadline extended). 
  • Subscribe to the TextXD mailing list to receive updated information regarding registration and submissions.

The sixth annual BIDS TextXD: Text Analysis Across Domains is a text-focused data science conference that features three days of workshops, tutorials, invited speakers, posters, and panel discussion sessions on current Natural Language Processing (NLP) research, as well as talks spanning theory, applications, and tools. This December, TextXD 2022 is being presented in partnership with Berkeley's D-Lab and the Division of Computing, Data Science, and Society to convene an international and interdisciplinary group of practitioners, researchers, learners, and entrepreneurs who work with text as a primary source of data, and who use computational text analysis in a wide range of disciplines.


– Day 1 (December 5) will be a full day training workshop on how to perform Text Analysis using natural language processing tools and methods.
– Days 2 and 3 (December 6-7) will feature keynote speakers, talks and posters from current NLP researchers and leaders in text analysis across multiple domains, with discuss sessions over lunch.

Confirmed Keynote Speakers

  • Emily M. Bender is a Professor of Linguistics, the Faculty Director of the Master of Science in Computational Linguistics program, and Director of the Computational Linguistics Laboratory at the University of Washington.
  • David M. Blei is a professor of Statistics and Computer Science at Columbia University.

Organizing Committee

Niek Veldhuis, Professor of Assyriology, UC Berkeley. Heather Haveman, Professor of Sociology and Business, UC Berkeley. Tim Tangherlini, Professor of Danish Literature and Culture, UC Berkeley. Maryam Vareth, Health and Life Sciences Research Lead, BIDS. Ilya Akdemir, Graduate Student, UC Berkeley School of Law. Ciera Martinez, Biology and Environmental Sciences Research Lead, BIDS. Isaac Dalke, Graduate Student of Sociology, UC Berkeley.


Questions about this conference, including inquiries about accessibility and the availability of assistive technologies, may be directed to


Niek Veldhuis

Professor of Assyriology, Department of Near Eastern Studies, UC Berkeley

Niek Veldhuis is Professor of Assyriology (cuneiform studies) in the Department of Near Eastern Studies. He received his PhD at the Rijksuniversiteit Groningen (The Netherlands) in 1997, and came to Berkeley in 2002. His primary interests are in the intellectual history of ancient Mesopotamia (History of the Mesopotamian Lexical Tradition, 2014) and Sumerian literature (Religion, Literature and Scholarship: The Sumerian Composition Nanše and the Birds, 2004).  He is director of the NEH-supported Digital Corpus of Cuneiform Lexical Texts and is a member of the international Oracc Steering Committee, providing tools and standards for digital publication of cuneiform texts to scholars worldwide. Today, his main research focus is on developing computational text analysis scripts (primarily in Jupyter Notebooks) for cuneiform datasets.

Heather A. Haveman

Professor, Department of Sociology, UC Berkeley

Heather A. Haveman is a Professor of Sociology and Business at UC Berkeley. She holds a BA in history and an MBA (from the University of Toronto), and a Ph.D. in organizational behavior and industrial relations (from UC Berkeley).  Following positions at Duke University's Fuqua School of Business, Cornell University's Johnson Graduate School of Management, and Columbia University's Graduate School of Business, Professor Haveman joined UC Berkeley in July 2006. Her research interests include how organizations, the fields in which they are embedded, and the careers of their members and employees evolve. Her current work involves American magazines and wineries, Chinese listed firms, and the emerging marijuana market in several US states. 

Maryam Vareth

Health and Life Sciences Lead

Maryam Vareth leads BIDS’ data science research efforts in the Health & Life Sciences.  Dr. Vareth is a Co-Director of the Innovate For Health initiative, a collaboration among UC Berkeley, UCSF, and Janssen Pharmaceutical Companies of Johnson & Johnson. As an experienced engineer, researcher, and data scientist, she applies mathematics, statistics and physics to solve unmet needs in healthcare to enhance patients’ experience during their medical journey. She is an advocate for “data-driven” medicine, and in particular for linking medical imaging data with medical diagnostics and therapeutics to extract clinically-relevant insights through the use of open research and open source practices.  Dr. Vareth received her BS and MS training in Electrical Engineering and Computer Science (EECS) from UC Berkeley, where she was awarded the prestigious Regent’s and Chancellor’s Scholarship.  She completed her PhD through the joint UC Berkeley-UCSF Bioengineering program as a National Science Foundation Fellow, where she was awarded the Margaret Hart Surbeck Endowed Fellowship for Interdisciplinary Research for her work on developing new techniques and algorithms for the acquisition, reconstruction and quantitative analysis of Magnetic Resonance Spectroscopy Imaging (MRSI), with the goal of improving its speed, sensitivity and specificity to improve the management of patients with brain tumors.  She conducted her post-doctoral fellowship at UCSF, combining structural, physiological and metabolic imaging data from large clinical trials to quantitatively characterize heterogeneity within malignant brain tumors.

Ciera Martinez

Biology and Environmental Sciences Lead

BIDS Biology and Environmental Sciences Lead Ciera Martinez focuses on data intensive research projects that aim to understand how life on this planet evolves in reaction to the environment and climate – especially projects involving large and complex datasets.  A long-time open science advocate, Ciera has been involved with and continues to be interested in working on training for open data, education, publishing, and software, including developing community standards for data management practices.  As a 2019 Mozilla Open Science Fellow, she connected her love of data and museums and worked on projects aimed at understanding and increasing the usability of biodiversity and natural history museum data.  She received her PhD in Plant Biology from UC Davis, researching the genetic mechanisms regulating plant architecture.  She then went on to become a NSF Postdoctoral Fellow at UC Berkeley in the Molecular and Cellular Biology Department, studying genome evolution.  She was also a BIDS postdoctoral Data Science Fellow for 3 years, working on undergraduate research practices, data science training, community development, and best practices for data science, diversity and inclusion, and computational research.