2017 TextXD Conference

Text Analysis Across Domains


December 1, 2017
9:00am to 5:00pm
190 Doe Library
Get Directions

preXD Text Analysis Workshop
Date: November 29, 2017, 1:00 to 6:00 PM
Location: Academic Innovation Studio (Dwinelle 117, Level D), UC Berkeley 

This year, the TextXD Conference started with a preXD Workshop on November 29 from 1:00-6:00 PM, in order to give a quick introduction to text analysis in Python using Jupyter Notebooks. This session was specifically designed to bring people to the text-analysis-starting-line so that everyone would be ready for the make sessions over the next two days. No prior text analysis experience was needed to attend the preXD. Those without Python familiarity were invited to check out some introductory materials from UC Berkeley’s D-Lab here: https://github.com/dlab-berkeley/python-fundamentals - click the “launch binder” black and red badge to run it all in your browser.

TextXD - Text Analysis Across Domains Fall 2017 Conference
Dates: November 30 and December 1, 2017, 10:00 AM to 4:30 PM 
Location: 190 Doe Library, UC Berkeley 

This semester's TextXD event was the biggest TextXD event to date. With so much going on in the world of natural language processing, TextXD opened to researchers beyond the UC Berkeley campus. The agenda consisted of short morning talks on new tools, methods, software, and data (see videos in the Agenda, listed below). Speakers came from our own campus as well as UC San Francisco, UC San Diego, UC Santa Barbara, Princeton, and Drexel. Afternoon "make" sessions were also introduced this year so that participants could roll up their sleeves and spend time working together to craft solutions to our shared problems or to investigate research questions of shared interest. The text data used in the "make" sessions included newspaper articles, twitter feeds, emails, congressional hearings, and journal article abstracts. 


THURSDAY, NOVEMBER 30 at BIDS (190 Doe Library)

FRIDAY, DECEMBER 1 at BIDS (190 Doe Library)



Below is a presentation of the complete set of videos from the 2017 TextXD Conference, held at the Berkeley Institute for Data Science (BIDS) on November 29 through December 1, 2017.

THURSDAY, NOVEMBER 30 at BIDS (190 Doe Library)

10:10-10:35 AM
John Mohr, University of California, Santa Barbara
The Frontiers of Social Scientific Text Analysis


10:35-10:45 AM
Cody Hennesy, University of California, Berkeley
Text Analysis on 14 Million Digital Library Books


10:45-11:15 AM
Julia Silge, StackOverflow
Text Mining with Tidy Data Principles and Count-based Methods


11:15-11:30 AM
Pramit Choudhary, DataScience
Explainable NLP Algorithms: Understanding Word Relevance in Text Datasets


11:30-11:40 AM
Elena Glassman, BIDS, University of California, Berkeley
Wavelets for Text


11:40 AM-12:00 PM
Jamie Murdoch, University of California, Berkeley
Beyond Word Importance: Contextual Decomposition for Interpreting LSTMs


12:00-12:05 PM
Devin Cornell, University of California, Santa Barbara
Word Embedding and Semantic Analysis of News Data


12:05-12:25 PM
Host: Marla Stuart, University of California, Berkeley
Introduction and Make Session Previews


1:00-1:30 PM
Lunch Chat Panel — The Frontier of NLP (at Berkeley and Beyond)


1:30-5:00 PM— Make Session

FRIDAY, DECEMBER 1 at BIDS (190 Doe Library)

10:05-10:20 AM
Claudia von Vacano, D-Lab, University of California, Berkeley
Scalable Detection of Online Hate Speech


10:20-10:50 AM
Jake Ryland Williams, Drexel University
Minimal Semantic Units in Text Analysis


10:50-11:05 AM
Han Zhang, Princeton University
Uncovering Authoritarian Rule: Identifying Collective Action with Social Media Data


11:05-11:35 AM
Rex Douglass (UCSD)
Georeferencing of Events from Text


11:35-11:55 AM 
Nick Adams, BIDS, University of California, Berkeley
TextThresher: Qualitative Text Analysis at a Quantitative Scale 


11:55 AM-12:05 PM
Oksana Gologorskaya, University of California, San Francisco 
Text Analysis in Biomedical Applications at UCSF


12:05-12:20 PM
Miriam Petruck, International Computer Science Institute
The FrameNet Database -- FrameNet: The Tip of the Iceberg


12:20-12:35 PM
Meredith Lee, West Big Data Innovation Hub and University of California, Berkeley
Collaborating with the Big Data Innovation Hubs


1:00-1:30 PM 
Lunch Chat Panel — Humans In the Loop: The Role of Humans in Text Analysis


4:30-5:00 PM
Conference Closing and Remarks from Participants

Follow us on Twitter #TextXD17