campanile and green trees with the sun shining
Cultural Analytics Workshop Week

Building a Benchmark for Long-Form Narrative Understanding

Friday, March 20, 2026
11:00 am - 11:30 am
AI Futures Lab, Downtown Berkeley
David Bamman

Co-sponsored by the Berkeley Institute for Data Science, the School of Information, and the Department of Scandinavian.

Evaluating the long-form reasoning abilities of multimodal language models is difficult in part due to the simple lack of availability of long-form videos — many existing benchmarks for video understanding are comprised of short YouTube clips, contain copyrighted materials (which are then subject to DMCA takedown requests) or contain privately digitized collections that are not publicly shareable. 

I’ll describe ongoing work to create a new benchmark of popular, openly available movies that we can build benchmarks around; this involves identifying measures of popularity going back to the 1920s, assessing public domain status, and creating a suite of narrative understanding questions that we can use to track the capabilities of modern multimodal language models going forward.


Space is limited. Submit the application form to request an invitation.

Apply to attend

Speaker

David Bamman

David Bamman is an associate professor in the School of Information at UC Berkeley, where he works in the areas of natural language processing and cultural analytics, applying NLP and AI to empirical questions in the humanities and social sciences. 

His research focuses on improving the performance of computational methods for underserved domains like literature (including LitBank and BookNLP) and developing new empirical approaches for the study of literature, film and culture. Before Berkeley, he received his Ph.D. in the School of Computer Science at Carnegie Mellon University and was a senior researcher at the Perseus Project of Tufts University. Bamman's work is supported by the National Endowment for the Humanities, National Science Foundation, Mellon Foundation, and an NSF CAREER award. 

Last updated: February 27, 2026