Developments in 2018 and Prospects for 2019 and Facilitating Diverse Collection and Curation in Web Crawling and Indexing
Facilitating Diverse Collection and Curation in Web Crawling and Indexing
Matt Bayley will briefly summarize his work this semester with the Internet Archive on collaborative web crawling, archiving, and indexing. This will include a survey of existing techniques and initiatives as well as an exploration of new protocols for crowd-sourcing these data and representing them within a shared infrastructure.
Developments in 2018 and Prospects for 2019
Every December, I give a plenary talk at the member meeting of the Coalition for Networked Information (CNI), where I serve as the director. Among other things, this talk summarizes what I see as key developments in the previous year and critical prospects for the coming year across a very broad landscape of technology and networked information. Recently, we've established a tradition at Berkeley where the final session of the fall seminar has been used for a somewhat more leisurely exposition and exploration of these developments and prospects in preparation for my plenary talk. Please join us for the 2018 version of this survey.
Matt Bayley is a MIMS student at the I School with a background in data engineering and an interest in software, infrastructure, and tech policy.
Clifford Lynch is the director of the Coalition for Networked Information (CNI) and an adjunct professor at the School of Information. Prior to joining CNI in 1997, Lynch spent eighteen years at the University of California Office of the President, the last ten as director of Library Automation. Lynch is a past president of ASIS&T and a fellow of the American Association for the Advancement of Science and the National Information Standards Organization.