A workshops ticket is required to attend this session.
The TDL community has played an important role in preserving information from the web for many years. As web archives continue to grow, they present challenges of scale and complexity for those who seek to facilitate access and integrate archival data into research and teaching. Web archives are increasingly important primary sources for historians and digital humanities scholars, and computational methods, like text mining and data visualization, show promising pathways for scholarly use of born-digital archives. During this 90-minute workshop, participants will be introduced to web archives as a primary source; gain familiarity with web archive research use cases and how libraries support them; and acquire hands-on experience creating web archive collections and computationally analyzing web archives.
In this demonstration, led by the Internet Archive’s team, participants will learn how the WARC file format is used for both preservation and computational access, and will gain hands-on experience with representative processes for analyzing research datasets.
Collection Management Librarian, University of Texas Southwestern Medical Center
I am the Collection Management Librarian at the University of Texas Southwestern Medical Center in Dallas, Texas. I am responsible for the acquisition and licensing of the library's electronic resources, and I ensure that we provide accurate and current links to those resources in...