Supporting Computational Research on Large Digital Collections (Internet Archive & Archives Unleashed)

Abstract

Each year, a growing number of scholars apply computational methods such as data mining, natural language processing, and machine learning (ML) to terabytes and even petabytes of digital library and archive collections, which poses many challenges for the research libraries that support them. In 2020, Internet Archive Research Services and Archives Unleashed received funding to combine their tools for computational analysis of web and digital archives in support of joint technology development, community building, and selected research projects by sponsored cohort teams. This session will feature programs that are building technologies, resources, and communities to support data-driven research. It will also review the beta platform, Archives Research Compute Hub, and discuss working with digital humanities, social science, and computer science researchers, as well as industry partners, in support of large-scale digital research methods.

Date
Location
Washington, D.C.
Nick Ruest
Associate Librarian