Library Workshops and Events
How to read 11 million books
The HathiTrust (http://hathitrust.org) is a collection of 11 million research library electronic texts digitized by Google, and this hands-on class teaches participants how to use computers to analyze these materials. Sometimes called “distant” or “scalable” reading — forms of digital humanities research — this class demonstrates ways to literally count & tabulate the frequency of words in a text in order to find patterns & anomalies within it. Based on the resulting analysis, it is possible to more quickly learn what a corpus is about when compared to reading the corpus without the use of a computer. HathiTrust materials lend themselves quite easily to this sort of analysis. There are no prerequisites, but participants are expected to bring their own laptop to the session.
- Monday, October 24, 2016
- 12:00PM - 1:00PM
- Eric Morgan
- CDS | Text Mining & Analysis Workshops
VIRTUAL WORKSHOPS: a URL to join the Zoom session will be emailed to you several hours before the workshop start time. To ensure access to the workshop from the virtual waiting room, your Zoom name should match the information provided when registering for the workshop.