Library Workshops and Events
Event Details
Using Topic Modeling Against a Corpora
Topic modeling is a process of dividing and conquering a collection of texts in order to better understand the collection as a whole. Given a corpora of documents (books, articles, Web pages, etc.) from any discipline, topic modeling divides the corpora into sub-corpora, and each sub-corpora will be identified with a theme. This process is sometimes useful for identifying genres, authors, and/or subjects in a body of literature. This hands-on workshop will demonstrate and facilitate the use of a free Java-based program called Topic Modeling Tool to do this work.
Participants are expected to bring their own computer, and the computer is expected to have Java already installed, which it probably does.
- DATE
- Wednesday, September 29, 2021
- TIME
- 12:30PM - 1:30PM
- LOCATION
- Navari Family Center for Digital Scholarship // Hesburgh Library 2nd Floor // Consultation Room 247
- PRESENTER
- Eric Lease Morgan
- CATEGORIES
- CDS | Text Mining & Analysis Workshops
Contact Info
Hesburgh Library
131 Hesburgh Library
University of Notre Dame
Notre Dame, IN 46556
(574) 631-8604 | |
emorgan@nd.edu |