Library Workshops and Events
Event Details
Topic Modeling Against a Corpus
Divide and conquer a corpus of texts in order to better understand it as a whole.
Topic modeling is a process of dividing and conquering a collection of texts in order to better understand the collection as a whole. Given a corpus of documents (books, articles, Web pages, etc.), topic modeling divides the corpus into sub-corpora and identifies each sub-corpus with a theme. This process is sometimes useful for identifying genres, authors, and/or subjects in a body of literature.
This hands-on workshop will demonstrate and facilitate the use of a free Java-based program called Topic Modeling Tool. Participants are expected to bring their own computer with Java installed, which is probably already the case.
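For anyone curious about the idea before the session, the divide-and-label process described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: it is not how Topic Modeling Tool is run, the function name `topic_model`, its parameters, and the four tiny sample documents are all made up for this example, and the technique shown (a toy latent Dirichlet allocation fitted by collapsed Gibbs sampling) is one common approach, not necessarily the one the workshop's software uses.

```python
import random
from collections import Counter


def topic_model(docs, num_topics=2, iterations=200, alpha=0.1, beta=0.01, seed=0):
    """Toy topic model (hypothetical helper, for illustration only).

    docs is a list of token lists. Returns (themes, labels): the top words
    of each topic, and the dominant topic of each document.
    """
    rng = random.Random(seed)
    vocab_size = len({word for doc in docs for word in doc})

    doc_topic = [[0] * num_topics for _ in docs]          # tokens per topic, per document
    topic_word = [Counter() for _ in range(num_topics)]   # word counts per topic
    topic_total = [0] * num_topics                        # total tokens per topic
    assignments = []

    # Start by assigning every token to a random topic.
    for d, doc in enumerate(docs):
        current = []
        for word in doc:
            k = rng.randrange(num_topics)
            current.append(k)
            doc_topic[d][k] += 1
            topic_word[k][word] += 1
            topic_total[k] += 1
        assignments.append(current)

    # Repeatedly re-sample each token's topic given all the other tokens.
    for _ in range(iterations):
        for d, doc in enumerate(docs):
            for i, word in enumerate(doc):
                k = assignments[d][i]
                doc_topic[d][k] -= 1
                topic_word[k][word] -= 1
                topic_total[k] -= 1
                weights = [
                    (doc_topic[d][t] + alpha)
                    * (topic_word[t][word] + beta)
                    / (topic_total[t] + vocab_size * beta)
                    for t in range(num_topics)
                ]
                k = rng.choices(range(num_topics), weights=weights)[0]
                assignments[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][word] += 1
                topic_total[k] += 1

    # A theme is the handful of words most often assigned to a topic.
    themes = [
        [w for w, count in topic_word[k].most_common(3) if count > 0]
        for k in range(num_topics)
    ]
    # A document's sub-corpus is the topic holding most of its tokens.
    labels = [
        max(range(num_topics), key=lambda k: doc_topic[d][k])
        for d in range(len(docs))
    ]
    return themes, labels


# Made-up sample corpus: two sea-faring documents, two domestic ones.
docs = [
    "whale ship sea captain whale sea".split(),
    "ship sea whale harpoon captain".split(),
    "love marriage estate sister love".split(),
    "sister marriage love estate ball".split(),
]
themes, labels = topic_model(docs)
print(themes)
print(labels)
```

Each number in `labels` says which sub-corpus a document landed in, and each word list in `themes` is a rough label for that sub-corpus. Real corpora need far more text, stop-word removal, and some experimentation with the number of topics.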
RELATED LIBGUIDE: Text mining and natural language processing by Eric Lease Morgan
- DATE: Wednesday, December 11, 2024
- TIME: 2:00 PM - 3:00 PM
- LOCATION: Navari Family Center for Digital Scholarship, Hesburgh Library, 2nd Floor, Consultation Room 247
- PRESENTER: Eric Lease Morgan
- CATEGORIES: CDS | Text Mining & Analysis Workshops
Contact Info
Hesburgh Library
131 Hesburgh Library
University of Notre Dame
Notre Dame, IN 46556
(574) 631-8604
emorgan@nd.edu
Hesburgh Library–2nd Floor NE
cds.library.nd.edu
cds@nd.edu
Julie C. Vecchio '04, MPH, MLIS, Co-Interim Director | jvecchio@nd.edu | (574) 631-4900