Library Workshops and Events

Event Details

Topic Modeling Against a Corpora

OPEN TO:
-Faculty -Graduate Students -Postdocs -Staff -Undergraduate Students

Divide and conquer a corpus of texts in order to better understand it as a whole.

Topic modeling is a process of dividing & conquering a collection of texts in order to better understand the collection as a whole. Given a corpora of documents (books, articles, Web pages, etc.), topic modeling divides the corpora into sub-corpora, and each sub-corpora will be identified with a theme. This process is sometimes useful for identifying genres, authors, and/or subjects in a body of literature.

This hands-on workshop will demonstrate and facilitate the use of a free Java-based program called Topic Modeling Tool. Participants are expected to bring their own computer, and the computer is expected to have Java already installed, which it probably does.

RELATED LIBGUIDE: Text mining and natural language processing by Eric Lease Morgan

DATE
Monday, October 14, 2024
TIME
11:00AM - 12:00PM
LOCATION
Navari Family Center for Digital Scholarship // Hesburgh Library 2nd Floor // Consultation Room 247
PRESENTER
Eric Lease Morgan
CATEGORIES
CDS | Text Mining & Analysis Workshops
Registration has closed.

Contact Info

Profile photo of Eric Lease Morgan
Eric Lease Morgan

Hesburgh Library
131 Hesburgh Library
University of Notre Dame
Notre Dame, IN 46556

(574) 631-8604
emorgan@nd.edu
Profile photo of Center for Digital Scholarship
Center for Digital Scholarship

Hesburgh Library–2nd Floor NE
cds.library.nd.edu
cds@nd.edu

Julie C. Vecchio '04, MPH, MLIS
Co-Interim Director
jvecchio@nd.edu
(574) 631-4900