Library Workshops and Events

Event Details

Topic Modeling Against a Corpora

OPEN TO:
-Faculty -Graduate Students -Postdocs -Staff -Undergraduate Students

Divide and conquer a corpus of texts in order to better understand it as a whole.

Topic modeling is a process of dividing & conquering a collection of texts in order to better understand the collection as a whole. Given a corpora of documents (books, articles, Web pages, etc.), topic modeling divides the corpora into sub-corpora, and each sub-corpora will be identified with a theme. This process is sometimes useful for identifying genres, authors, and/or subjects in a body of literature.

This hands-on workshop will demonstrate and facilitate the use of a free Java-based program called Topic Modeling Tool. Participants are expected to bring their own computer, and the computer is expected to have Java already installed, which it probably does.

RELATED LIBGUIDE: Text mining and natural language processing by Eric Lease Morgan

DATE
Tuesday, November 5, 2024
TIME
2:00PM - 3:00PM
LOCATION
Navari Family Center for Digital Scholarship // Hesburgh Library 2nd Floor // Consultation Room 247
PRESENTER
Eric Lease Morgan
CATEGORIES
CDS | Text Mining & Analysis Workshops

Registration is required. There are 7 seats available.

Contact Info

Profile photo of Eric Lease Morgan
Eric Lease Morgan

Hesburgh Library
131 Hesburgh Library
University of Notre Dame
Notre Dame, IN 46556

(574) 631-8604
emorgan@nd.edu
Profile photo of Center for Digital Scholarship
Center for Digital Scholarship

Hesburgh Library–2nd Floor NE
cds.library.nd.edu
cds@nd.edu

Julie C. Vecchio '04, MPH, MLIS
Co-Interim Director
jvecchio@nd.edu
(574) 631-4900