Library Workshops and Events

Event Details

Preparing Files for Text & Data Mining

OPEN TO:

-Faculty -Graduate Students -Postdocs -Staff -Undergraduate Students

Text mining—a process for extracting information from unstructured text—requires everyday files (PDF, Word, HTML, etc.) to be transformed into plain text files. Once your files are in a plain text format (no bold, no italics, no underlining, etc.) they are ready for automated processing and computer analysis.

This hands-on workshop will demonstrate and facilitate the use of a free Java-based program called Tika to do this work. More specifically, this workshop will help attendees install Tika and use it to convert just about any file into plain text, and then participants will be empowered to use a myriad of text mining services available on the 'Net.

There are no prerequisites, but participants will want to bring their own laptop to the session.

DATE: Tuesday, September 3, 2019
TIME: 2:00PM - 3:00PM
LOCATION: Navari Family Center for Digital Scholarship // Hesburgh Library 2nd Floor // Consultation Room 247
PRESENTER: Eric Lease Morgan
CATEGORIES: CDS | Text Mining & Analysis Workshops

Registration has closed.

Contact Info

Research Data Services Team

Hesburgh Library
University of Notre Dame
Notre Dame, IN 46556

	hl-research-data-services-list@nd.edu
	libguides.library.nd.edu/research-data-services

Need help? Ask us!
	Chat with us
	asklib@nd.edu
	(574) 631-6258
	Visit the Ask Us Desk 1st Floor, Hesburgh Library

Library Workshops and Events

Event Details

Preparing Files for Text & Data Mining

Contact Info

Need help? Ask us!

Share