Library Workshops and Events

Event Details

VIRTUAL--Library Carpentry--OpenRefine (Session 1 of 2)

OPEN TO:
-Faculty -Graduate Students -Postdocs -Public, Alumni, & Friends -Staff -Undergraduate Students

Connect via Zoom: a URL to join the Zoom session will be emailed to you 1 hour before the workshop start time. To ensure access to the workshop from the virtual waiting room, your Zoom name should match the information provided when registering for the workshop. Please email cds@nd.edu with any questions or issues.


 

This session introduces working with data in OpenRefine. At the conclusion of the lesson you will understand what the OpenRefine software does and how to use the OpenRefine software to work with data files. OpenRefine can be used to standardize and clean data in a file. OpenRefine is useful where you have data in a simple delimited  format such as a spreadsheet, a comma separated values file (csv), or a tab delimited file (tsv), but with internal inconsistencies either in data formats, where data appears, or in terminology used. 

Objectives

  • Use OpenRefine to Get an overview of a data set

  • Resolve inconsistencies in a data set, for example standardizing date formatting

  • Split cells which contain multiple bits of data so that each piece of data is in its own cell

  • Match local data up to other data sets

  • Enhance a data set with data from other sources

  • Answer questions about the content of a data set using Facets

  • Use facets and filters to work with a subset of data

  • Correct data problems through a facet

  • Use clustering to identify and fix replace varying forms of the same data with a single consistent value

Use your own laptop. You need to install OpenRefine and download a data file  doaj-article-sample.csv to follow the lesson in this session. OpenRefine does not support Internet Explorer or Edge. For this session please use Firefox, Chrome or Safari instead. See Setup for more information. If you’d like help to address difficulties you encounter with setup, please contact cds@nd.edu.  

This workshop is part of a series. You can take this workshop to build toward completion of either the Software or Library Carpentries lesson series. This workshop is taught by a Carpentries-certified instructor.

DATE
Thursday, April 2, 2020
TIME
4:00PM - 5:00PM
PRESENTER
Natalie Meyers
CATEGORIES
CDS | Carpentries Workshops
Registration has closed.

Contact Info

Profile photo of Research Data Services Team
Research Data Services Team