Library Workshops and Events

Event Details

Data Organization in Spreadsheets

OPEN TO:
-Faculty -Graduate Students -Postdocs -Staff -Undergraduate Students

Good data organization is the foundation of any research project. Most researchers have data in spreadsheets, so it’s the place that many research projects start.

Typically we organize data in spreadsheets in ways that we as humans want to work with the data. However computers require data to be organized in particular ways. In order to use tools that make computation more efficient, such as programming languages like R or Python, we need to structure our data the way that computers need the data. Since this is where most research projects start, this is where we want to start too!

In this lesson, you will learn:

  • Good data entry practices - formatting data tables in spreadsheets
  • How to avoid common formatting mistakes
  • Approaches for handling dates in spreadsheets
  • Basic quality control and data manipulation in spreadsheets
  • Exporting data from spreadsheets

In this lesson, however, you will NOT learn about data analysis with spreadsheets. Much of your time as a researcher will be spent in the initial ‘data wrangling’ stage, where you need to organize the data to perform a proper analysis later. It’s not the most fun, but it is necessary. In this lesson you will learn how to think about data organization and some practices for more effective data wrangling. With this approach you can better format current data and plan new data collection so less data wrangling is needed.

This lesson assumes no prior knowledge of the skills or tools.


Before the session, please complete the lesson setup steps:

  • ALL PARTICIPANTS: Download the 3 data files linked above, placing them in a location you can easily find and access on your computer (such as a folder on your desktop).
  • MacOS USERS WHO USE THE APPLE NUMBERS PROGRAM: Plan to use Microsoft Excel or another program for this session, as Numbers does not contain some of the features we will use.
  • IF YOU DO NOT HAVE A SPREADSHEET APPLCICATION ON YOUR COMPUTER: Feel free to to install the free open source LibreOffice program linked above.
DATE
Wednesday, December 7, 2022
TIME
1:00PM - 2:30PM
LOCATION
Navari Family Center for Digital Scholarship // Hesburgh Library 2nd Floor // Consultation Room 247
PRESENTER
Julie Vecchio
CATEGORIES
CDS | Carpentries Workshops CDS | Data Use & Analysis Workshops CDS | Research Data Services Workshops
Registration has closed.

Contact Info

Profile photo of Julie Vecchio
Julie Vecchio

Navari Family Center for Digital Scholarship
250 Hesburgh Library
University of Notre Dame
Notre Dame, IN 46556
cds.library.nd.edu

NFCDS Help Desk
cds@nd.edu

 

Julie Vecchio
Assistant Director, Navari Family Center for Digital Scholarship
jvecchio@nd.edu
(574) 631-4900

 

Profile photo of Center for Digital Scholarship
Center for Digital Scholarship

Hesburgh Library–2nd Floor NE
cds.library.nd.edu
cds@nd.edu

Julie Vecchio
Assistant Director
jvecchio@nd.edu
(574) 631-4900