Module 2

Data science lifecycle & Exploratory data analysis using visualization

Author
Affiliation

Lars Schöbitz

ETH Zurich

Are you ready for some data visualisations? This week is all about exploring data with ggplot2 R package. We will also learn about the data science lifecycle.

Learning Objectives

  1. Learners can list the six elements of the data science lifecycle.
  2. Learners can describe the four main aesthetic mappings that can be used to visualise data using the ggplot2 R Package.
  3. Learners can control the colour scaling applied to a plot using colour as an aesthetic mapping.
  4. Learners can compare three different geoms (bar/col, histogram, point) and their use case.

Slides

View slides in full screen | Download slides as PDF

Readings

  1. Read R for Data Science - Whole game
  2. Read R for Data Science - Section 3 - Workflow basics
  3. Read R for Data Science - Section 2 - Data visualization

Assignments

Due date: September 24, 2025

Please complete the following assignments before the due date.

Thank you for working through these assignments.

Quiz

Due date: October 01, 2025

Complete this quiz after you have worked through the assignments. Completing the quiz of each week is required to receive a course certificate at the end of the course.

Access the quiz here: https://u4x6xe-lars-sch0bitz.shinyapps.io/ds4owd-002-quiz/

You can verify your submissions using our Live Quiz Checker.

Session Recording

You can access the Zoom recording of Module 2 on September 18, 2025. To watch the recording, you will need to register for it.

Register for the recording at this link: https://ethz.zoom.us/rec/share/fxfezDds1UPpXjY-47ZIZN0be1nf9WFkt5wbm4zjf0CsVJEt9wyketdOKfleo-E_.GKy6wztwALoYMyjj

Note: The video will show once the recording has been processed.