Leverage your R Skills: Data Wrangling & Plotting with Tidyverse

AI and Data Science

Interested?

More information
° DA2122-M6 EN

Description

Tidyverse is a collection of R-packages used for data wrangling and visualization that share a common design philosophy. The goal of this course is to get you up to speed with the most up-to-date and essential tidyverse tools for data exploration. After attending this course, you’ll have the tools to tackle a wide variety of data wrangling and visualization challenges, using the best parts of R tidyverse.

This course covers the most essential tools from 3 main R tidyverse packages that are frequently used in general data analysis procedure.
Lectures with R code demonstrations are blended with hands-on exercises which allows you to try out the tools you’ve seen in the class under guides.

This course is part of a larger course series in Data Analysis consisting of 19 individual modules. Find more information and enroll for this module via www.ipvw-ices.ugent.be

Program

What you will learn:

  • Data transforming and summarizing with dplyr: narrowing in on observations of interest, creating new variables that are functions of existing variables, and calculating a set of summary statistics (like counts or means)
  • Data visualization with ggplot2: creating more informative graphs (e.g., scatter plot, bar plot, histogram, smoother/regression line, …) in an elegant and efficient way. Arranging multiple plots on a grid
  • Data ingest and tidying with tidyr: storing it in a consistent form that matches the semantics of the dataset with the way it is stored.
  • Extra tools for programming: Merging and comparing two datasets based on various matching or filtering criterion. Other useful tools for R programming.

Not included in this course:

  • A systematic training guide in basics of R. If you never used R or RStudio before, we highly recommend you to take Module 1 of this year's program which will guide you to be familiar with the R environment for the implementation of data management and exploration tasks.
  • Big data. This course focuses on small, in-memory datasets as you can’t tackle big data easily unless you have experience with small data.
  • Statistics. Although you will see many basic statistics in this course, the main focus is on R and the tidyverse tools instead of explaining the statistical concepts.

Course number:
DA2122-M6
Type:
Short- en long-term programmes
Area of interest:
AI and Data Science, Sciences
Language:
EN
Academic year:
2021 - 2022
Lecturers:
Limin Liu
Contact person:
ipvw.ices@ugent.be
More information

Your browser does not meet the minimum requirements to view this website. The browsers below are compatible. If you do not have one of these browsers, click on the icon to download the desired browser.