AI and Data Science

Interested?

More information
° DA2122-M1 EN

Description

R is a flexible environment for statistical computing and graphics, which is becoming increasingly popular as a tool to get insight in often complex data. While in some ways similar to other programming languages (such as C, Java and Perl), R is particularly suited for data analysis because ready-made functions are available for a wide variety of statistical (classical statistical tests, linear and nonlinear modeling, timeseries analysis, classification, clustering, ...) and graphical techniques.

The base R program can be extended with user-submitted packages, which means new techniques are often implemented in R before being available in other software. This is one of the reasons why R is becoming the de facto standard in certain fields such as bioinformatics (Bioconductor) and financial services.

This course is part of a larger course series in Data Analysis consisting of 19 individual modules. Find more information and enroll for this module via www.ipvw-ices.ugent.be

Program

This course introduces the use of the R environment for the implementation of data management, data exploration, basic statistical analysis and automation of procedures.

It starts with a description of the R GUI, the use of the command line and an overview of basic data structures. The application of standard procedures to import data or to export results to external files will be illustrated.

Creation of new variables, subsetting, merging and stacking of data sets will be covered in the data management section. Exploration of the data by histograms, box plots, scatter plots, summary numbers, correlation coefficients and cross-tabulations will be performed.

Simple statistical procedures that will be covered are:

  • comparisons of observed group means (t-test, ANOVA and their non-parametric versions) and proportions
  • test for independence in 2-way cross tables and linear regression (focusing on the R-implementation of the statistical methods that are the subject of other modules of the statistics series)

Finally, installing new packages and automation of analysis procedures will also be discussed.

Practical sessions and specific exercises will be provided to allow participants to practice their R skills in interaction with the teacher.

Course number:
DA2122-M1
Type:
Short- en long-term programmes
Area of interest:
AI and Data Science, Sciences
Language:
EN
Academic year:
2021 - 2022
Contact person:
ipvw.ices@ugent.be
More information

Your browser does not meet the minimum requirements to view this website. The browsers below are compatible. If you do not have one of these browsers, click on the icon to download the desired browser.