AI and Data Science

Interested?

More information
° DA2122-M8 EN
Tags: Postacademische opleiding

Description

Modern high throughput technologies easily generate data on thousands of variables; e.g. health care data, genomics, chemometrics, environmental monitoring, web logs, movie ratings, …

Conventional statistical methods are no longer suited for effectively analysing such high-dimensional data.
Multivariate statistical methods may be used, but for often the dimensionality of the data set is much larger than the number of (biological) samples. Modern advances in statistical data analyses allow for the appropriate analysis of such data.

Methods for the analysis of high dimensional data rely heavily on multivariate statistical methods. Therefore a large part of the course content is devoted to multivariate methods, but with a focus on high dimensional settings and issues.

Multivariate statistical analysis covers many methods. In this course a selection of techniques is covered based on our experience that they are frequently used in industry and research institutes.

The course is taught using case studies with applications from different fields (analytical chemistry, ecology, biotechnology, genomics, …).

This course is part of a larger course series in Data Analysis consisting of 19 individual modules. Find more information and enroll for this module via www.ipvw-ices.ugent.be

Program

  1. Dimension reduction: Singular Value Decomposition (SVD), Principal Component Analysis (PCA), Multidimensional Scaling (MDS) and biplots for dimension-reduced data visualisation
  2. Sparse SVD and sparse PCA
  3. Prediction with high dimensional predictors: principal component regression; ridge, lasso and elastic net penalised regression methods
  4. Classification (prediction of class membership): (penalised) logistic regression and linear discriminant analysis
  5. Evaluation of prediction models: sensitivity, specificity, ROC curves, mean squared error, cross validation
  6. Clustering
  7. Large scale hypotheses testing: FDR, FDR control methods, empirical Bayes (local) FDR control

Course number:
DA2122-M8
Type:
Short- en long-term programmes
Area of interest:
AI and Data Science, Sciences
Language:
EN
Academic year:
2021 - 2022
Starting date:
07.02.2022
Lecturers:
Lieven Clement
Milan Malfait
Contact person:
ipvw.ices@ugent.be
More information

We use cookies to to give you the best possible user experience on our website. You can refuse the installation of cookies. By doing so some parts of this website will not work in an optimal way. Read more.

Your browser does not meet the minimum requirements to view this website. The browsers below are compatible. If you do not have one of these browsers, click on the icon to download the desired browser.