Drawing Conclusions from Data: an Introduction
AI en Data Science
AI en Data Science
This course aims to provide insight into basic statistical concepts with emphasis on practical applications. Mathematical formulae will be kept to a minimum. The theory and the methods of analysis will be extensively illustrated with examples relating to a wide variety of different fields. To emphasize the practical approach in this course all classes will take place in a pc room.
This course is part of a larger course series in Data Analysis consisting op 19 individual modules. Find more information and enroll for this module via www.ipvw-ices.ugent.be
The first session will be dedicated to getting to know the software package R (and SPSS). Participants are encouraged to participate in both parts.
We start with concise graphical and numerical descriptions of data obtained from observational or experimental studies. The most common and frequently used probability distributions of discrete and continuous variables will be presented. Statistical inference draws conclusions about a population based on sampled data. Chance variations are taken into account such that a level of confidence is attached to these conclusions.
We present the reasoning behind significance tests for the comparison of observed data with a hypothesis, the validity of which we want to assess. We apply this procedure to data obtained either from one or from two populations.
The correct use of the t-test will be discussed. Nonparametric methods are considered as a possible alternative in case the requirements of the t-test are not met.
We cover the basic concepts of hypothesis testing for categorical data, including the chi-square test.
Quite often the relationship between two variables, where the outcome of one variable is seen as depending on the value of the other, is the focus of scientific interest.
We will give an introduction to linear regression analysis, where a regression line based on observations obtained in a sample describes this relation.
Hands-on exercises are worked out behind the PC using the R software. If preferred, participants can use SPSS.