Courses

Participants select either one module or two modules provided that the timeslots do not overlap (see Programme).

A novelty in 2024 is the option to follow a whole “track” of either introductory-level or advanced statistical modules as a full curriculum. If you are interested in statistical training, then you can select the track which best suits your background knowledge:

Module 1: Introduction to R

R is a widely-used programming language for statistical data analysis. This beginner-friendly module aims to provide participants with a solid foundation in R, empowering them to explore, analyze, and visualize data efficiently. Emphasis is placed on hands-on practical exercises and real-world examples, enabling students to immediately apply their knowledge.

read more …

Module 2: Categorical data analysis with R

This course offers an introduction to categorical data analysis which is specifically geared towards researchers in the field of corpus linguistics, variational linguistics, sociolinguistics and translation/interpreting studies. Various inferential statistical analyses are presented and discussed, with a strong focus on hypothesis testing rather than data exploration; authentic data examples from linguistics and translation studies will be provided.

read more …

Module 3: Natural Language Processing with Python

Natural Language Processing or NLP is a discipline that focuses on the interaction between data science and human language and gives the machines the ability to read, understand and derive meaning from human languages. The participants of this course will learn about different NLP techniques and basic programming skills.
read more …

Module 4: PRAAT

This course will introduce Praat scripting. By using scripts it will be much easier to replicate your analyses on speech files and to communicate with others about what you have done and how you have done it.
read more …

Module 5: ELAN

This module is not offered in 2024.

Module 6: Eye-tracking

Over the last decades, eye tracking has become a wide-spread technique to understand how people process and learn language. In this course, we will cover a basic introduction to eye-tracking techniques in language science.
read more …

Module 7: Survey design

This module is not offered in 2024.

Module 8: Linguistic ethnography

In this course, participants will be introduced to the basic ideas behind qualitative ethnographic research methods in the context of linguistics, focusing on the ingredients required for this process, i.e. data collection and analysis. We will discuss the most important concepts in ethnographic research, what counts as valid data, what is required in data collection, good practices, ethics, mixing methods, and how to describe, analyse and interpret data.

read more …

Module 9: Bayesian data analysis

This course introduces modern Bayesian statistics using the probabilistic programming language Stan and the front-end brms, used with R.

read more …

Module 10: Multivariate data analysis with R

This module offers an overview of the most important techniques for analyzing multivariate data, i.e. data involving several (correlated) variables. Such multivariate data arise often in studies involving language, e.g. research into language attitudes, reaction times to stimuli or cooccurrence frequencies in corpora. In addition, word embeddings in NLP share various ideas with multivariate statistical techniques so these similarities will also be touched upon.

read more …

Module 11: Research data management

This in-depth course will help students to develop their knowledge and practical skills in handling and managing the research data they collect. Having these skills becomes increasingly important to researchers seeking to advance their careers. Essential key-concepts and skills in Research Data Management (RDM) will tackled.

read more …

Module 12: Introduction to statistics with R

In this module, you will learn how to analyse linguistic data with R, Rstudio, and the tidyverse. We will cover descriptive and inferential statistics as well as linear modelling. Each lesson consists of a theoretical part followed by hands-on exercises in R.

read more …