Train-the-trainer concept for research data management

As part of the FDMentor project, a German-language train-the-trainer program on research data management was created and piloted in a series of workshops. Comments and tips from participants in the two pilot phases, along with feedback from the relevant community, were gradually incorporated over the course of 2019. The second version of the train-the-trainer concept, now available, offers a revised script with the contents of the teaching units, detailed teaching scripts, working materials, lecture slides, and numerous worksheets and templates intended to support teaching. The topics covered include aspects of research data management, such as data management plans and the publication of research data, as well as didactic units on learning concepts.

Applied Data Visualization

The workshop Applied Data Visualization introduces students to the theory and methods underlying data visualization. Data analysts face an ever-increasing amount of data (big data), and recent technological developments allow researchers to engage with data visually in unprecedented ways. Hence, data visualization is one of the most exciting fields in data science right now. In this workshop, students acquire the skills to visualize data in R, both for exploratory purposes and for explanation and presentation. We'll rely on R, the most popular statistical programming environment when it comes to visualization, and we'll make use of popular R packages such as ggplot2 and plotly. Besides creating static graphs, we'll also look at interactive graphs and discuss how interactive visualization may revolutionize how we present data and findings.

Data Management, Advanced Programming and Automation using Stata

During the workshop, participants will learn how to handle data management problems efficiently with Stata, and how to avoid repetition by automating (and programming) tasks. The workshop is not an introduction to Stata, but will feature best practices of Stata usage for modifying existing do-files (or creating new ones) so that they are reproducible, maintainable and efficient. The tips and tricks refer mainly to data preparation and management, but they can also be used for the automation of data analysis. The workshop will present some ideas about these topics, but will focus on interactive work in which participants learn to produce efficient Stata syntax by themselves.


University of Tübingen repository

The CLARIN repository at the University of Tübingen offers long-term preservation of digital resources, along with their descriptive metadata.

The mission of the repository is to ensure the availability and long-term preservation of resources, to preserve knowledge gained in research, to aid the transfer of knowledge into new contexts, and to integrate new methods and resources into university curricula.

The repository is part of the eScience infrastructure of the University of Tübingen, a core facility that cooperates closely with the university's library and computing center.

Integration of the repository into the national CLARIN-D and international CLARIN infrastructures gives it wide exposure, increasing the likelihood that the resources will be used and further developed beyond the lifetime of the projects in which they were developed.

Among the resources currently available in the Tübingen Center Repository, researchers can find widely used treebanks of German (e.g. TüBa-D/Z), the German wordnet (GermaNet), the first manually annotated digital treebank (Index Thomisticus), as well as descriptions of the tools used by the WebLicht ecosystem for natural language processing.


The HZSK is a CLARIN centre that accepts corpora and other linguistic resources from research projects and other contexts in order to make them available, mainly to the academic community, for research and teaching purposes. The focus of the HZSK is on spoken, multilingual and multimodal corpora, and on (spoken) corpora in languages other than German, especially lesser-resourced or endangered languages.

Bavarian Archive for Speech Signals

Depositing service for corpora of spoken language that contain at least one measured signal based on the physical processes of speech production (e.g. acoustic signals, videos, series of measurements, series of pictures).

TeLeMaCo: Browse

The TeLeMaCo search engine, with which learning materials can be browsed by keyword or author.

Formal Ontologies: A Complete Novice’s Guide

This module is aimed specifically at those who are not yet familiar with ontologies as a means of research data management. It will take you through some of the main features of ontologies and the reasons for using them.

GESIS Training

At GESIS we offer a wide range of events, especially training courses on empirical social research methods. Our theory-based, hands-on courses develop participants’ methodological skills and are aimed at both early-career and senior researchers from Germany, Europe, and around the world.

FD Mentor

Roadmaps, recommendations for action and good practice examples for the strategic development and improvement of research data management at German-speaking universities are developed for subsequent use. The focus is on tools for strategy development in research data management, models for institutional research data policies, and a consulting and training concept.