The SSH Training Discovery Toolkit provides an inventory of training materials relevant for the Social Sciences and Humanities.

Use the search bar to discover materials or browse through the collections. The filters will help you identify your area of interest.

 

Researcher

Item
Title Body
UK Business Data User Conference 2021

The UK Data Service and the Office for National Statistics (UK) held a business data conference in 2021. This resource provides all slides, recordings and other useful materials from the conference. 

The conference included a mixture of presentations from the  Office for National Statistics  (ONS), UK Data Service, and researchers who have used data from the UK Surveys covering the areas of business, industry and trade. 

Linking external business data to ONS business data

The UK Data Service is able to assist researchers wishing to link external business data to ONS business data in the Secure Lab. This guide provides all the information needed to guide users wishing to link external business data to ONS business data. 

Secure Lab User Guide

This is a guide to the UKDS Secure Lab. Within this guide, users can find everything they need to know in order to access and use the highly detailed, sensitive data available via the UKDS SecureLab.

Voices of the Parliament: A Corpus Approach to Parliamentary Discourse Research

While corpus methods are widely used in linguistics, including gender analysis, this tutorial shows the potential of richly annotated language corpora for research of the socio-cultural context and changes over time that are reflected through language use. The tutorial encourages students and scholars of modern languages, as well as users from other fields of digital humanities and social sciences who are interested in the study of socio-cultural phenomena through language, to engage with user-friendly digital tools for the analysis of large text collections. The tutorial is designed in such a way that it takes full advantage of both linguistic annotations and the available speaker and text metadata to formulate powerful quantitative queries that are then further extended with manual qualitative analysis in order to ensure adequate framing and interpretation of the results.

The tutorial demonstrates the potential of parliamentary corpora research via concordancers without the need for programming skills. No prior experience in using language corpora and corpus querying tools is required in order to follow this tutorial. While the same analysis could be carried out on any parliamentary corpus with similar annotations and metadata, in this tutorial we will use the siParl 2.0 corpus which contains parliamentary debates of the National Assembly of the Republic of Slovenia from 1990 to 2018. Knowledge of Slovenian is not required to follow the tutorial. To reproduce the analyses in other languages, we invite you to explore a parliamentary corpus of your choice from those available through CLARIN.

 

Taken from: Teaching with CLARIN: 

Privacy by Design in Research

This is privacy by design in research! The aim of this training material is to show researchers how to perform a data protection impact assessment for an innovative research scenario to enable responsible re-use of archived speech corpora.

The project website provides help to develop learning goals.

 

Taken from: Teaching with CLARIN: https://www.clarin.eu/content/privacy-design-research

Introduction to Speech Analysis

This course offers a general picture of managing speech corpora and of the methods that are available for the acoustic-phonetic study of speech. During the course, students use a speech analysis program called Praat and learn to apply the main features of the program in their own work with speech recordings. In addition, students will learn the basics of another program called ELAN that can be used for transcribing and annotating audio as well as video material.

Taken from: Teaching with CLARIN: https://www.clarin.eu/content/introduction-speech-analysis

GATE Training Course

The training materials are all based around teaching the use of GATE, a freely available open-source toolkit for Natural Language Processing that has been widely used in both academia and industry for many different tasks.

The modules provide instruction on how to get to grips with the GATE toolkit for basic language processing, as well as more advanced techniques, and include a number of different scenarios, such as processing social media, hate speech and misinformation detection. They include modules both for programmers who want to further develop their own tools within the toolkit, and for non-programmers who want to just make use of existing tools. The modules teach not only the use of GATE itself, but also how to adapt it to one’s own needs (for example, to adapt English tools to a different language, or how to customise existing tools), and also the basic concepts around a number of language processing tasks including both low-level (tokenisation, POS tagging, parsing) to more sophisticated (information extraction, social media analysis, hate speech detection, misinformation detection), as well as how to interpret and integrate the results of the processing. Finally, it teaches programmers how to extend the toolkit itself, by adding new tools or integrating it into other systems.

 

Taken from Teaching with CLARIN: https://www.clarin.eu/content/gate-training-course 

Teaching ideas: Guides for teaching data analysis

This resource is a collection of short guides designed to make lesson planning more efficient for those teaching data analysis skills. Drawing on real classroom experiences, each guide includes suggested research questions, dataset and exercises:

  • Gender differences in sexual attitudes (PDF) (using the National Survey of Sexual Attitudes and Lifestyles).
  • Risk factors associated with increased levels of systolic blood pressure (PDF) (using the Health Survey for England).
  • The gender gap in life satisfaction (PDF) (using the Opinions and Lifestyle Survey).
  • Public confidence in the police (PDF) (using the Crime Survey for England and Wales).
Source
Title Body
UK Data Service: Survey and Census Data

This source includes a series of resources in the form of guides and e-books dedicated to trainers and students interested in sample design, weighting, changes over time using cross sectional and longitudinal data and mapping census data using different software.

UK Data Service: Secure Lab

The UK Data Service SecureLab has enabled secure access to the most sensitive and confidential data in the collection since 2011. SecureLab provides controlled access to data that are too detailed, sensitive or confidential to be made available under less restrictive access levels.