UK Data Service: Data Skills | SSH Training Discovery Toolkit

Source

This source includes interactive modules designed for users who want to get to grips with key aspects of survey, longitudinal and aggregate data as well as tools that can be used to assess and improve data quality.

Modules can be conducted in your own time and you are able to dip in and out when needed. The modules give an introduction to key aspects of the data using short instructional videos, interactive quizzes and activities using open access software where possible. Tools include guides, documentation and exercises.

Responsible organisation

Disciplines

Intended audience

Formats

License

CC-BY-4.0

Language

English

Curated topics

Open Science

Quantitative data analysis

Survey data

Topics

Data Analysis

data discovery and data reuse

Data anonymisation

Data quality

Status

Continous curation

Contact

UK Data Service

help@ukdataservice.ac.uk

UKDS Training

ukdstraining@manchester.ac.uk

Access points

UK Data Service: Data Skills

Export
schema.org (JSON)

Items

Title	Description	Collections
Data Skills Modules	These introductory level interactive modules are designed for users who want to get to grips with key aspects of survey, longitudinal and aggregate data. Modules can be conducted in your own time and you are able to dip in and out when needed. The modules give an introduction to key aspects of the data using short instructional videos, interactive quizzes and activities using open access software where possible. Each module stands alone but those with little experience of surveys may find it useful to start with the Survey Data Module before moving on to the Longitudinal Data Module. Modules include: Survey Data, Longitudinal Data, Aggregate Data	Training Discovery Toolkit
QAMyData	QAMyData is an easy-to-use, open source tool that provides a health check for numeric data. The tool uses automated methods to detect and report on some of the most common problems in survey or numeric data, such as missingness, duplication, outliers and direct identifiers. The tool offers a number of configurable tests that have been categorised into four types: file, metadata, data integrity, and identifiers, which can be run on popular file formats, including SPSS, Stata, SAS and CSV. A standard config file has default settings for each test, such as a threshold for pass or fail on various tests (e.g. detect value label that are truncated, email addresses identified as a string, or undefined missing values) which can be easily adapted to meet the user’s own desired thresholds. The configuration feature allows the creation of a unique Data Quality Profile. The software creates a ‘data health check’ that details errors and issues as both a summary and detailed report, providing a location of the failed test. New tests can easily be added. Data depositors and publishers can act on the results and resubmit the file until a clean bill of health is produced.	Training Discovery Toolkit