The SSH Training Discovery Toolkit provides an inventory of training materials relevant for the Social Sciences and Humanities.

Use the search bar to discover materials or browse through the collections. The filters will help you identify your area of interest.

 

Long time archiving of language resources and tools

Item
Title Body
Oxford Text Archive

The Oxford Text Archive (OTA) provides repository services for literary and linguistic datasets. In that role the OTA collects, catalogues, preserves and distributes high-quality digital resources for research and teaching. We currently hold thousands of texts in more than 25 languages, and are actively working to extend our catalogue of holdings. The OTA relies upon deposits from the wider community as the primary source of materials. The OTA is part of the CLARIN European Research Infrastructure; it is registered as a CLARIN centre, and OTA services are part of the University of Oxford's contribution to the CLARIN-UK Consortium.

CLARIN:EL

clarin:el is the Greek national network of language resources, a nation-wide Research Infrastructure devoted to the sustainable storage, sharing, dissemination and preservation of language resources.

The Central Aggregator is the central Repository of the clarin:el Infrastructure, which is responsible for the harvesting of metadata from the local Repositories, the organisation and the presentation of the metadata descriptions in a uniform catalogue and the provision of access to the Language Resources to the network members and to the public.

ORTOLANG

This is a depositing service for spoken and linguistic data.

CLARIN.SI

Depositing service for any linguistic/NLP data and tools.

CLARIN-PL

Depositing service for Polish language resources and services.

Dutch Language Institute

The INT is one of the four CLARIN B Centres in The Netherlands and it serves as an exclusive CLARIN B Centre for Flanders (Belgium). In fulfilling this role the INT provides researchers, (assistant) professors and students with (advice about) data and tools for linguistic research.

The INT also offers assistance and an infrastructure to researchers or institutions that want to share data or tools that were developed in research projects in the social sciences and humanities.

More data and tools for Dutch can be found at CLAPOP, the portal of the Dutch CLARIN community and by means of the CLARIN Virtual Language Observatory, a metadata-based portal for all CLARIN language resources and tools.

HZSK

The HZSK is a CLARIN centre that accepts corpora and other linguistic resources from research projects and other contexts in order to make these available mainly to the academic community for research and teaching purposes. The focus of the HZSK is on spoken, multilingual and multimodal corpora, and (spoken) corpora in other languages than German, especially of lesser-recourced or endangered languages.

Bavarian Archive for Speech Signals

Depositing service for corpora of spoken languages which contain a minimum of at least one measured signal that is based on the physical processes of speech production (e.g. acoustic signals, videos, series of measurements, series of pictures).

FIN-CLARIN

Depositing service for language resources related to Finnish, Finland Swedish and the Fenno-Ugric languages, as well as other language resources created in Finland.

CLARIN-DK-UCPH

The mission of CLARIN-DK is to provide easy and sustainable access for scholars in the humanities and social sciences to digital language data (in written, spoken, video or multimodal form) and to provide advanced tools for discovering, exploring, exploiting, annotating, and analyzing them. CLARIN-DK also shares knowledge on Danish language technology and resources and is the Danish node in the European CLARIN-ERIC. The objective of the CLARIN Centre at the University of Copenhagen is to fulfill the CLARIN-DK mission. The centre provides data management consultation and support in connection with depositing and reuse of research data.