Digital humanities

Covers all the domains of the Humanities, from literature to heritage science, including history, social sciences, linguistics, etc.
Item
Copyright & Related rights

This section is an introduction to the notions of copyright and related rights.

CLARIN Centre Vienna

ARCHE (A Resource Centre for the HumanitiEs) is a service that offers stable and persistent hosting as well as the dissemination of digital research data and resources for the Austrian humanities community. ARCHE welcomes data from all humanities fields.

Newspaper corpora

This is a list of newspaper corpora that are available as part of the CLARIN Resource Families initiative.

Collections of newspapers in digital form are a rich source of information for researchers in a number of disciplines in the Humanities and Social Sciences. They are especially valuable for synchronic as well as diachronic studies in fields ranging from history and media and communication studies to lexicography, for which newspapers are a rich source of neologisms and other lexicographic phenomena.

Literary corpora

This is a list of literary corpora that are available as part of the CLARIN Resource Families initiative.

Literary corpora comprise poetry and fictional prose texts, such as novels, short stories and plays. They bring together the collected works of a single author or of authors representative of a specific literary period. Since the literary corpora are often available through powerful concordancers, they are especially well suited for a quantitative and qualitative approach to comparative literary analysis, within or across different genres and historical periods.

Historical corpora

This is a list of historical corpora that are available as part of the CLARIN Resource Families initiative.

The CLARIN ERIC infrastructure offers access to historical corpora that cover almost all of the languages spoken in countries that are either members or observers of CLARIN ERIC. In the vast majority of cases, the corpora can be downloaded directly from the national repositories or queried through easy-to-use online search environments. They are also richly tagged and mostly available under public licences.

Corpora of academic texts

This is a list of academic corpora that are available as part of the CLARIN Resource Families initiative.

Corpora of academic texts contain scholarly writing: research papers, essays and abstracts published in academic journals, conference proceedings and edited volumes; theses written by students at the undergraduate and graduate levels; and scientific monographs.

Parthenos - For Trainers - Other Teaching Resources

The materials on this web site are intended to assist in bridging that gap, overcoming the general inclination within infrastructure projects to provide only training on tools, rather than finding effective ways to transfer a greater bulk of our experiential knowledge.

These materials are intended for reuse, so please feel free to incorporate them into your courses and syllabi, or direct your students toward them for further learning. Please apply a CC-BY license when you do reuse them, crediting the PARTHENOS Project and the specific lecturer by name (if a video or slide deck).

This item focuses on the other teaching resources that are available, such as links to suggested course outlines, downloadable diagrams and checklists.

Source
CLARIN Legal Information Platform

The platform aims to introduce researchers to the basic notions of the legislative and licensing framework in Europe on Copyright and Data Protection:

  • Introduction to Copyright and Related Rights
  • Licensing Practice
  • Personal Data Protection

It also includes proposals for:

  • Further reading/Bibliography on Legal and Ethical Issues
  • Useful links on Legal and Ethical Issues

CLARIN Depositing Services

One of the fundamental services of the CLARIN infrastructure is making sure that language resources can be archived and made available to the community in a reliable manner. To help researchers store their resources (corpora, lexica, audio and video recordings, annotations, grammars, etc.) in a sustainable way, many of the CLARIN centres offer a depositing service. These centres are willing to store the resources in their repositories and to assist with the technical and organisational details. This has a wide range of advantages:

  • Long-term archiving: a storage guarantee can be given for a long period (up to 50 years in some cases).
  • Resources can be cited easily with a persistent identifier.
  • The resources and their metadata will be integrated into the infrastructure, making it possible to search them efficiently.
  • Password-protected resources can be made available via an institutional login.
  • Once resources are integrated in the CLARIN infrastructure, they can be analyzed and enriched more easily with various linguistic tools (e.g. automated part-of-speech tagging, phonetic alignment or audio/video analysis).

CLARIN Resource Families

The aim of the CLARIN Resource Families initiative is to provide a user-friendly overview of the available language resources in the CLARIN infrastructure for researchers from digital humanities, social sciences and human language technologies. The overviews are organized according to the types of data in the resources and include listings sorted by language.

The listings include the most important metadata and brief descriptions, such as resource size, text sources, time periods, annotations and licences as well as links to download pages and concordancers, whenever available. In addition to the resources found in the CLARIN infrastructure, CLARIN Resource Families provides an overview of other existing valuable language resources which have not yet been integrated in the infrastructure.

CLARIN Resource Families also provides hyperlinks to other relevant materials, such as the thematic CLARIN workshops and tutorials and their accompanying video lectures, as well as a list of key publications on the resources surveyed.