Data Statement for the Public DGS Corpus (v3)

Marc Schulder, Dolly Blanck, Thomas Hanke, Ilona Hofmann, Sung-Eun Hong, Olga Jeziorski, Lutz König, Susanne König, Reiner Konrad, Gabriele Langer, Rie Nishio, Christian Rathmann

February, 2024

Type

Report

Publication

Project Note AP06-2020-01

Versions

Latest version:
Version 3:
Version 2:
Version 1:

Abstract

This data statement of the Public DGS Corpus provides information relevant to judging the nature of the language content of the corpus. It covers how the corpus was curated, specifies the language varieties it covers, and provides demographic information for participants and annotators. It also describes the technical and sociological conditions under which the language data was recorded as well as its topical characteristics. The data statement provides a general overview, supported by references to a variety of publications that cover individual topics in more detail.

Marc Schulder

Research Associate in Computational Linguistics

My research interests include sign languages, natural language processing, and open science.