Overview of Datasets for the Sign Languages of Europe

Publikation
EASIER Deliverable D6.1
Diese Publikation ist nur auf Englisch verfügbar.

Versionen

  • Aktuellste Version: DOI
  • Version 1: DOI

Zusammenfassung

This document identifies linguistic corpora that can be explored as high-quality training data for automatic translation within EASIER (as opposed to loosely aligned broadcast data). For each data set, the document lists what parts of the data are available under what access conditions. It also lists the elicitation formats used in several corpora in order to identify those parts of the available corpora that could be explored to build multilingual resources.

In order to support the construction of an interlingual index across European sign languages, the document also lists lexical resources (lexical databases and dictionaries) available and their characteristics.

Marc Schulder
Marc Schulder
Wissenschaftlicher Mitarbeiter für Computerlinguistik

Meine Forschungsinteressen umfassen Gebärdensprachen, Computerlinguistik und Open Science.