Paper-Conference

EASIER Notation – A Proposal for a Gloss-based Scripting Language for Sign Language Generation Based on Lexical Data
Dieser Artikel ist ©2023 IEEE. Ich empfehle die Nutzung der akzeptierten Fassung des Artikels, welche öffentlich zugänglich ist. Für die closed access veröffentlichte Fassung folgen sie der URL in der Zitiervorlage, wobei zu beachten ist, dass in jener Fassung die meisten Links und Verknüpfungen defekt sind. Die akzeptierte Fassung hat dieses Problem nicht.
Diese Publikation ist nur auf Englisch verfügbar.

Abstract

We introduce EASIER Notation, a gloss-based scripting language to describe sign language content to be signed by an avatar and describe the functionality a lexical database for a sign language needs to provide in order to fully support the notation approach. In addition, we present the prototype of a text editor supporting EASIER Notation for human post-editing of machine translation output as well as pre-scribing signed utterances from scratch.

The Sign Language Interchange Format: Harmonising Sign Language Datasets for Computational Processing
Dieser Artikel ist ©2023 IEEE. Ich empfehle die Nutzung der akzeptierten Fassung des Artikels, welche öffentlich zugänglich ist. Für die closed access veröffentlichte Fassung folgen sie der URL in der Zitiervorlage, wobei zu bBeachten ist, dass in jener Fassung die meisten Links und Verknüpfungen defekt sind. Die akzeptierte Fassung hat dieses Problem nicht.
Diese Publikation ist nur auf Englisch verfügbar.

Abstract

We introduce the Sign Language Interchange Format, a new format for representing annotations and lexical inventories of sign language datasets. The format is designed as an intermediate step in data preparation for language technologies, unifying the annotation conventions of different corpora for further use. Complex gloss notations and implicit relations between tiers are made explicit through a hierarchy of machine-readable container structures. Sample implementations for converting to and from the new format are provided.

Facilitating the Spread of New Sign Language Technologies across Europe
Diese Publikation ist nur auf Englisch verfügbar.

Zusammenfassung

For developing sign language technologies like automatic translation, huge amounts of training data are required. Even the larger corpora available for some sign languages are tiny compared to the amounts of data used for corresponding spoken language technologies. The overarching goal of the European project EASIER is to develop a framework for bidirectional automatic translation between sign and spoken languages and between sign languages. One part of this multi-dimensional project is that it will pool available language resources from European sign languages into a larger dataset to address the data scarcity problem. This approach promises to open the floor for lower-resourced sign languages in Europe. This article focusses on efforts in the EASIER project to allow for new languages to make use of such technologies in the future. What are the characteristics of sign language resources needed to train recognition, translation, and synthesis algorithms, and how can other countries including those without any sign resources follow along with these developments? The efforts undertaken in EASIER include creating workflow documents and organizing training sessions in online workshops. They reflect the current state of the art, and will likely need to be updated in the coming decade.

Introducing Sign Languages to a Multilingual Wordnet: Bootstrapping Corpora and Lexical Resources of Greek Sign Language and German Sign Language
Diese Publikation ist nur auf Englisch verfügbar.

Zusammenfassung

Wordnets have been a popular lexical resource type for many years. Their sense-based representation of lexical items and numerous relation structures have been used for a variety of computational and linguistic applications. The inclusion of different wordnets into multilingual wordnet networks has further extended their use into the realm of cross-lingual research. Wordnets have been released for many spoken languages. Research has also been carried out into the creation of wordnets for several sign languages, but none have yet resulted in publicly available datasets. This article presents our own efforts towards an inclusion of sign languages in a multilingual wordnet, starting with Greek Sign Language (GSL) and German Sign Language (DGS). Based on differences in available language resources between GSL and DGS, we trial two workflows with different coverage priorities. We also explore how synergies between both workflows can be leveraged and how future work on additional sign languages could profit from building on existing sign language wordnet data. The results of our work are made publicly available.