CLARIN-D for Speech and Other Modalities

CLARIN-D supports research in the area of spoken language and language modalities. This support is provided through services for searching, analysing and archiving spoken language resources. Additionally, CLARIN-D provides tools for analysing written and spoken language resources. The Working Group 5 "Speech and Other Modalities" is a network of scholars concerned with spoken language within CLARIN-D.

Data for research

The Virtual Language Observatory (VLO) provides access to many resources for research, e.g. Speech corpora in German.

A dedicated full text search, the Federated Content Search (FCS) of CLARIN-D, allows researchers to search the full text of many resources of the CLARIN community. This way it is possible to find examples for the usage of terms. The search results and the source documents can be extracted and saved as a corpus for further analysis.

→ More about "Accessing "

Software for research projects

CLARIN-D provides software and web services for the analysis and preparation of language data. This includes WebAnno for the manual and semi-automatic annotation of texts or WebLicht for the automatic annotation of texts with a variety of tools. These can be combined according to the needs and preferences of the user.

Tools for working with spoken language:

→ More about "Analysing"

Providing your own research data

Apart from tools for the analysis of language data, the CLARIN network allows archiving one's own research data and providing it to the research community for reuse. By cooperating with a CLARIN centre, the data can be prepared in a way that it is sufficiently described. For example, one tool for describing data is the CMDI-Maker, for creating descriptions that allow easy access for the research community.

Would you like to provide your data through the CLARIN-D infrastructure? Contact a specialised centre or contact the CLARIN-D Helpdesk.

→ More about "Preparation and Depositing"

Contacts in the disciplines

Within CLARIN-D, the disciplines are organised in Working Groups (WGs). Speech researchers are represented by WG5 »Speech and Other Modalities«. The working group addresses the challenges of building corpora that not only contain speech but other paraverbal or nonverbal modalities such as intonation, amplitude, gesture, facial expression, body posture, etc. Multimodal corpora need to capture the specific features of the different modalities and, in addition, need to make explicit the important intermodal structures and relationships.

The working group brings together researchers from different disciplines (Psychology, Linguistics, Neurology, Philology, etc.), who work mainly with multimodal language data. The researchers are accompanied by computer scientists who use data-based approaches to build multimodal user interfaces and develop annotation tools and data management systems. These experts join forces to identify overarching requirements, develop convertible and assessable data formats, strengthen the exchange of data and tools, and establish a dialogue on topics between the different disciplines.

Responsible CLARIN-D Centres

PD Dr. Florian Schiel (Bayrisches Archiv für Sprachsignale, Ludwig-Maximilians-Universität München)
PD Dr. Christoph Draxler (Bayrisches Archiv für Sprachsignale, Ludwig-Maximilians-Universität München)
Dr. Sebastian Drude (Max Planck Institut für Psycholinguistik, Nijmegen)
Han Sloetjes (Max Planck Institut für Psycholinguistik, Nijmegen)

Members

Uni Bielefeld: Katharina Rohlfing, David Schlangen, Stefan Kopp, Angelika Hoenemann, Peter Menke, Andreas Windmann, Barbara Samlowski, Angela Grimminger, Stavros Skopeteas
Uni Erlangen: Elmar Noeth
Uni Frankfurt: Jost Gippert
Uni Hamburg: Daniel Jettka
Uni Kiel: Oliver Niebuhr
Uni Münster: Ulrike Gut, Robert Fuchs
Uni Saarbrücken: Jürgen Trouvain, Bernd Möbius, Zofia Malisz
Uni Stuttgart: Antje Schweitzer, Katrin Schweitzer
Telekom Berlin: Felix Burkhardt
TU Berlin: Benjamin Weiss
ZAS Berlin: Susanne Fuchs
Uni Zürich: Volker Dellwo

Head and Contact

Prof. Dr. Petra Wagner, Universität Bielefeld
Fakultät für Linguistik und Literaturwissenschaft
Universität Bielefeld
Postfach 10 01 31
33501 Bielefeld, Germany
petra.wagner@uni-bielefeld.de, +49 521 106 3510

Simon Betz (Coordinator)
simon.betz@uni-bielefeld.de, +49 521 106 6881

Resources of the discipline for the discipline

During the implementation phase of CLARIN-D, the WG identified important resources and tools, which have been developed and prepared for reuse. These small projects are called curation projects within CLARIN-D.

Curation projects

Curation project 1: "Integration multimodaler Ressourcen in CLARIN-D": | Project information
Curation project 2: "Vorbereitung einer Toolchain für multimodale Daten in CLARIN-D": | Project information
Curation project 3: "Rich Phonetic Annotations of Speech Data in CLARIN-D: WebMAUS Evaluation and Integration Process Documentation based on the ICE-Scotland Corpus" | Project information