CLARIN Tool Portal

EXMARaLDA

1 resources

**EXMARaLDA** is a system for working with oral corpora on a computer. It consists of a transcription and annotation tool ([Partitur-Editor](https://exmaralda.org/en/partitur-editor-2/ "Partitur Editor")), a tool for managing corpora ([Corpus-Manager](https://exmaralda.org/en/corpus-manager-coma-2/ "Corpus-Manager (Coma)")) and a query and analysis tool ([EXAKT](https://exmaralda.org/en/exakt-3/ "EXAKT")). **EXMARaLDA's** features include, for instance: - time-aligned transcription of digital audio or video - flexible annotation for freely choosable categories, - systematic documentation of a corpus through metadata - flexible output of transcription data in various layouts and formats (notation, document) - computer-assisted querying of transcription, annotation and metadata - interoperable as it works XML based data formats that allow for data exchange with other tools (like Praat, ELAN, Transcriber etc.) and enable a flexible processing and sustainable usage of the data. **EXMARaLDA** is used by [researchers world wide](https://exmaralda.org/en/projects/ "Projekte") in different contexts in which spoken language is analysed, these include: - conversation and discourse analysis, - study of language acquisition and multilingualism, - phonetics and phonology, - dialectology and sociolinguistics. **EXMARaLDA** was developed in the project "Computer assisted methods for the creation and analysis of multilingual data" at the Collaborative Research Center "Multilingualism" (Sonderforschungsbereich "Mehrsprachigkeit" – SFB 538) at the University of Hamburg. Since July 2011, the development of EXMARaLDA is continued at the [Hamburg Centre for Language Corpora](https://corpora.uni-hamburg.de/drupal/en), since November 2011 in cooperation with the [Archive for Spoken German](http://agd.ids-mannheim.de/index.shtml) at the Institute for the German Language in Mannheim.

DARIAH DKPro-Wrapper: POS-Tagging und Lemmatization DE

1 resources

The DARIAH DKPro Wrapper is a wrapper for DKPro Core, a tool for linguistic annotation.

WebLicht Tokenization TUR

1 resources

WebLicht Easy Chain for tokenization of Turkish texts. The pipeline makes use of WebLicht's TCF converter, and the tokenizer from the OpenNLP project. The 'newlineBounds' parameter treats newlines as a hard break (a sentence boundary). WebLicht's built-in viewer for annotations can be used to visualize the processing result.

WebLicht Lemmas DE

1 resources

WebLicht Easy Chain for Lemmatization (German). The pipeline makes use of WebLicht's TCF converter, the IMS tokenizer, and the IMS TreeTagger. WebLicht's Tundra can be used to visualize the result.

WebLicht POSTags Lemmas IT

1 resources

WebLicht Easy Chain for POS Tagging and Lemmatization (Italian). The pipeline makes use of WebLicht's TCF converter, the IMS tokenizer, and the POS Tagger from the OpenNLP project. The model for Italian is trained on a relatively small training corpus (MIDT) and should therefore be considered experimental. WebLicht's Tundra can be used to visualize the result.

WebLicht POSTags Lemmas DE

1 resources

WebLicht Easy Chain for POS Tagging and Lemmatization (German). The pipeline makes use of WebLicht's TCF converter, the IMS tokenizer, and the IMS TreeTagger. WebLicht's Tundra can be used to visualize the result.

iDAI.vocab FR

1 resources

The German Archaeological Institute is a scientific institution of the confederation in the business area of the Department of Foreign Affairs.

Opener Tokenizer

1 resources

Tokenizer for Dutch, English, German, French, Spanish and Italian. Consumes Plain text and produces TCF.

Wikipedia Search EN

1 resources

Wikipedia is an online free-content encyclopedia that you can edit and contribute to. Wikipedia co-founder Jimmy Wales has described Wikipedia as "an effort to create and distribute a free encyclopedia of the highest possible quality to every single person on the planet in their own language." Wikipedia exists to bring knowledge to everyone who seeks it.

WebLicht Const Parsing DE

1 resources

WebLicht Easy Chain for Constituency Parsing (German). The pipeline makes use of WebLicht's TCF converter, the tokenizer and sentence boundary detector of the IMS/Stuttgart, and the constituent parser from the Berkeley NLP project. WebLicht's Tundra can be used to visualize the result.

Result filters

Metadata provider

Language

Resource type

Tool task

Organisation

Keywords

Active filters:

Search results

EXMARaLDA

DARIAH DKPro-Wrapper: POS-Tagging und Lemmatization DE

WebLicht Tokenization TUR

WebLicht Lemmas DE

WebLicht POSTags Lemmas IT

WebLicht POSTags Lemmas DE

iDAI.vocab FR

Opener Tokenizer

Wikipedia Search EN

WebLicht Const Parsing DE

Result filters

Metadata provider

Language

Resource type

Tool task

Organisation

Keywords

Active filters:

Search results

Session recording