Result filters

Metadata provider

  • Language Resource Switchboard

Language

Resource type

Organisation

  • CLARIN ERIC - Language Resource Switchboard

Active filters:

  • Organisation: CLARIN ERIC - Language Resource Switchboard
  • Metadata provider: Language Resource Switchboard
Loading...
85 record(s) found

Search results

  • EXMARaLDA

    **EXMARaLDA** is a system for working with oral corpora on a computer. It consists of a transcription and annotation tool ([Partitur-Editor](https://exmaralda.org/en/partitur-editor-2/ "Partitur Editor")), a tool for managing corpora ([Corpus-Manager](https://exmaralda.org/en/corpus-manager-coma-2/ "Corpus-Manager (Coma)")) and a query and analysis tool ([EXAKT](https://exmaralda.org/en/exakt-3/ "EXAKT")). **EXMARaLDA's** features include, for instance: - time-aligned transcription of digital audio or video - flexible annotation for freely choosable categories, - systematic documentation of a corpus through metadata - flexible output of transcription data in various layouts and formats (notation, document) - computer-assisted querying of transcription, annotation and metadata - interoperable as it works XML based data formats that allow for data exchange with other tools (like Praat, ELAN, Transcriber etc.) and enable a flexible processing and sustainable usage of the data. **EXMARaLDA** is used by [researchers world wide](https://exmaralda.org/en/projects/ "Projekte") in different contexts in which spoken language is analysed, these include: - conversation and discourse analysis, - study of language acquisition and multilingualism, - phonetics and phonology, - dialectology and sociolinguistics. **EXMARaLDA** was developed in the project "Computer assisted methods for the creation and analysis of multilingual data" at the Collaborative Research Center "Multilingualism" (Sonderforschungsbereich "Mehrsprachigkeit" – SFB 538) at the University of Hamburg. Since July 2011, the development of EXMARaLDA is continued at the [Hamburg Centre for Language Corpora](https://corpora.uni-hamburg.de/drupal/en), since November 2011 in cooperation with the [Archive for Spoken German](http://agd.ids-mannheim.de/index.shtml) at the Institute for the German Language in Mannheim.
  • WebLicht Tokenization TUR

    WebLicht Easy Chain for tokenization of Turkish texts. The pipeline makes use of WebLicht's TCF converter, and the tokenizer from the OpenNLP project. The 'newlineBounds' parameter treats newlines as a hard break (a sentence boundary). WebLicht's built-in viewer for annotations can be used to visualize the processing result.
  • WebLicht Lemmas DE

    WebLicht Easy Chain for Lemmatization (German). The pipeline makes use of WebLicht's TCF converter, the IMS tokenizer, and the IMS TreeTagger. WebLicht's Tundra can be used to visualize the result.
  • WebLicht POSTags Lemmas IT

    WebLicht Easy Chain for POS Tagging and Lemmatization (Italian). The pipeline makes use of WebLicht's TCF converter, the IMS tokenizer, and the POS Tagger from the OpenNLP project. The model for Italian is trained on a relatively small training corpus (MIDT) and should therefore be considered experimental. WebLicht's Tundra can be used to visualize the result.
  • WebLicht POSTags Lemmas DE

    WebLicht Easy Chain for POS Tagging and Lemmatization (German). The pipeline makes use of WebLicht's TCF converter, the IMS tokenizer, and the IMS TreeTagger. WebLicht's Tundra can be used to visualize the result.
  • iDAI.vocab FR

    The German Archaeological Institute is a scientific institution of the confederation in the business area of the Department of Foreign Affairs.
  • Wikipedia Search EN

    Wikipedia is an online free-content encyclopedia that you can edit and contribute to. Wikipedia co-founder Jimmy Wales has described Wikipedia as "an effort to create and distribute a free encyclopedia of the highest possible quality to every single person on the planet in their own language." Wikipedia exists to bring knowledge to everyone who seeks it.
  • WebLicht Const Parsing DE

    WebLicht Easy Chain for Constituency Parsing (German). The pipeline makes use of WebLicht's TCF converter, the tokenizer and sentence boundary detector of the IMS/Stuttgart, and the constituent parser from the Berkeley NLP project. WebLicht's Tundra can be used to visualize the result.