CLARIN Tool Portal

Active filters:

Tool task: Tagging

33 record(s) found

Search results

Universal Dependencies 2.15 models for UDPipe 2 (2024-11-21)

2 resources

Tokenizer, POS Tagger, Lemmatizer and Parser models for 147 treebanks of 78 languages of Universal Depenencies 2.15 Treebanks, created solely using UD 2.15 data (https://hdl.handle.net/11234/1-5787). The model documentation including performance can be found at https://ufal.mff.cuni.cz/udpipe/2/models#universal_dependencies_215_models . To use these models, you need UDPipe version 2.0, which you can download from https://ufal.mff.cuni.cz/udpipe/2 .

Use "Universal Dependencies 2.15 models for UDPipe 2 (2024-11-21)"
Universal Dependencies 2.12 models for UDPipe 2 (2023-07-17)

2 resources

Tokenizer, POS Tagger, Lemmatizer and Parser models for 131 treebanks of 72 languages of Universal Depenencies 2.12 Treebanks, created solely using UD 2.12 data (https://hdl.handle.net/11234/1-5150). The model documentation including performance can be found at https://ufal.mff.cuni.cz/udpipe/2/models#universal_dependencies_212_models . To use these models, you need UDPipe version 2.0, which you can download from https://ufal.mff.cuni.cz/udpipe/2 .

Use "Universal Dependencies 2.12 models for UDPipe 2 (2023-07-17)"
EvaLatin 2020 models for UDPipe 2 (2020-08-31)

2 resources

POS Tagger and Lemmatizer models for EvaLatin2020 data (https://github.com/CIRCSE/LT4HALA). The model documentation including performance can be found at https://ufal.mff.cuni.cz/udpipe/2/models#evalatin20_models . To use these models, you need UDPipe version at least 2.0, which you can download from https://ufal.mff.cuni.cz/udpipe/2 .

Use "EvaLatin 2020 models for UDPipe 2 (2020-08-31)"
int-pie

2 resources

The PIE tagger with custom modifications by the Dutch Language Institute (INT).
mbt

1 resources

MBT is a memory-based tagger-generator and tagger in one. The tagger-generator part can generate a sequence tagger on the basis of a training set of tagged sequences; the tagger part can tag new sequences. MBT can, for instance, be used to generate part-of-speech taggers or chunkers for natural language processing. It has also been used for named-entity recognition, information extraction in domain-specific texts, and disfluency chunking in transcribed speech.

Use "mbt"
Frog

1 resources

Frog is an integration of memory-based natural language processing (NLP) modules developed for Dutch. It performs automatic linguistic enrichment such as part of speech tagging, lemmatisation, named entity recognition, shallow parsing, dependency parsing and morphological analysis. All NLP modules are based on TiMBL.

Use "Frog"
python-frog

2 resources

Python binding to Frog, an NLP suite for Dutch doing part-of-speech tagging, lemmatisation, morphological analysis, named-entity recognition, shallow parsing, and dependency parsing.

Use "python-frog"
Ucto-Webservice

2 resources

Ucto is a rule-based tokeniser for multiple languages. This is the webservice for it, for both humans and machines.

Use "Ucto-Webservice"
GaLAHaD

2 resources

GaLAHaD (Generating Linguistic Annotations for Historical Dutch) allows linguists to compare taggers, tag their own corpora, evaluate the results and export their tagged documents.

Use "GaLAHaD"
ucto

1 resources

Ucto tokenizes text files: it separates words from punctuation, and splits sentences. This is one of the first tasks for almost any Natural Language Processing application. Ucto offers several other basic preprocessing steps such as changing case that you can all use to make your text suited for further processing such as indexing, part-of-speech tagging, or machine translation.

Use "ucto"

Result filters

Metadata provider

Language

Resource type

Type of tool

Tool task

Field of study

Availability

Organisation

Project

Keywords

Active filters:

Search results

Universal Dependencies 2.15 models for UDPipe 2 (2024-11-21)

Universal Dependencies 2.12 models for UDPipe 2 (2023-07-17)

EvaLatin 2020 models for UDPipe 2 (2020-08-31)

int-pie

mbt

Frog

python-frog

Ucto-Webservice

GaLAHaD

ucto

Result filters

Metadata provider

Language

Resource type

Type of tool

Tool task

Field of study

Availability

Organisation

Project

Keywords

Active filters:

Search results

Session recording