The CLASSLA-Stanza model for lemmatisation of standard Croatian 2.1
The model for lemmatisation of standard Croatian was built with the CLASSLA-Stanza tool (https://github.com/clarinsi/classla) by training on the hr500k training corpus (http://hdl.handle.net/11356/1792) and using the hrLex inflectional lexicon (http://hdl.handle.net/11356/1232). The estimated F1 of the lemma annotations is ~98.02.
The difference to the previous version is that this version was trained on the new version of the hr500k corpus.