Slavic Department Freiburg (Achim Rabus) · PyLaia · Published December 12, 2022

Handwritten Glagolitic

Text Recognition

Description

This model is based on GT from the manuscripts Cod. Vind. Slav. 3 (Breviary of Vid of Omišalj) and II. beramski brevijar. It can be used for transcribing different hands of 14th and 15th century Croatian Glagolitic handwriting. The model automatically transcribes into the Latin script and is capable of dealing with ligatures and resolving the most common abbreviations. It was trained at the Slavic Department Freiburg (Achim Rabus). GT data was kindly provided by Sanja Zubčić (Rijeka) and Jagoda and Guido Kappel (Vienna).

Try this model

Use this modelOpen in Transkribus
Low error rate5.6% CER

Character Error Rate (CER) measures the percentage of characters incorrectly recognised. Lower is better. This model scored 5.6% on its validation set. As a rule of thumb, a CER below 10% is considered good for most handwritten material. This is a larger model trained on diverse material, which generally makes it more robust across different handwriting styles. That said, larger training sets also make it harder to push the CER down further.

Measured on the model's own validation data. Results on your documents may differ depending on handwriting style, document condition, language, and how closely your material resembles the training data.

Words171,082
Lines31,035
Training Pages531
Model ID48703
Languages
Church SlavicCroatian
Centuries
14th c.15th c.