Research Centre of the Slovenian Academy of Sciences and Arts, Marko Kunavar (marko.kunavar01 · PyLaia · Published April 4, 2023

Slovenian 18th century manuscript

Text Recognition

Description

Based on the 18th-century collection of sermons Pridige Od Kerſzhanske Pokore by Konrad Branka OFM, written in Bohorič orthography.

Very low error rate: 2.8% CER

Character Error Rate (CER) measures the percentage of characters incorrectly recognised. Lower is better. This model scored 2.8% on its validation set. As a rule of thumb, a CER below 10% is considered good for most handwritten material. This is a smaller, specialised model. It may achieve a very low CER on material similar to its training data, but could be less robust on unfamiliar handwriting or layouts.

Measured on the model's own validation data. Results on your documents may differ depending on handwriting style, document condition, language, and how closely your material resembles the training data.
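To make the CER figure concrete, here is a minimal sketch of how a character error rate is commonly computed: the Levenshtein (edit) distance between the reference transcription and the recognised text, divided by the length of the reference. This is an illustration under that assumption, not necessarily the exact procedure Transkribus or PyLaia uses. The function names `levenshtein` and `cer` and the sample strings are hypothetical.

```python
def levenshtein(ref: str, hyp: str) -> int:
    """Minimum number of character insertions, deletions and
    substitutions needed to turn `hyp` into `ref`."""
    # prev[j] holds the distance between the reference prefix processed
    # so far and the first j characters of the hypothesis.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # character missing from hyp
                            curr[j - 1] + 1,      # extra character in hyp
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def cer(ref: str, hyp: str) -> float:
    """Character error rate: edit distance divided by reference length."""
    if not ref:
        raise ValueError("reference transcription must not be empty")
    return levenshtein(ref, hyp) / len(ref)


# Hypothetical example: the long s (ſ) of Bohorič orthography misread as 'f'.
reference = "Kerſzhanske"
recognised = "Kerfzhanske"
print(f"{cer(reference, recognised):.1%}")  # one error in 11 characters -> 9.1%
```

Under this definition, a CER of 2.8% corresponds to roughly three character errors per hundred characters of reference text.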

Words: 25,322
Lines: 2,119
Training Pages: 43
Model ID: 51128
Languages: Latin, Slovenian
Centuries: 18th c.