Nazar Kotsur · PyLaia · Published September 29, 2023

Printed Ukrainian 20th century

Text Recognition

Description

This is a general model created with the intention to be used for proofreading books of various orthographies on Ukrainian Wikisource. Main material for this model were books written in 20th century of various orthographies, but also includes some 19th and 21st century books. In the future it will be further updated and improved. The training and books transcription were done by Nazar Kotsur, a student of Ivan Franko National University of Lviv.

Try this model

Use this modelOpen in Transkribus
Very low error rate2.2% CER

Character Error Rate (CER) measures the percentage of characters incorrectly recognised. Lower is better. This model scored 2.2% on its validation set. As a rule of thumb, a CER below 10% is considered good for most handwritten material.

Measured on the model's own validation data. Results on your documents may differ depending on handwriting style, document condition, language, and how closely your material resembles the training data.

Words19,768
Lines2,995
Training Pages96
Model ID55365
Languages
Ukrainian
Centuries
19th c.20th c.21st c.