Badische Landesbibliothek · PyLaia · Published March 25, 2024

Latin/German Bilingual Incunabula (Reichenau)

Text Recognition

Description

This model is trained to recognize the Gothic typefaces found in Latin/German bilingual incunabula and early prints. It was developed by the project “Digitalisierung und Volltexterkennung der ehemals Reichenauer Inkunabeln” at the Badische Landesbibliothek, which was funded by the Stiftung Kulturgut Baden-Württemberg. The Ground Truth used to train and evaluate this model is based on a collection of incunabula and post-incunabula of the former Reichenau monastery, now held at the Badische Landesbibliothek in Karlsruhe. In addition to excerpts from truly bilingual incunabula, the set also contains some monolingual material to improve model performance. The transcription of the Ground Truth followed the guidelines documented at https://doi.org/10.57962/regionalia-22875 and uses a range of Unicode characters to represent Latin abbreviations. In training, the Transkribus Print M1 model was used as a base model. This model was created by the Badische Landesbibliothek and is published under the CC-BY-SA 4.0 license.

Very low error rate: 0.5% CER

Character Error Rate (CER) measures the percentage of characters incorrectly recognised. Lower is better. This model scored 0.5% on its validation set. As a rule of thumb, a CER below 10% is considered good for most handwritten material.
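To make the metric concrete, the following is a minimal sketch of how a CER is commonly computed: the Levenshtein (edit) distance between the reference transcription and the recognised text, divided by the length of the reference. This page does not state how Transkribus calculates its score, so the definition used here is an assumption, and the function names `levenshtein` and `cer` are illustrative only.

```python
def levenshtein(ref: str, hyp: str) -> int:
    """Minimum number of single-character insertions, deletions,
    and substitutions needed to turn ref into hyp."""
    # prev[j] holds the distance between the processed prefix of ref
    # and the first j characters of hyp.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # delete r from ref
                curr[j - 1] + 1,     # insert h into ref
                prev[j - 1] + cost,  # substitute r with h (or match)
            ))
        prev = curr
    return prev[-1]


def cer(ref: str, hyp: str) -> float:
    """Character Error Rate as a fraction of the reference length.

    Assumption: characters are Python code points, so a base letter
    followed by a combining abbreviation mark counts as two characters.
    """
    if not ref:
        raise ValueError("reference must not be empty")
    return levenshtein(ref, hyp) / len(ref)


if __name__ == "__main__":
    # One wrong character in a four-character reference.
    print(f"{cer('abcd', 'abxd'):.2%}")
```

Under this definition, a CER of 0.5% corresponds to roughly one character error per 200 reference characters.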

Measured on the model's own validation data. Results on your documents may differ depending on handwriting style, document condition, language, and how closely your material resembles the training data.

Words: 113,794
Lines: 16,709
Training Pages: 354
Model ID: 61316
Languages: German, Latin
Centuries: 15th c., 16th c.