Badische Landesbibliothek · PyLaia · Published March 24, 2024

German Incunabula (Reichenau)

Text Recognition

Description

This model is trained to recognize the Gothic typefaces found in German incunabula and early prints. It was developed by the project “Digitalisierung und Volltexerkennung der ehemals Reichenauer Inkunabeln” at the Badische Landesbibliothek, which was funded by the Stiftung Kulturgut Baden-Württemberg. The Ground Truth used to train and evaluate this model is based on the collection of incunabula and post-incunabula of the former Reichenau monastery, now held at the Badische Landesbibliothek in Karlsruhe. The transcription of the Ground Truth followed the guidelines documented at https://doi.org/10.57962/regionalia-22875 and uses a range of Unicode characters to represent Latin abbreviations. In training, the Transkribus Print M1 model was used as a base model. This model was created by the Badische Landesbibliothek and is published under the CC-BY-SA 4.0 license.

Try this model

German Incunabula (Reichenau)
Use this modelOpen in Transkribus
Very low error rate0.4% CER

Character Error Rate (CER) measures the percentage of characters incorrectly recognised. Lower is better. This model scored 0.4% on its validation set. As a rule of thumb, a CER below 10% is considered good for most handwritten material.

Measured on the model's own validation data. Results on your documents may differ depending on handwriting style, document condition, language, and how closely your material resembles the training data.

Words60,559
Lines8,696
Training Pages185
Model ID61285
Languages
German
Centuries
15th c.16th c.