johanheinsen · PyLaia · Published November 21, 2022

18th Century Danish 1.03

Text Recognition

Description

This model was trained by associate professor Johan Heinsen, Aalborg University. It was trained on handwriting from various military court records as well as court cases relating to inmates in various convict labour institutions in the eighteenth century. An example of the material can be found here: https://ao.sa.dk/ao/data.ashx?bid=80654798 Because the material deals mostly with soldiers and male convicts, the model has difficulty with pronouns. It also struggles dealing with hands that are highly slanted or overly compact.

Try this model

Use this modelOpen in Transkribus
Low error rate6.9% CER

Character Error Rate (CER) measures the percentage of characters incorrectly recognised. Lower is better. This model scored 6.9% on its validation set. As a rule of thumb, a CER below 10% is considered good for most handwritten material. This is a larger model trained on diverse material, which generally makes it more robust across different handwriting styles. That said, larger training sets also make it harder to push the CER down further.

Measured on the model's own validation data. Results on your documents may differ depending on handwriting style, document condition, language, and how closely your material resembles the training data.

Words293,449
Lines50,982
Training Pages952
Model ID47906
Centuries
17th c.18th c.