Kerstin Manninger · Fields · Published August 29, 2024

Page Layout of printed books (around 1800)

Field ExtractionScholar+

Description

This model recognizes the elements of printed pages of books, such as the main text, page numbers, catch words, signature marks as well as footnotes and running heads. It was trained on books, that were published between 1775 and 1810, mainly consisting of German novels.
Page Layout of printed books (around 1800)
Open in Transkribus
High precision82.18% MaP

Mean Average Precision (MaP) measures how accurately the model detects field regions (higher is better). This model scored 82.18% on its validation set. MaP is harder to compare across models than CER, because the score depends heavily on how many distinct region types the model must distinguish. A model detecting a handful of simple fields will naturally score higher than one trained to recognise many fine-grained regions, even if both perform well in practice.

This score reflects performance on the model's own validation data. Your results will depend on how closely your documents match the training material and the complexity of the structures you need to detect.

Words183,067
Lines43,326
Training Pages9,616
Model ID163317