P2PaLA Train Parameters
Training Parameters for the P2PaLA structure tool (under construction)
Structure types
These are the structure types that are tagged using Transkriubs on region level. Do not use whitespace in those structure types and be careful with case sensitivity, i.e. we recommend using only lowercase letters. Also we recommend to use dashes (-) and underscores (_) as the only special character, although other may work too.
Example:
paragraph heading footnote page-number
Merged structure types
Merged structure types are used to treat certain structure types the same as others during training (e.g. ‘footnote-continued’ or ‘footer’ like ‘footnote’). Expected is a list of the structure types, separated by a colon with the structure types to merge.
Example:
footnote:footnote-continued,footer heading:header
Here, regions tagged with ‘footnote-continue’ and ‘footer’ are regarded as ‘footnote’ while ‘header’ is regarded as ‘heading’ during training.
Training Mode
You can specify whether the model should be able to detect (text-)regions, baselines or both.
Related Articles

Can AI save bad scans?
The starting point for any kind of document digitization, whether done by hand or through sophisticated text recognition algorithms, is a good-quality image. Take a look at the one below. It is a...

Mapping Medieval Vienna: The digital edition of historical land registers supported by Transkribus
A central goal of the research project 'Mapping Medieval Vienna' is to make the Viennese land registers of the 15th century available to the public. This is because the land register entries contain...

Supporting Future Scholars: The Transkribus Scholarship Programme
Imagine you are a student who wants to dive into the personal story of one of the few famous child authors in history; or who wants to discover what made the authors of the Spanish Golden Age of...