hbaudry · PyLaia · Published November 9, 2022

Latin Portuguese Print 17th century

Text Recognition

Description

This model is based on the Index of censorship printed by Pedro Craesbeeck, a key Lisbon printer of the early seventeenth century. It has been carried out by Hervé Baudry <hbaaudry@fcsh.unl.pt> as part of the research project “The relevance of book expurgation in the procedures of the Portuguese Inquisition (1536-1821): a systematic and individualized approach”, realised with the support of CHAM (NOVA FCSH/UAc) through the strategic project sponsored by the National Portuguese Agency for Research (FCT, UIDB/04666/2020).

Try this model

Use this modelOpen in Transkribus
Very low error rate1.5% CER

Character Error Rate (CER) measures the percentage of characters incorrectly recognised. Lower is better. This model scored 1.5% on its validation set. As a rule of thumb, a CER below 10% is considered good for most handwritten material. This is a smaller, specialised model. It may achieve a very low CER on material similar to its training data, but could be less robust on unfamiliar handwriting or layouts.

Measured on the model's own validation data. Results on your documents may differ depending on handwriting style, document condition, language, and how closely your material resembles the training data.

Words23,363
Lines4,127
Training Pages46
Model ID45992
Centuries
17th c.