evasanchezsalido · PyLaia · Published December 2, 2022

Diario de Madrid 1788-1825

Text Recognition

Description

Model trained within the proyect CLARA-HD at UNED, Spain. It has been trained with 18th and 19th century newspaper prints from "Diario de Madrid (1788-1825)". For more information or details please contact Eva Sánchez Salido at evasanchezsalido@gmail.com or Ana García Serrano at agarcia@lsi.uned.es. If you use this model, please cite: Menta, A., Sánchez-Salido, E., & García-Serrano, A. (2022). Transcripción de periódicos históricos: Aproximación CLARA-HD. Proceedings of the Annual Conference of the Spanish Association for Natural Language Processing 2022: Projects and Demonstrations (SEPLN-PD 2022).

Try this model

Use this modelOpen in Transkribus
Very low error rate1% CER

Character Error Rate (CER) measures the percentage of characters incorrectly recognised. Lower is better. This model scored 1% on its validation set. As a rule of thumb, a CER below 10% is considered good for most handwritten material.

Measured on the model's own validation data. Results on your documents may differ depending on handwriting style, document condition, language, and how closely your material resembles the training data.

Words91,640
Lines10,040
Training Pages193
Model ID48440
Languages
Castilian