cossio · PyLaia · Published February 13, 2023

15th Century Spanish Gothic Hybrid Script (model A)

Text Recognition

Description

Mid-15th century Spanish Gothic Hybrid script (gótica cursiva híbrida), dated in 1455. Presence of Aragonese linguistic influence. Based on Biblioteca Nacional de España ms. 3369 (http://bdh.bne.es/bnesearch/detalle/bdh0000024487). Semipaleographic transcription system by the Hispanic Seminary of Medieval Studies. Trained on 17386 words, 3084 lines (37 double column bifolios). CER on Train: 0.1%, CER on Validation: 3.31%. Developed for the Lucidarios Project (https://lucidarios.hypotheses.org/), as model Lucidario A (v. 0.4) by Mario Cossío Olavide (cossio@umn.edu).

Try this model

15th Century Spanish Gothic Hybrid Script (model A)
Use this modelOpen in Transkribus
Very low error rate3.3% CER

Character Error Rate (CER) measures the percentage of characters incorrectly recognised. Lower is better. This model scored 3.3% on its validation set. As a rule of thumb, a CER below 10% is considered good for most handwritten material. This is a smaller, specialised model. It may achieve a very low CER on material similar to its training data, but could be less robust on unfamiliar handwriting or layouts.

Measured on the model's own validation data. Results on your documents may differ depending on handwriting style, document condition, language, and how closely your material resembles the training data.

Words17,386
Lines3,084
Training Pages37
Model ID50029
Languages
Castilian