yngvil.beyer · PyLaia · Published September 23, 2024

SamiskOCR_smi

Text Recognition

Description

This model is the print M1 base model (model-id 39995) fine-tuned on manually annotated Sámi data

Try this model

Use this modelOpen in Transkribus
Very low error rate1.49% CER

Character Error Rate (CER) measures the percentage of characters incorrectly recognised. Lower is better. This model scored 1.49% on its validation set. As a rule of thumb, a CER below 10% is considered good for most handwritten material.

Measured on the model's own validation data. Results on your documents may differ depending on handwriting style, document condition, language, and how closely your material resembles the training data.

Words25,573
Lines6,141
Training Pages58
Model ID181725
Languages
Sami Languages