correa.dale.j · PyLaia · Published August 16, 2025

Early 20th Century Arabic Periodicals v2.0

Text Recognition

Description

This model is intended for use on Arabic periodicals printed in the early 20th century that have multi-column text, images, and advertisements. It was created by a team of librarians and students using materials held at the University of Texas at Austin.

Low error rate: 7.87% CER

Character Error Rate (CER) measures the percentage of characters incorrectly recognised. Lower is better. This model scored 7.87% on its validation set. As a rule of thumb, a CER below 10% is considered good for most handwritten material.
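As an illustration of how a CER figure like the one above is commonly derived, here is a minimal sketch that divides the Levenshtein edit distance between a reference transcription and the recognised text by the length of the reference. It is an assumption that this matches how Transkribus computes its reported score; the function names `levenshtein` and `cer` are hypothetical, and characters are counted as Unicode code points, so Arabic diacritics would each count as a separate character.

```python
def levenshtein(ref: str, hyp: str) -> int:
    """Minimum number of single-character insertions, deletions,
    and substitutions needed to turn hyp into ref."""
    # prev[j] holds the distance between the first i-1 chars of ref
    # and the first j chars of hyp.
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # deletion
                curr[j - 1] + 1,     # insertion
                prev[j - 1] + cost,  # substitution or match
            ))
        prev = curr
    return prev[-1]


def cer(ref: str, hyp: str) -> float:
    """Character error rate as a fraction of the reference length."""
    if not ref:
        raise ValueError("reference text must not be empty")
    return levenshtein(ref, hyp) / len(ref)


# Hypothetical example: one missing letter in a four-letter word.
reference = "كتاب"
recognised = "كتب"
print(f"{cer(reference, recognised):.2%}")  # 25.00%
```

Because the denominator is the reference length, a CER computed this way can exceed 100% when the recognised text contains many spurious insertions.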

Measured on the model's own validation data. Results on your documents may differ depending on handwriting style, document condition, language, and how closely your material resembles the training data.

Words: 54,805
Lines: 7,360
Training Pages: 80
Model ID: 386877
Languages: Arabic
Centuries: 20th c., 21st c.