jmgronski · PyLaia · Published July 11, 2025

Mame Loshn Maven

Text Recognition

Description

Mame Loshn Maven is the all-purpose Yiddish handwriting model. It merges the training sets of Civil Records and Letter Readers to deliver robust performance across a variety of genres and dialects. Use it when you don’t know what you’ll encounter, or as a springboard to create your own specialist models.

Ideal for: Anyone who deals with handwritten multi- documents on diverse topics that require a broad vocabulary.

Credits

This work was made possible by the L’Dor V’Dor AI Lab Yiddish team with the generous support of the American Jewish Joint Distribution Committee (JDC), LitvakSIG, YIVO, and numerous individual volunteers who contributed documents and transcriptions. This model used the Dybbuk for Yiddish Handwriting model, developed by Sinai Rusinek and her team, as its base.

For more information, please visit: https://ldvdf.org

Low error rate: 7.68% CER

Character Error Rate (CER) measures the percentage of characters incorrectly recognised. Lower is better. This model scored 7.68% on its validation set. As a rule of thumb, a CER below 10% is considered good for most handwritten material. This is a larger model trained on diverse material, which generally makes it more robust across different handwriting styles. That said, larger training sets also make it harder to push the CER down further.

Measured on the model's own validation data. Results on your documents may differ depending on handwriting style, document condition, language, and how closely your material resembles the training data.
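To make the CER figure concrete, here is a minimal Python sketch of the standard edit-distance definition: the number of character insertions, deletions, and substitutions needed to turn the recognised text into the reference transcription, divided by the reference length. The function names `levenshtein` and `cer` are illustrative, and the sketch assumes no text normalisation; the page does not state how Transkribus handles whitespace, punctuation, or diacritics when computing its score, so results from this sketch may not match the platform's figure exactly.

```python
def levenshtein(ref: str, hyp: str) -> int:
    """Minimum number of single-character edits turning hyp into ref."""
    # prev[j] holds the distance between the processed prefix of ref and hyp[:j].
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(
                prev[j] + 1,         # character of ref missing from hyp
                curr[j - 1] + 1,     # extra character in hyp
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1]


def cer(ref: str, hyp: str) -> float:
    """Character Error Rate in percent, relative to the reference length."""
    if not ref:
        raise ValueError("reference transcription must not be empty")
    return 100.0 * levenshtein(ref, hyp) / len(ref)


if __name__ == "__main__":
    reference = "מאמע לשון"   # ground-truth transcription (hypothetical sample)
    recognised = "מאמע לשין"  # model output with one wrong letter
    print(f"CER: {cer(reference, recognised):.2f}%")
```

Running the same calculation on a few pages of your own material that you have transcribed by hand gives a more realistic estimate than the validation score reported here.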

Words: 139,504
Lines: 33,487
Training Pages: 867
Model ID: 371445
Languages: Yiddish