Swedish National Archives · PyLaia · Published December 5, 2022

Gothenburg_police_reports_1868-1902

Text Recognition

Description

This model is trained from reports from the Gothenburg Police Detective department 1868-1902, held at the Swedish National Archives in Gothenburg. The groundtruth for the model training consists of transcibed spreads from 1873, 1880, 1888, and 1896. Link to archive finding aid: https://sok.riksarkivet.se/arkiv/gj8w3gHtrH6cyG018W43t3 (material used to train model in series A II) The training of this model is part of a research and development project at the Swedish National Archives, in collaboration with GPS400: Centre for Collaborative Visual Research at the University of Gothenburg, and Vinnova: Sweden's innovation agency, as well as participants of the public though Citizen Science activities at the Regional State Archives in Gothenburg, where participants have transcribed most of the groundtruth spreads for training this model.

Try this model

Use this modelOpen in Transkribus
Very low error rate2.3% CER

Character Error Rate (CER) measures the percentage of characters incorrectly recognised. Lower is better. This model scored 2.3% on its validation set. As a rule of thumb, a CER below 10% is considered good for most handwritten material.

Measured on the model's own validation data. Results on your documents may differ depending on handwriting style, document condition, language, and how closely your material resembles the training data.

Words165,060
Lines27,932
Training Pages429
Model ID48511
Centuries
19th c.