+ General model for "Fraktur" released

Thanks to the Library Labs of the Austrian National Library and the NewsEye project we are happy to announce the release of a free model which is capable to read German Fraktur documents especially from the 19th and 20th century in a convincing quality outperforming most standard OCR engines. The model is based on training data coming from the ANNO collection of the Austrian National Library and comprises 442.141 words. It shows a CER of 1,55% on the training set and 1,65% on the test set without any dictionary support. Note: the model is trained on German language documents. It will provide less convincing results for other languages, such as Swedish or Finnish Fraktur. However models for these languages are also in preparation and may be released in the coming months. The Fraktur model is available for every registered user in Transkribus and called: ONB _Newseye_GT_M1+. Have fun!

Related Articles

+ Printed vs. handwritten text lines - automatically separated

+ Printed vs. handwritten text lines - automatically separated

The Transkribus team collaborates with the Pattern Recognition team of the University Erlangen-Nürnberg (also member of READ-COOP SCE) and the collegues were so great to make an interesting...

+ Paper on Transkribus and handwritten text recognition (HTR) in archives now open access

+ Paper on Transkribus and handwritten text recognition (HTR) in archives now open access

A general paper about Transkribus was published in the Journal of Documentation. Transforming scholarship in the archives through handwritten text recognition gives an overview of the current use of...

+ Digitisation blog of the University Archive Greifswald

+ Digitisation blog of the University Archive Greifswald

Dr. Dirk Alvermann of the University Archive Greifswald is one of the pioneers of Transkribus. He already started working with the first version of Transkribus in 2015. Now, he received a grant from...