For Researchers & DH Projects

One platform. Full control. From scan to publication.

Transkribus gives you a complete, transparent pipeline for historical documents: text recognition, entity tagging, structured data extraction, and publication. Every step controllable, every result reproducible.

Text Recognition Entity Tagging Field Models Table Extraction Digital Editions REST API

Start for free See the pipeline

50 free credits every month · No credit card required

95%+Accuracy on trained models

300+Public models to choose from

100+Languages & scripts supported

250+Universities & archives as co-owners

Used by researchers at thousands of institutions

Text Recognition & AI Training

Get accurate transcriptions of handwritten and printed documents. Use a public model or train your own on your specific script, language, and hand — then retrain and improve as you go.

Train custom text recognition models on your corpus

Train layout models for complex page structures

300+ public models for common scripts and periods

Compare model versions, track accuracy improvements

For advanced workflows

Integrate Transkribus into your research infrastructure

Use the Transkribus API to run recognition, training, and extraction programmatically. Pipe results directly into your databases, analysis tools, or publication pipelines.

REST API with full model access

Batch-process thousands of pages unattended

JSON/XML output for any downstream pipeline

Explore the API

{
  "processId": 47725,
  "status": "FINISHED",
  "pages": 1,
  "content": {
    "text": "Dear Sir, I hereby confirm\nthe delivery of 200 units.",
    "regions": [
      {
        "id": "region_1",
        "type": "paragraph",
        "lines": [
          { "text": "Dear Sir, I hereby confirm" },
          { "text": "the delivery of 200 units." }
        ]
      }
    ]
  }
}

Built for large-scale research

Run projects at scale without thinking about IT

Transkribus handles the infrastructure so you can focus on research. Manage thousands of pages, coordinate teams across institutions, produce training data systematically, and tailor models to your specific corpora.

Manage large collections and coordinate distributed teams with role-based access

Systematically produce and curate training data across your corpus

Train, compare, and refine models tailored to your specific scripts and hands

Zero infrastructure overhead — runs in the browser, hosted and maintained for you

Learn about collaboration

Specialist guide

Medieval manuscript transcription

Working with Gothic textura, Caroline minuscule, Beneventan, or Insular scripts? Our dedicated guide covers the specific challenges of medieval manuscripts — abbreviation systems, ligatures, multi-layered text — and how to train custom models for your corpus.

Medieval manuscript guide

Learn the fundamentals

What is handwritten text recognition?

New to HTR? Understand how deep learning reads historical handwriting, how it differs from OCR, and what makes Transkribus different — with side-by-side comparisons, coverage details, and FAQ.

What is HTR?

Specialist guide

Early modern handwriting recognition (1500–1800)

Secretary hand, chancery scripts, Bastarda, humanist italic — the early modern period produced an enormous diversity of scripts across Europe. Our dedicated guide covers what models exist, how to train custom ones, and what accuracy to expect for this period.

Early modern handwriting guide

Measure your results

Character Error Rate (CER) explained

CER is the standard metric for evaluating HTR accuracy — and the number your grant reviewers will ask about. Understand how it's calculated, what benchmarks to expect for different document types, and how to report it in your methodology.

CER explained

For grant writing

How to include HTR in your grant proposal

A practical guide to structuring the HTR methodology section, estimating costs, planning timelines, and justifying AI-assisted transcription to reviewers at DFG, ERC, NEH, AHRC, and other funders.

Grant proposal methodology guide

Non-Latin scripts

Hebrew manuscript transcription

From medieval Geniza fragments to modern responsa — Transkribus supports Hebrew manuscripts across centuries and scribal traditions. Train custom models for Ashkenazi, Sephardi, or Mizrachi hands.

Hebrew manuscripts guide

Community engagement

Crowdsourcing transcription with AI

Combine AI pre-transcription with volunteer correction. Transkribus reads the handwriting first — your volunteers verify, correct, and enrich. Faster results, better engagement, higher quality than transcribing from scratch.

Crowdsourcing guide

Crowdsourced transcription with AI support

Trusted by researchers worldwide.

From national archives to university departments, see how researchers use Transkribus to unlock their collections.

How the Hanse.Quelle.Lesen! project made Hanseatic records accessible through Citizen Science

“The Hanse.Quelle.Lesen! project is a collaboration between the Research Centre for Hanse and Baltic History (FGHO), based at the European Hansemuseum Lübeck, and the Archive of the Hanseatic City of...”

ResearchMuseums

Read full story

Strategic AI integration: How Archion implemented the Transkribus API

“Archion is a major digital portal for family history research, providing online access to more than 200,000 church books with more than 32 million images from more than 25 German archives. These...”

ArchivesGermany

Read full story

Enevældens Nyheder Online: An award-winning project to create digital versions of historical newspapers

“If you wanted to study social control under absolutist rule, there are many historical sources that could be of interest. Administrative records, land registers, and royal decrees are just some of...”

ResearchDanish

Read full story

Sustainable efficiency: How the University of Georgia transcribed 20,000 pages in two months

“The Finding Their Names: Discovery and Description of Enslavement Events project is a major initiative by the Hargrett Rare Book and Manuscript Library at the University of Georgia. Funded by a...”

UniversitiesEnglish

Read full story

AI Made for German: Unlocking German-language archives with Transkribus

“Most AI transcription tools are designed with English-language material as their default setting. They excel at modern printed text and increasingly handle contemporary handwriting, but historical...”

ResearchGerman

Read full story

Unlocking the secrets of the New Spain Fleets with Patricia Murrieta-Flores

“Historians of colonial Latin America don’t suffer from a lack of primary sources. Across archives in Europe and the Americas lie millions of pages documenting the colonial maritime routes known as...”

ResearchSpanish

Read full story

How the University of Helsinki teaches Transkribus to students

“Text recognition and AI are quickly becoming essential tools for historians, offering powerful ways to study, process, and access huge collections of archival material. To make sure the next...”

UniversitiesSwedish

Read full story

How the German Archives for Diaries preserve personal history for future generations

“The German Archives for Diaries (Deutsches Tagebucharchiv, DTA) in Emmendingen has one core mission: Collecting and archiving autobiographical records and making them accessible for academic and...”

GermanArchives

Read full story

Unlocking Sakya texts: Creating a workflow for cataloguing Tibetan manuscripts

“Tibetan is one of the world’s major literary languages, with vast collections of philosophical, religious, historical, grammatical, and medical texts written in the language. It is also the language...”

TibetanBaselines model

Read full story

Charting new waters: How 3 projects opened up maritime archives with AI transcription

“Maritime history is a field that explores humanity's relationship with the oceans, seas, and waterways of the world. While ships and naval battles are the focus of some maritime historians, others...”

SpanishDutch

Read full story

View all success stories

Start building your research pipeline today

Join thousands of researchers who use Transkribus to turn historical documents into structured, publishable data.

Start for free Talk to us about your project

50 free credits every month · No credit card required

200M+Pages processed

500K+Users worldwide

95%+Accuracy on trained models

One platform. Full control. From scan to publication.

A controllable pipeline for every research question

Text Recognition & AI Training

Integrate Transkribus into your research infrastructure

Run projects at scale without thinking about IT

Medieval manuscript transcription

What is handwritten text recognition?

Early modern handwriting recognition (1500–1800)

Character Error Rate (CER) explained

How to include HTR in your grant proposal

Hebrew manuscript transcription

Crowdsourcing transcription with AI

Trusted by researchers worldwide.

How the Hanse.Quelle.Lesen! project made Hanseatic records accessible through Citizen Science

Strategic AI integration: How Archion implemented the Transkribus API

Enevældens Nyheder Online: An award-winning project to create digital versions of historical newspapers

Sustainable efficiency: How the University of Georgia transcribed 20,000 pages in two months

AI Made for German: Unlocking German-language archives with Transkribus

Unlocking the secrets of the New Spain Fleets with Patricia Murrieta-Flores

How the University of Helsinki teaches Transkribus to students

How the German Archives for Diaries preserve personal history for future generations

Unlocking Sakya texts: Creating a workflow for cataloguing Tibetan manuscripts

Charting new waters: How 3 projects opened up maritime archives with AI transcription

Built on trust, powered by community.

Your data stays yours

Hosted in Europe

Cooperative, not a startup

Start building your research pipeline today