The problem
The Hidden Collections Crisis: Archive Digitization Backlogs Keep Growing

The solution
Reduce Archival Backlog with AI: From Unprocessed Boxes to Searchable Records

How to process an archival collection in 4 steps
Upload scanned collections
Upload entire series or fonds as multi-page PDFs, TIFFs, or image batches. Transkribus automatically detects layout, including columns, tables, and marginalia.
Select an AI model
Choose from 300+ public models filtered by language, century, and script type. For mixed collections, run multiple models on different document groups within the same project.
Run batch recognition
Queue thousands of pages for processing. Transkribus runs text recognition in the background, with no manual intervention required. Monitor progress from the dashboard.
Export and integrate
Export results as PAGE XML, ALTO XML, TEI-XML, plain text, or searchable PDF. Ingest directly into ArchivesSpace or AtoM, or publish via Transkribus Sites.
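For a mixed collection, the model-selection step can be planned before anything is queued. The sketch below is only an illustration: it groups scanned documents by language, century, and script so that each group can be run with a different model. The `Document` fields, the `CATALOG` mapping, and the model names are hypothetical placeholders, not real Transkribus identifiers; in practice each model would be picked from the public model list.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Document:
    path: str
    language: str
    century: int
    script: str  # e.g. "handwritten" or "print"


# Hypothetical catalog: (language, century, script) -> placeholder model name.
# These are NOT real Transkribus model identifiers.
CATALOG = {
    ("german", 18, "handwritten"): "example-german-kurrent",
    ("english", 19, "handwritten"): "example-english-hand",
    ("english", 19, "print"): "example-english-print",
}


def group_by_model(documents, catalog=CATALOG):
    """Return ({model_name: [paths]}, [paths with no matching model])."""
    groups = defaultdict(list)
    unmatched = []
    for doc in documents:
        model = catalog.get((doc.language, doc.century, doc.script))
        if model is None:
            # No suitable model in the catalog: leave for a manual decision.
            unmatched.append(doc.path)
        else:
            groups[model].append(doc.path)
    return dict(groups), unmatched


if __name__ == "__main__":
    docs = [
        Document("series1/box01.pdf", "german", 18, "handwritten"),
        Document("series2/box07.tif", "english", 19, "print"),
        Document("series3/box02.tif", "latin", 15, "handwritten"),
    ]
    groups, unmatched = group_by_model(docs)
    for model, paths in groups.items():
        print(model, paths)
    print("needs manual model choice:", unmatched)
```

Documents that match no catalog entry are returned separately rather than silently assigned a default, so they can be reviewed before recognition is run.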
At scale
Automated Archival Processing with the Metagrapho API
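Automating a backlog over an API usually follows a submit-then-poll pattern. The sketch below shows that pattern only; it does not reproduce the Metagrapho API itself. The `submit` and `get_status` callables, and the status strings `"FINISHED"` and `"FAILED"`, are assumptions standing in for whatever client code and status values the API documentation actually specifies.

```python
import time


def process_batch(pages, submit, get_status, poll_seconds=5.0, sleep=time.sleep):
    """Submit every page, then poll until each job has finished or failed.

    `submit(page)` must return a job id and `get_status(job_id)` a status
    string. Both are hypothetical hooks supplied by the caller.
    """
    jobs = {submit(page): page for page in pages}  # job id -> page
    pending = set(jobs)
    results = {}
    while pending:
        for job_id in list(pending):
            status = get_status(job_id)
            if status in ("FINISHED", "FAILED"):  # assumed terminal states
                results[jobs[job_id]] = status
                pending.discard(job_id)
        if pending:
            # Wait before the next polling round to avoid hammering the service.
            sleep(poll_seconds)
    return results
```

A production version would also want a timeout and retry handling for failed pages; both are omitted here to keep the pattern visible.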

Frequently Asked Questions
Ready to address your archival backlog?
Speak with our team about institutional plans for large-scale collection processing, or create a free account to evaluate Transkribus on your own materials.
Used by 2,000+ archives and libraries worldwide