+ Finding patterns in eighteenth-century weddings - new blog from Xerox

+ Finding patterns in eighteenth-century weddings - new blog from Xerox

Xerox Research Centre Europe is one of the READ research partners, with responsibility for Document Understanding.  Document Understanding is a crucial part of the process of training computers to recognise historical documents, as Hervé Déjean from the Xerox team explains in this blog.

Document Understanding involves analysing the layout of a document in order to extract human understandable information about its content. Hervé’s blog presents a useful overview of the concept and offers specific details about how this method can be applied to historical documents.

Image from Passau Diocesan Archives

Hervé describes how he has been using Sequential Pattern Mining Techniques on eighteenth-century wedding registers provided by Passau Diocesan Archives, another partner in the READ project.  Document Understanding helps to ensure that we can group information from a document into a meaningful sequence – in this case, ensuring the right groom is matched with the right bride on the right day!

Related Articles

+ Meet the READ project partners - Sofia Ares Oliveira

+ Meet the READ project partners - Sofia Ares Oliveira

What’s your name? Sofia Ares Oliveira. Where do you work? Digital Humanities Laboratory at Ecole Polytechnique Fédérale de Lausanne (EPFL). Tell us a bit about your background… I studied Electrical...

+ A new model for Humanities research - collaboration with HumaReC

+ A new model for Humanities research - collaboration with HumaReC

HumaReC is a new research platform developed by the Swiss Institute for Bioinformatics. It is part of a project to investigate the digital production and publication of Humanities data using an...

+ Welcoming The British Library to the READ project network!

+ Welcoming The British Library to the READ project network!

We are very happy to welcome The British Library into the READ project network as a Memorandum of Understanding partner. The British Library collection is vast, containing more than 150 million items...