+ What's that written in the margin? Handwritten Text Recognition, Marginalia and John Stuart Mill

Some people are horrified by the thought of writing notes on the pages of books. But for the English philosopher John Stuart Mill (1806–1873), marginal notes were a useful way to record his thoughts and observations as he read.

Mill’s collection of books is now held by Somerville College at the University of Oxford. The John Stuart Mill Collection comprises more than 1,500 books once owned by Mill, many of which contain his annotations and markings.

The John Stuart Mill Collection, Somerville College, University of Oxford [Image by Louise Seaward]

Somerville College, in collaboration with the University of Alabama, is currently undertaking a project to digitise and categorise these marginalia. The partners have now begun working with Transkribus, with a view to applying Handwritten Text Recognition to Mill’s scribblings.

READ partners from Xerox Research Centre Europe and the Computer Vision Lab at Vienna Technical University are working with hundreds of images from the Mill collection. They aim to use Document Understanding to distinguish between the printed and handwritten text on the pages of these books, and Handwritten Text Recognition to transcribe the comments that Mill wrote in the margins. Transcripts of the Mill marginalia would be an invaluable resource for scholars and would complement the forthcoming Mill Marginalia database.

This is an exciting experiment for the READ project, as its methods and results could be applied to other collections where marginal annotations appear on printed texts. Many other writers, including Oscar Wilde and Mark Twain, were habitual annotators, and technology from the READ project could help us to understand how they read, processed and understood books and articles.
