Founding offer · lifetime membership for a single £24, exclusive to our first members · closes 20 June Claim your place →
Global Research Partnerships £24 Lifetime Log inCreate free account

Funded Projects › FP7

tranScriptorium · tranScriptorium

FP7Status: CLOSED1 January 201331 December 2015EU funding €2,399,739

Huge amounts of handwritten historical documents are being published by on-line digital libraries world wide. However, for these raw digital images to be really useful, they need be annotated with informative content. The tranScriptorium project aims to develop innovative, efficient and cost-effective solutions for the indexing, search and full transcription of historical handwritten document images, using modern, holistic Handwritten Text Recognition (HTR) technology. For typical handwritten text images of historical documents, currently available text image recognition technologies are not suitable. Traditional Optical Character Recognition (OCR) is simply not usable since characters can not be isolated automatically in these images. Therefore, holistic, segmentation-free HTR techniques, often borrowed from the field of Automatic Speech Recognition are needed. Yet, state-of-the-art holistic HTR approaches still lack the required accuracy, mainly due to the usual poor quality, degradations and writing style variability of historical document images. To cope with this lack of recognition accuracy for handwritten text images of historical documents, three actions are planned in tranScriptorium: i) improve basic image preprocessing and holistic HTR techniques; ii) develop novel indexing and keyword searching approaches, mainly based on byproducts of holistic HTR decoding and word spotting techniques; and iii) capitalize on new, user-friendly interactive-predictive HTR approaches for computer-assisted operation, which minimize the user intervention needed to achieve full, high quality transcripts. HTR tools based on tranScriptorium techniques will be incorporated into HTR web platforms that will be accessible to users through two different means: i) a content provider portal that provides access to handwritten historical documents for casual, individual researchers; and b) a specialized HTR web portal for structured crowd-sourcing transcription projects.

Consortium · 6 organisations

coordinator

UNIVERSITAT POLITECNICA DE VALENCIA

ES · €513,836

participant

UNIVERSITY OF LONDON

UK · €214,900

participant

NATIONAL CENTER FOR SCIENTIFIC RESEARCH ""DEMOKRITOS""""

EL · €513,812

participant

UNIVERSITY COLLEGE LONDON

€294,451

participant

UNIVERSITAET INNSBRUCK

AT · €369,700

participant

STICHTING INSTITUUT VOOR DE NEDERLANDSE TAAL

NL · €493,040

Research fields

View the official record on CORDIS →

← Find collaborators and more funded projects

Source: CORDIS, Publications Office of the European Union. Global Research Partnerships surfaces open EU research data to help you find collaborators; we are not affiliated with the European Union.