Transcription
Assistant Project @ E-Scripts
This is a Java Application connected with the
Emergent Transcriptions System, which represents an attempt at streamlining the
process of “unearthing” the largely-untapped goldmine of historical knowledge
contained in handwritten manuscripts stored in the archives all around the
world. The basic idea is to make it
possible for scholars to easily access the historical information contained in
ancient manuscripts by instituting a virtuous cycle of automatically-accruing
and ever-improving transcriptions of these ancient records.
Please, read the paper entitled Making
History: an Emergent System for the Systematic Accrual of Transcriptions of
Historic Manuscripts for more detailed information.
The Transcription
Assistant v.0.2
The
Transcription Assistant™ (TA) is an open-source Java application that runs on
an end-user machine through the Java Virtual Machine supported by the Java
Application Interface. With the TA,
scholars will be able to create a project for each paper or research
topic. A project will include several
manuscript pages from a variety of collections that together contribute to the
development of an historical paper on a specific subject matter.
For each manuscript used in a project, after an XPG image
or an MML transcription is downloaded, the transcriber will use our Transcription Assistant to transcribe all or part of the
document. Once the transcription is
done, transcribers are invited to submit their transcriptions back to the
archive so they can be made available to other transcribers. Specific incentives are used to make this
sharing inviting to the user.
The TA is designed to greatly facilitate the process of
transcription. It consists of a main
screen split into two windows (vertically or horizontally, according to the
user’s preference). On one window is
loaded the manuscript image and on the other is visualized the transcription,
either as a positionally accurate print preview, or
as word-processable text, or in MML format (see tabs
at bottom left of Figure).
The first step the TA takes, as soon as a new XPG image
is loaded, is to automatically detect word boundaries in the manuscript image
and create boxes around each word. The “autoboxdraw” function uses a sophisticated smearing
algorithm, developed by one of our past projects [4]. The automatic boxing thresholds and settings
are user-adjustable to fit different document conditions. Boxing is quite successful in the current
version of the TA application, though a certain amount of manual adjustment is
expected to be always necessary.
Fortunately, the process of word-boxing needs to only be done once, so
the time expenditure is worth the effort.
We are currently working on making the manual act of box correction a rewardable, emergent feature of our system as well.
Once each word (or abbreviation) has been boxed, the user
can begin the actual process of transcription.
Un-transcribed boxes are red. The
current box (in black in Figure 1) can be clicked on, to reveal a text field
right above it where the transcription can be typed. To emulate the functions of a word processor,
the user can move to the next box by simply hitting the space bar in between
boxes. Once a transcription has been
typed in, its box will turn green and its translation will be entered into the
transcription MML and will appear in its exact relative location in the preview
window (below the image in Figure above).
The current version of TA allows the user to
“right-click” on a box to annotate the transcription. A primary form of annotation has to do with
differentiating graphics or symbols from text.
If a box is tagged as an “image” or “symbol” a cropped piece of the
manuscript will be copied into the preview window (and into the underlying MML)
as shown in Figure 1 (lower left of preview window). Currently we provide the
following other types of annotations for text boxes: (i)
manuscript annotations such as for stricken text or corrected text; (ii)
tagging of abbreviations; (iii) identification of numbers and (iv) identification
of handwriting changes (different author).
We foresee adding more of these annotations – such as the tagging of
currency and marginalia – as well as second-order tags to identify proper
names, names of places, professions, dates and the like.
The one-to-one pairing of
word-boxes-to-text-transcriptions is broken only by abbreviations, wherein a
single word box can be exploded into more than one transcription word. In any case, this pervasive one-to-one correspondence
allows the accrual of handwriting-recognition capabilities, which are planned
for future versions of the software. We
foresee that when users will experience difficulties in transcribing a specific
word, they will be able to ask for help by hitting a help key (such as F1).
Using the manuscript metadata for a bounded search –
limited to manuscripts that are likely to have the “same hand” – the system
will be able to pattern-match the handwriting in the box where the user is
having trouble, with a storehouse of boxes from previously completed
transcriptions from the same source, yielding transcription suggestions ranked
by their different levels of matching.
The user will thus be able to pick the suggestion that best fits the
sentence being transcribed. We foresee
making this advanced capability available “for a fee” in order to fuel our
incentive program. We want to entice
users to submit completed transcriptions in order to get the credit they need,
so they can later spend their banked credits to “buy” services like this “transcription
help”.
Once a transcription has been completed, the user can
save it and/or upload it back to the originating archive server for credit accounting. After several manuscripts have been worked
on, the user can also save the project and wrap things up for the day.
More
technical details about the TA will be provided in a forthcoming paper.
FABIO
CARRERA [Home,
Bio, News, Teaching, Research, Publications, Grants, Service, Contact]