Tag Archives: xml

Digital Humanities Workshop


Data visualisation of the charters of the Vicars Choral, from the ChartEx project

In September the project held a workshop bringing together three Digital Humanities projects: ChartEx, Traces Through Time and READ – the Research Environment for Ancient Documents. READ  is a software framework for the study of ancient texts and their supports. It is being designed to support editorial, paleographic and lexicographic work on documents from any part of the ancient world, with an initial focus on South Asian manuscripts and inscriptions. READ is being used by researchers in Munich working on Buddhist texts from Gandhara. Stefam Baums, lead researcher on the Gandhari project, and Stephen White, who is responsible for programming READ, talked about how the software framework is being developed as a collaborative process involving feedback between the software designer and scholars working on the manuscripts.

Roger Evans, from the University of Brighton, talked about his work behind interfaces for exploring and visualising digital records of old documents. In the ChartEx project, he used natural language processing to identify relationships between people and places appearing in medieval charters from the 12th to 16th centuries. He showed how the mapping of events recorded in the charters can then be used for visualisations of connections between people and places at specific points in history. In a similar way, on a larger scale, the Traces Through Time project at the National Archives is creating a system that will link related records from across the archives by identifying individuals and the records associated with them.

Indic epigraphy in the digital era

backgrd img

At the November seminar project researcher Dániel Balogh introduced the epigraphic database that is to be one of the outputs of the project and talked about the possibilities of exploiting information technology for such purposes. The preparation and web-based publication of searchable electronic versions of any text including epigraphic texts is in itself a tremendous aid for all scholars. In addition to the diplomatic edition of an inscription, an electronic version can contain practically unlimited meta-information. This may pertain to the inscription itself (legibility issues; palaeographic considerations), to the text as an abstract entity (clearly trackable editorial alterations; metrical, semantic and syntactic structure), and to the inscribed object as a physical entity (descriptive data; history).

The EpiDoc standard is a TEI-XML system for encoding such properties of epigraphs in a way that ensures compatibility and allows complex querying of the available texts. This standard was developed with Latin and Greek inscriptions in mind and only a few pioneer projects have applied it to South(-East) Asian texts. Adapting it for Sanskrit and Indic scripts poses a few hurdles on account of the fusion of word boundaries (saṃdhi) and to the syllabic nature of such scripts, but these hurdles can be overcome. The presentation also looked briefly into other current technologies and software to aid epigraphic work.