Tsvetana Dimitrova
Andrej Boyadzhiev
Electronic Edition and Linguistic Annotation of Slavic Fragments
-
Summary/Abstract
The paper introduces a project on edition and linguistic annotation of Medieval and Early Modern South Slavic manuscript fragments. The main topic is implementation of various approaches on integration of electronic edtion, manuscript description and linguistic annotation. A corpus will include fragments from parchment manuscripts kept in Bulgarian repositories. We will illustrate the approach with several pieces of texts from various fragments. The representation will be supplied with textual, as well as part-of-speech and basic syntactic annotation. On the basis of it an attempt will be made at experimental anaphora and related morpho-syntactic annotation. The work will offer a discussion on the features that will be useful for such annotation. The project relies on eXist database (http://exist-db.org) and the initiatives: Repertorium (http://repertorium.obdurodon.org/), PROIEL (http://www.hf.uio.no/ifikk/english/ research/ projects/proiel/) and TOROT (http://site.uit.no/slavhistcorp/files/2015/04/Eckhoff.pdf).
Subject: Language studies Language and Literature Studies Theoretical Linguistics Applied Linguistics Studies of Literature Computational linguistics South Slavic Languages Philology South Slavic manuscripts Fragments Linguistic annotation Linguistic corpora Electronic text edition Electronic description XML technologies