Eastern Slavic Languages

New Developments in Tagging Pre-modern Orthodox Slavic Texts

  • Summary/Abstract

    Pre-modern Orthodox Slavic texts pose certain difficulties when it comes to part-of-speech and full morphological tagging. Orthographic and morphological heterogeneity makes it hard to apply resources that rely on normalized data, which is why previous attempts to train part-of-speech (POS) taggers for pre-modern Slavic often apply normalization routines. In the current paper, we further explore the normalization path; at the same time, we use the statistical CRF-tagger MarMoT and a newly developed neural network tagger that cope better with variation than previously applied rule-based or statistical taggers. Furthermore, we conduct transfer experiments to apply Modern Russian resources to pre-modern data. Our experiments show that while transfer experiments could not improve tagging performance significantly, state-of-the-art taggers reach between 90% and more than 95% tagging accuracy and thus approach the tagging accuracy of modern standard languages with rich morphology. Remarkably, these results are achieved without the need for normalization, which makes our research of practical relevance to the Paleoslavistic community.


The Earliest Slavonic Translation of the Song of Songs from Greek: A Possible Influence from the Vulgate?


Byzantines, Bulgarians and Serbs in the Vita of Saint Vladimir in the Gesta Regum Sclavorum


The Isaiah Code: Highlights in the History of a Catena in Slavic Tradition Scripta & e-Scripta vol. 16-17, 2017 floyd Wed, 07/12/2017 - 20:59

This study seeks to trace out the structure of the Book of Prophet Isaiah with commentaries and to explore what that structure reveals about the text in some manuscripts of the East Slavonic and South Slavonic traditions. There are three conclusions made as a result of the present study. Firstly, the analysis of the structure and the identification of the readings in Catena Slavonica in Isaiam shows a translation of a catena which occupies an intermediate position between the Catena in Isaiam by John Drungarios and the one by Andrew the Presbyter whichever is the earliest. The CSI resembles both. Secondly, the value of the CSI should not be underestimated, because it includes a translation of scholia by Theodulus whose work is now almost entirely lost. Therefore the CSI could provide new evidence for the content of the lost Byzantine original of Theodulus’ Commentary on Isaiah. Thirdly, the comparison of the numerals in the margin of РНБ F.I.461 with the sequence and number of the biblical pericopes and relevant scholia in the Russian manuscripts clearly and unequivocally demonstrates that although F.I. 461 is the earliest evidence of Preslav translation in a Tărnovo redaction, it is still a single link in the chain of the Slavonic tradition and has a many shortcomings compared to the CSI in the Russian tradition.

Subject: History Language studies Language and Literature Studies Cultural history Studies of Literature Middle Ages Eastern Slavic Languages Philology Translation Studies

The Parable of the Unicorn in the Story of Barlaam and Josaphat


Двойная рецепция при формировании княжеской службы: служба св. Александру Невскому как модель

A Double Reception in the Formation of a Princely Service: The Service of St Alexander Nevsky as a Model

  • Summary/Abstract

    The Service of St Alexander Nevsky was written by Monk Michael of the Roždestvenskij monastyr’ (Nativity monastery) in Vladimir. He was one of the writers, belonging with the circle of Metropolitan Macarius, who composed princely services (and sometimes vitas) for new Russian saints. Most of the services are compilations of verses and hymns and more or less exact borrowings (and sometimes compositions according to models). In the Service of St Alexander Nevsky, the most refined of Monk Michael’s works, the hymnographer utilized various models to combine them into one canon, thus giving it the colour of an original work. It is important to add that Monk Michael used Slavic translations instead of original Greek texts, a fact proved by textological comparison. The service, dedicated to a saint prince, canonized in the sixteenth century, was the only one included in the Menaion. Together with the especial respect and veneration of the new saint, it was one of the reasons why his service became a model of other princely services. It is worth noting that instead of hymns, originally borrowed for the new service, exactly the adapted hymns to St Alexander were taken as standard for princely services, thus allowing a double reception of the translated hymns. For the purpose of the investigation the author analyzes the services of St Roman of Ugleč, St Daniel of Moscow, the Service of Finding of his Relics including, as well as the service to St Dowmant of Pskov.


Linguistics vs. Digital Editions: The Tromsø Old Russian and OCS Treebank

  • Summary/Abstract

    This article provides a description of the Tromsø Old Russian and OCS Treebank (TOROT), which, along with its parent treebank, the PROIEL corpus (built by members of the project Pragmatic Resources in Old Indo-European Languages), is the only existing treebank of Old Church Slavonic, Old East Slavic and Middle Russian texts. The TOROT is a part of a larger family of treebanks of ancient languages which all use the PROIEL open-source annotion web tool and annotation schemes. In this article we present principles and selected problems at several levels of analysis in the TOROT, and then briefly discuss ways that corpus linguists and edition philologists can fruitfully collaborate and complement each other.


Исторический корпус как цель и инструмент корпусной палеославистик

Diachronic OCS Corpus as an Object and an Instrument of Corpus Palaeoslavitic


Subscribe to Eastern Slavic Languages