Building and Using a Richly Annotated Interlinear Diachronic Corpus: The Case of Old High German Tatian

Svetlana Petrova*, Michael Solf*, Julia Ritz**, Christian Chiarcos**, Amir Zeldes*
 
Collaborative Research Centre 632 “Information Structure”
* Humboldt Universität zu Berlin, Unter den Linden 6, D-10099 Berlin
s.petrova@staff.|michael.solf@|amir.zeldes@rz.hu-berlin.de
** Potsdam University, Karl-Liebknecht-Str. 24, D-14471 Potsdam
julia|chiarcos@ling.uni-potsdam.de
 
The present paper reports on the development and evaluation of a historical corpus designed to support detailed empirical studies on the interaction of information structure and syntax in Old High German (OHG). The creation and exploration of this corpus are part of a more general investigation into the role of information-structural factors in explaining word order variation and change in the Germanic languages. The paper also describes the corpus design principles, methodologies, relevant formats and specifications, and the technical infrastructure employed in creating the corpus, as well as access to the corpus through ANNIS, a linguistic database for information structure.