The TEI and current standards for structuring linguistic data. An overview

M Stührenberg - Journal of the text encoding initiative, 2012 - journals.openedition.org
The TEI has served for many years as a mature annotation format for corpora of different
types, including linguistically annotated data. Although it is based on the consensus of a …

[图书][B] Different Views on Markup: Linguistic Modeling of Information and Markup Languages. Contributions to Language Technology

D Goecke, H Lüngen, D Metzing, M Stührenberg, A Witt - 2015 - ids-pub.bsz-bw.de
In this chapter, two different ways of grouping information represented in document markup
are examined: annotation levels, referring to conceptual levels of description, and …

[HTML][HTML] Visualization of concurrent markup

D Jettka, M Stührenberg - Balisage: The Markup Conference, 2011 - balisage.net
The present paper deals with the visualization of concurrent markup. An initial discussion of
the underlying model of XML instances demonstrates that valid XML exceeds the expressive …

Evaluace chybové anotace v žákovském korpusu češtiny

B Štindlová - 2011 - dspace.cuni.cz
Název práce: Evaluace chybové anotace v žákovském korpusu češtiny Autor: Barbora
Štindlová Ústav: Ústav českého jazyka a teorie komunikace, Filozofická fakulta, Univerzita …

Guidance through the standards jungle for linguistic resources

M Stührenberg, A Werthmann… - Proceedings of the LREC …, 2012 - ids-pub.bsz-bw.de
Research today is often performed in collaborated projects composed of project partners
with different backgrounds and from different institutions and countries. Standards can be a …

[HTML][HTML] Refining the taxonomy of xml schema languages. a new approach for categorizing xml schema languages in terms of processing complexity

M Stührenberg, C Wurm - Balisage: The Markup Conference, 2010 - balisage.net
This paper presents a refined taxonomy of XML schema languages based on the work by
Murata et al., 2005. It can be seen as first building block for a more elaborate formal analysis …

[HTML][HTML] The MLCD Overlap Corpus (MOC)

Y Marcoux, C Huitfeldt… - Balisage: The Markup …, 2012 - balisage.net
Abstract The MLCD Overlap Corpus (MOC) is a collection of samples of texts and text
fragments with overlapping structures. The main immediate goal of the MOC project is to …

[HTML][HTML] What, when, where? Spatial and temporal annotations with XStandoff

M Stührenberg - Balisage: The Markup Conference, 2013 - balisage.net
Balisage: What, when, where? Spatial and temporal annotations with XStandoff Skip to
contents. Link to the Balisage Proceedings Home Page at https://www.balisage.net/Proceedings …

Markup infrastructure for the anaphoric bank: Supporting web collaboration

M Poesio, N Diewald, M Stührenberg… - Modeling, Learning, and …, 2012 - Springer
Modern NLP systems rely either on unsupervised methods, or on data created as part of
governmental initiatives such as MUC, ACE, or GALE. The data created in these efforts tend …

[图书][B] Integrated linguistic annotation models and their application in the domain of antecedent detection

A Witt, M Stührenberg, D Goecke, D Metzing - 2012 - Springer
Seamless integration of various, often heterogeneous linguistic resources in terms of their
output formats and a combined analysis of the respective annotation layers are crucial tasks …