[图书][B] Discourse processing

M Stede - 2012 - books.google.com
Discourse Processing here is framed as marking up a text with structural descriptions on
several levels, which can serve to support many language-processing or text-mining tasks …

[HTML][HTML] SGF-An integrated model for multiple annotations and its application in a linguistic domain

M Stührenberg, D Goecke - Balisage: The Markup Conference, 2008 - balisage.net
Seamless integration of various, often heterogeneous linguistic resources (in terms of their
output formats) and merging of the respective annotation layers are crucial tasks for …

[HTML][HTML] A toolkit for multi-dimensional markup

M Stührenberg, D Jettka - Balisage: The Markup Conference, 2009 - balisage.net
In this paper we describe the extended standoff approach defined by XStandoff (the
successor of the Sekimo Generic Format, SGF), together with the accompanied collection of …

[PDF][PDF] eHumanities Desktop-an online system for corpus management and analysis in support of computing in the humanities

R Gleim, U Waltinger, A Ernst, A Mehler… - Proceedings of the …, 2009 - aclanthology.org
This paper introduces eHumanities Desktop-an online system for corpus management and
analysis in support of Computing in the Humanities. Design issues and the overall …

[PDF][PDF] eHumanities Desktop—eine webbasierte Arbeitsumgebung für die geisteswissenschaftliche Fachinformatik

A Mehler, R Gleim, U Waltinger, A Ernst… - und …, 2009 - duepublico2.uni-due.de
In diesem Beitrag beschreiben wir den eHumanities Desktop3. Es handelt sich dabei um
eine rein webbasierte Umgebung für die texttechnologische Arbeit mit Korpora, welche von …

Markup infrastructure for the anaphoric bank: Supporting web collaboration

M Poesio, N Diewald, M Stührenberg… - Modeling, Learning, and …, 2012 - Springer
Modern NLP systems rely either on unsupervised methods, or on data created as part of
governmental initiatives such as MUC, ACE, or GALE. The data created in these efforts tend …

[PDF][PDF] Less destructive cleaning of web documents by using standoff annotation

M Stührenberg - Proceedings of the 9th Web as Corpus Workshop …, 2014 - aclanthology.org
Standoff annotation, that is, the separation of primary data and markup, can be an interesting
option to annotate web pages since it does not demand the removal of annotations already …

[PDF][PDF] An extensible Online System for Corpus Management and Analysis

R Gleim, A Mehler - Citeseer
eHumanities Desktop Page 1 eHumanities Desktop An extensible Online System for Corpus
Management and Analysis Rüdiger Gleim Goethe Universität Frankfurt gleim@em.uni-frankfurt.de …

[HTML][HTML] Balisage: The Markup Conference 2008

M Stührenberg, C Wurm - balisage.net
This paper presents a refined taxonomy of XML schema languages based on the work by
Murata et al., 2005. It can be seen as first building block for a more elaborate formal analysis …

[PDF][PDF] eHumanities Desktop

R Gleim, P Warner, A Mehler - 2010 - pdfs.semanticscholar.org
This article addresses challenges in maintaining and annotating image resources in the field
of iconographic research. We focus on the task of bringing together generic and extensible …