Authors
Susana Afonso, Eckhard Bick, Renato Haber, Diana Santos
Publication date
2002
Source
In Manuel González Rodrigues; Carmen Paz Suarez Araujo (ed) Proceedings of the Third International Conference on Language Resources and Evaluation (LREC 2002) (Las Palmas de Gran Canaria Espanha 29-31 de Maio de 2002) Paris: ELRA
Publisher
ELRA
Description
This paper reviews the first year of the creation of a publicly available treebank for Portuguese, Floresta Sintá(c)tica, a collaboration project between the VISL and the Computational Processing of Portuguese projects. After briefly describing the main goals and the organization of the project, the creation of the annotated objects is presented in detail: preparing the text to be annotated, applying the Constraint Grammar-based PALAVRAS parser, revising its output manually in a two-stage process, and carefully documenting the linguistic options. Some examples of the kind of interesting problems dealt with are presented, and the paper ends with a brief description of the tools developed, the project results so far, and a mention of a preliminary inter-annotator test and what was learned from it.
Total citations
[Chart: citations per year, 2002-2024]
Scholar articles
S Afonso, E Bick, R Haber, D Santos - In Manuel González Rodrigues; Carmen Paz …, 2002