Building the Italian syntactic-semantic treebank

S Montemagni, F Barsotti, M Battista… - Treebanks: Building and …, 2003 - Springer
The paper reports on the design and construction of a multi-layered corpus of Italian,
annotated at the syntactic and lexico-semantic levels, whose development is supported by …

[PDF][PDF] The american national corpus: A standardized resource of american english

N Ide, C Macleod - Proceedings of corpus linguistics, 2001 - ucrel.lancs.ac.uk
Linguistic research has become heavily reliant on text corpora over the past ten years. Such
resources are becoming increasingly available through efforts such as the Linguistic Data …

The Italian PAROLE corpus: an overview

R Marinelli, L Biagini, R Bindi, S Goggi… - … Linguistics in Pisa …, 2003 - torrossa.com
The PAROLE project (Preparatory Action for Linguistic Resources Organization for
Language Engineering) has produced a set of harmonized corpora and lexicons for a large …

Multilingual natural language generation for multilingual software: a functional linguistic approach

JA Bateman… - Applied Artificial …, 1999 - Taylor & Francis
In this paper we present an implemented account of multilingual linguistic resources for
multilingual text generation that improves significantly on the degree of reuse of resources …

Ein kleines und erweitertes Tagset fürs Deutsche

C Thielen, A Schiller - Lexikon & Text, 1996 - degruyter.com
Die Bereitstellung großer, linguistisch annotierter (getaggter) Textkorpora ist eine wichtige
Voraussetzung für verschiedene computerlinguistische Anwendungen (maschinelle …

Development and perspectives of the Latin morphological analyser" LEMLAT"

M Passarotti - Linguistica computazionale: XX/XXI, 2000/2001, 2000 - torrossa.com
This article deals with the development of a new version of the Latin morphological analyser
LEMLAT. LEMLAT lemmatization currently needs to be enriched with new information about …

Language Processing with Perl and Prolog

P Nugues - Springer Berlin Heidelberg. Retrieved February, 2014 - Springer
In the past 20 years, natural language processing and computational linguistics have
considerably matured. The move has mainly been driven by the massive increase of textual …

A standard tag set expounding traditional morphological features for Arabic language part-of-speech tagging

M Sawalha, E Atwell - Word Structure, 2013 - euppublishing.com
The SALMA Morphological Features Tag Set (SALMA, Sawalha Atwell Leeds Morphological
Analysis tag set for Arabic) captures long-established traditional morphological features of …

[PDF][PDF] The American National Corpus: A Standardized Resource for American English.

C Macleod, N Ide, R Grishman - LREC, 2000 - cs.vassar.edu
At the first conference on Language Resources and Evaluation, Granada 1998, Charles
Fillmore, Nancy Ide, Daniel Jurafsky, and Catherine Macleod proposed creating an …

Evaluation of TnT tagger for spanish

RM Carrasco, A Gelbukh - Proceedings of the Fourth Mexican …, 2003 - ieeexplore.ieee.org
Part of speech (POS) tagger is a necessary module in many natural language text
processing tasks. A POS tagger is a program that accepts an unprepared raw text in input …