Automatic language classification by means of syntactic dependency networks

O Abramov, A Mehler - Journal of Quantitative Linguistics, 2011 - Taylor & Francis
This article presents an approach to automatic language classification by means of linguistic
networks. Networks of 11 languages were constructed from dependency treebanks, and the …

I3rab: A new Arabic dependency treebank based on Arabic grammatical theory

D Halabi, E Fayyoumi, A Awajan - Transactions on Asian and Low …, 2021 - dl.acm.org
Treebanks are valuable linguistic resources that include the syntactic structure of a
language sentence in addition to part-of-speech tags and morphological features. They are …

Geography of social ontologies: Testing a variant of the Sapir-Whorf Hypothesis in the context of Wikipedia

A Mehler, O Pustylnikov, N Diewald - Computer Speech & Language, 2011 - Elsevier
In this article, we test a variant of the Sapir-Whorf Hypothesis in the area of complex network
theory. This is done by analyzing social ontologies as a new resource for automatic …

Cat3LB and Cast3LB: From constituents to dependencies

M Civit, MA Martí, N Bufí - … Conference on Natural Language Processing (in …, 2006 - Springer
In this paper we present the conversion of two treebanks (Cat3LB for Catalan, and Cast3LB
for Spanish) from its original constituent format into dependencies. The process has been …

[PDF][PDF] Towards a uniform representation of treebanks: Providing interoperability for dependency tree data

O Pustylnikov, A Mehler - Programme Committee 7, 2008 - academia.edu
In this paper we present a corpus representation format which unifies the representation of a
wide range of dependency treebanks within a single model. This approach provides …

[PDF][PDF] A Unified Database of Dependency Treebanks: Integrating, Quantifying & Evaluating Dependency Data.

O Pustylnikov, A Mehler, R Gleim - LREC, 2008 - Citeseer
This paper describes a database of 11 dependency treebanks which were unified by means
of a two-dimensional graph format. The format was evaluated with respect to storage …

GramCat and GramEsp: two grammars for chunking

M Civit, M Antònia Martí - … Processing and Web Mining: Proceedings of the …, 2005 - Springer
In this article we present two grammars (GramCat and GramEsp) for chunking of unrestricted
Catalan and Spanish texts. With these grammars we extend the classical notion of chunk as …

Enhanced Arabic Dependency Parsing based on Irab Theory

D Halabi - 2021 - search.proquest.com
A statistical parser is an important tool in many Natural Language Processing (NLP) tasks,
such as machine translation, information retrieval, and Question Answering. Treebanks are …