Extending lexical association measures for collocation extraction

Handling the impact of low frequency events on co-occurrence based measures of word similarity-a case study of pointwise mutual information

F Role, M Nadif - … on Knowledge Discovery and Information Retrieval, 2011 - scitepress.org

Statistical measures of word similarity are widely used in many areas of information retrieval
and text mining. Among popular word co-occurrence based measures is Pointwise Mutual …

被引用次数：72 相关文章所有 3 个版本

[PDF] arxiv.org

Beneath (or beyond) the surface: Discovering voice-leading patterns with skip-grams

DRW Sears, G Widmer - Journal of Mathematics and Music, 2021 - Taylor & Francis

Recurrent voice-leading patterns like the Mi-Re-Do compound cadence (MRDCC) rarely
appear on the musical surface in complex polyphonic textures, so finding these patterns …

被引用次数：15 相关文章所有 4 个版本

[PDF] academia.edu

Term extraction from sparse, ungrammatical domain-specific documents

A Ittoo, G Bouma - Expert Systems with Applications, 2013 - Elsevier

Existing term extraction systems have predominantly targeted large and well-written
document collections, which provide reliable statistical and linguistic evidence to support …

被引用次数：40 相关文章所有 9 个版本

[PDF] koreascience.kr

KR-WordRank: An unsupervised Korean word extraction method based on WordRank

H Kim, S Cho, P Kang - Journal of Korean Institute of Industrial …, 2014 - koreascience.kr

A Word is the smallest unit for text analysis, and the premise behind most text-mining
algorithms is that the words in given documents can be perfectly recognized. However, the …

被引用次数：26 相关文章

[PDF] euralex.org

[PDF][PDF] Finding multiwords of more than two words

A Kilgarriff, P Rychlý, V Kovář, V Baisa - Proceedings of the 15th …, 2012 - euralex.org

The prospects for automatically identifying two-word multiwords in corpora have been
explored in depth, and there are now well-established methods in widespread use.(We use …

被引用次数：34 相关文章所有 9 个版本

[PDF] springer.com

CLAD: A corpus-derived Chinese lexical association database

SY Lin, HC Chen, TH Chang, WE Lee… - Behavior Research …, 2019 - Springer

The application of word associations has become increasingly widespread. However, the
association norms produced by traditional free association tests tend not to exceed 10,000 …

被引用次数：15 相关文章所有 9 个版本

Multi-word terms selection for information retrieval

C Bechikh Ali, H Haddad, Y Slimani - Information Discovery and …, 2023 - emerald.com

Purpose A number of approaches and algorithms have been proposed over the years as a
basis for automatic indexing. Many of these approaches suffer from precision inefficiency at …

被引用次数：3 相关文章

[PDF] hu-berlin.de

Measuring coselectional constraint in learner corpora: A graph-based approach

AV Shadrova - 2020 - edoc.hu-berlin.de

The thesis located in corpus linguistics analyzes the acquisition of coselectional constraint in
learners of German as a second language in a quasi-longitudinal design based on the …

被引用次数：12 相关文章所有 4 个版本

TermeX: A Tool for Collocation Extraction

D Delač, Z Krleža, J Šnajder, B Dalbelo Bašić… - … and Intelligent Text …, 2009 - Springer

Collocations–word combinations occurring together more often than by chance–have a wide
range of NLP applications. Many approaches for automating collocation extraction based on …

被引用次数：28 相关文章所有 7 个版本

[PDF] academia.edu

Improving product quality and reliability with customer experience data

A Brombacher, E Hopma, A Ittoo, Y Lu… - Quality and …, 2012 - Wiley Online Library

Advance technology development and wide use of the World Wide Web have made it
possible for new product development organizations to access multi‐sources of data‐related …

被引用次数：25 相关文章所有 8 个版本

高级搜索

QQ 群