Document clustering and cluster topic extraction in multilingual corpora

B Zhong, X Pan, PED Love, J Sun, C Tao - Advanced Engineering …, 2020 - Elsevier

Learning from past accidents is pivotal for improving safety in construction. However, hazard
records are typically documented and stored as unstructured or semi-structured free-text …

被引用次数：133 相关文章

[PDF] psu.edu

Discovering key concepts in verbose queries

M Bendersky, WB Croft - Proceedings of the 31st annual international …, 2008 - dl.acm.org

Current search engines do not, in general, perform well with longer, more verbose queries.
One of the main issues in processing these queries is identifying the key concepts that will …

被引用次数：344 相关文章所有 11 个版本

[PDF] siam.org

Efficient document clustering via online nonnegative matrix factorizations

F Wang, C Tan, P Li, AC König - Proceedings of the 2011 SIAM International …, 2011 - SIAM

Abstract In recent years, Nonnegative Matrix Factorization (NMF) has received considerable
interest from the data mining and information retrieval fields. NMF has been successfully …

被引用次数：106 相关文章所有 9 个版本

Clustering of biomedical documents using ontology-based TF-IGM enriched semantic smoothing model for telemedicine applications

R Sandhiya, M Sundarambal - Cluster Computing, 2019 - Springer

Clustering of biomedical documents has become a vital research concept due to its
importance in the clinical and telemedicine applications. The clustering of the medical …

被引用次数：9 相关文章所有 3 个版本

[PDF] iccs-meeting.org

A Model for Predicting n-gram Frequency Distribution in Large Corpora

JF Silva, JC Cunha - International Conference on Computational Science, 2021 - Springer

The statistical extraction of multiwords (n-grams) from natural language corpora is
challenged by computationally heavy searching and indexing, which can be improved by …

被引用次数：1 相关文章所有 3 个版本

[PDF] uni-obuda.hu

[PDF][PDF] Concept lattice structure with attribute lattices

L Kovács - Production Systems and Information Engineering, 2006 - uni-obuda.hu

There is an increasing interest on application of concept lattices in the different information
systems. The concept lattice may be used for representation of the concept generalisation …

被引用次数：8 相关文章所有 5 个版本

A hierarchical classification mechanism for organization document management

JL Hou, FH Lin - The international journal of advanced manufacturing …, 2006 - Springer

In light of the popularity of digital documents in manufacturing systems and manufacturing
support systems, implementation of electronic solutions for enterprise document …

被引用次数：8 相关文章所有 8 个版本

Kernel-based clustering with automatic cluster number selection

CD Wang, JH Lai, D Huang - 2011 IEEE 11th International …, 2011 - ieeexplore.ieee.org

Kernel k-means is one of the most well-known kernel-based clustering methods for
discovering nonlinearly separable clusters. However, like its original counterpart k-means …

被引用次数：5 相关文章所有 3 个版本

[PDF] core.ac.uk

Extracção de Unigramas Relevantes

JMJ Ventura - 2008 - search.proquest.com

A extracção automática de Unidades Lexicais Multipalavra (ULM) a partir de corporaé
actualmente uma área de grande aplicabilidade. Porém, os avanços na aplicação das …

被引用次数：6 相关文章所有 3 个版本

[PDF] psu.edu

[PDF][PDF] Identification of document language in hard contexts

JF da Silva, JGP Lopes - Proceedings of the SIGIR 2006 Workshop on …, 2006 - Citeseer

Automatic determination of the language in which a document is written is not yet a
completely solved problem. Generically it is solved as a classification problem and, for most …

被引用次数：5 相关文章

高级搜索

QQ 群