Hazard analysis: A deep learning and text mining framework for accident prevention

B Zhong, X Pan, PED Love, J Sun, C Tao - Advanced Engineering …, 2020 - Elsevier
Learning from past accidents is pivotal for improving safety in construction. However, hazard
records are typically documented and stored as unstructured or semi-structured free-text …

Discovering key concepts in verbose queries

M Bendersky, WB Croft - Proceedings of the 31st annual international …, 2008 - dl.acm.org
Current search engines do not, in general, perform well with longer, more verbose queries.
One of the main issues in processing these queries is identifying the key concepts that will …

Efficient document clustering via online nonnegative matrix factorizations

F Wang, C Tan, P Li, AC König - Proceedings of the 2011 SIAM International …, 2011 - SIAM
Abstract In recent years, Nonnegative Matrix Factorization (NMF) has received considerable
interest from the data mining and information retrieval fields. NMF has been successfully …

Clustering of biomedical documents using ontology-based TF-IGM enriched semantic smoothing model for telemedicine applications

R Sandhiya, M Sundarambal - Cluster Computing, 2019 - Springer
Clustering of biomedical documents has become a vital research concept due to its
importance in the clinical and telemedicine applications. The clustering of the medical …

A Model for Predicting n-gram Frequency Distribution in Large Corpora

JF Silva, JC Cunha - International Conference on Computational Science, 2021 - Springer
The statistical extraction of multiwords (n-grams) from natural language corpora is
challenged by computationally heavy searching and indexing, which can be improved by …

[PDF][PDF] Concept lattice structure with attribute lattices

L Kovács - Production Systems and Information Engineering, 2006 - uni-obuda.hu
There is an increasing interest on application of concept lattices in the different information
systems. The concept lattice may be used for representation of the concept generalisation …

A hierarchical classification mechanism for organization document management

JL Hou, FH Lin - The international journal of advanced manufacturing …, 2006 - Springer
In light of the popularity of digital documents in manufacturing systems and manufacturing
support systems, implementation of electronic solutions for enterprise document …

Kernel-based clustering with automatic cluster number selection

CD Wang, JH Lai, D Huang - 2011 IEEE 11th International …, 2011 - ieeexplore.ieee.org
Kernel k-means is one of the most well-known kernel-based clustering methods for
discovering nonlinearly separable clusters. However, like its original counterpart k-means …

Extracção de Unigramas Relevantes

JMJ Ventura - 2008 - search.proquest.com
A extracção automática de Unidades Lexicais Multipalavra (ULM) a partir de corporaé
actualmente uma área de grande aplicabilidade. Porém, os avanços na aplicação das …

[PDF][PDF] Identification of document language in hard contexts

JF da Silva, JGP Lopes - Proceedings of the SIGIR 2006 Workshop on …, 2006 - Citeseer
Automatic determination of the language in which a document is written is not yet a
completely solved problem. Generically it is solved as a classification problem and, for most …