Refining a taxonomy by using annotated suffix trees and Wikipedia resources

E Chernyak, B Mirkin - Annals of Data Science, 2015 - Springer
A step-by-step approach to taxonomy construction is presented. On the first step, the upper
layer frame of taxonomy is built manually according to educational materials. On the next …

A method for refining a taxonomy by using annotated suffix trees and wikipedia resources

E Chernyak, B Mirkin - Procedia computer science, 2014 - Elsevier
A two-step approach to taxonomy construction is presented. On the first step the frame of
taxonomy is built manually according to some representative educational materials. On the …

[PDF][PDF] Использование ресурсов Интернета для построения таксономии/Черняк ЕЛ, Миркин БГ

ЕЛ Черняк - Использование ресурсов Интернета для …, 2013 - publications.hse.ru
В работе предложен двухшаговый подход к построению предметных таксономий на
русском языке. На первом шаге строятся высокие уровни таксономии на основе …

[PDF][PDF] An AST method for scoring string-to-text similiarity in semantic text analysis

E Chernyak, B Mirkin - researchgate.net
A suffix-tree based method for measuring similarity of a key phrase to an unstructured text is
proposed. The measure involves less computation and it does not depend on the length of …

[PDF][PDF] Annotated Suffix Trees for Text Clustering.

E Chernyak, DI Ilvovsky - CDUD@ CLA, 2016 - publications.hse.ru
In this paper an extension of tf-idf weighting on annotated suffix tree (AST) structure is
described. The new weighting scheme can be used for computing similarity between texts …

[PDF][PDF] Some Thoughts on Using Annotated Suffix Trees for Natural Language Processing.

E Chernyak - DMNLP@ PKDD/ECML, 2015 - ceur-ws.org
The paper defines an annotated suffix tree (AST)-a data structure used to calculate and store
the frequencies of all the fragments of the given string or a collection of strings. The AST is …

AST Method for Scoring String-to-text Similarity

E Chernyak, B Mirkin - Clusters, Orders, and Trees: Methods and …, 2014 - Springer
A suffix-tree-based method for measuring similarity of a key phrase to an unstructured text is
proposed. The measure involves less computation and it does not depend on the length of …

[PDF][PDF] АвтомАтическое дострАивАние тАксономии нА русском языке нА основе ресурсов википедии

ЕЛ Черняк, БГ Миркин - publications.hse.ru
A two-step approach to devising a hierarchical taxonomy of a domain is outlined. As the first
step, a coarse “high-rank” taxonomy frame is built manually using the materials of the …