Enriching consumer health vocabulary using enhanced GloVe word embedding

M Ibrahim, S Gauch, O Salman… - arXiv preprint arXiv …, 2020 - arxiv.org
arXiv preprint arXiv:2004.00150, 2020arxiv.org
Open-Access and Collaborative Consumer Health Vocabulary (OAC CHV, or CHV for short),
is a collection of medical terms written in plain English. It provides a list of simple, easy, and
clear terms that laymen prefer to use rather than an equivalent professional medical term.
The National Library of Medicine (NLM) has integrated and mapped the CHV terms to their
Unified Medical Language System (UMLS). These CHV terms mapped to 56000
professional concepts on the UMLS. We found that about 48% of these laymen's terms are …
Open-Access and Collaborative Consumer Health Vocabulary (OAC CHV, or CHV for short), is a collection of medical terms written in plain English. It provides a list of simple, easy, and clear terms that laymen prefer to use rather than an equivalent professional medical term. The National Library of Medicine (NLM) has integrated and mapped the CHV terms to their Unified Medical Language System (UMLS). These CHV terms mapped to 56000 professional concepts on the UMLS. We found that about 48% of these laymen's terms are still jargon and matched with the professional terms on the UMLS. In this paper, we present an enhanced word embedding technique that generates new CHV terms from a consumer-generated text. We downloaded our corpus from a healthcare social media and evaluated our new method based on iterative feedback to word embedding using ground truth built from the existing CHV terms. Our feedback algorithm outperformed unmodified GLoVe and new CHV terms have been detected.
arxiv.org
以上显示的是最相近的搜索结果。 查看全部搜索结果