Searching for evidence of scientific news in scholarly big data

MRU Hoque, D Bradley, C Kwan, A Chiatti… - Proceedings of the 10th …, 2019 - dl.acm.org
Proceedings of the 10th International Conference on Knowledge Capture, 2019 (dl.acm.org)
Public digital media often mix factual information with fake scientific news, which is typically difficult to pinpoint, especially for non-professionals. These scientific news articles create illusions and misconceptions, and thus ultimately influence public opinion, with serious consequences on a broader social scale. Yet existing solutions that aim to automatically verify the credibility of news articles are still unsatisfactory. We propose to verify scientific news by retrieving and analyzing its most relevant source papers from an academic digital library (DL), e.g., arXiv. Instead of querying keywords or regular named entities extracted from news articles, we query domain knowledge entities (DKEs) extracted from the text. By querying each DKE, we retrieve a list of candidate scholarly papers. We then design a function to rank them and select the most relevant scholarly paper. After exploring various representations, experiments indicate that the term frequency-inverse document frequency (TF-IDF) representation with cosine similarity outperforms baseline models based on word embeddings. This result demonstrates the efficacy of using DKEs to retrieve scientific papers that are relevant to a specific news article. It also indicates that word embeddings may not be the best document representation for domain-specific document retrieval tasks. Our method is fully automated and can be effectively applied to facilitate the detection of fake and misinformed news across many scientific domains.
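The ranking step described in the abstract (TF-IDF representation scored with cosine similarity) can be illustrated with a minimal sketch. This is not the authors' ranking function, which the abstract does not specify. The tokenizer, the smoothed IDF formula, the function names (`tokenize`, `tfidf_vectors`, `cosine`, `rank_candidates`), and the sample texts are all assumptions made for illustration, and the candidate list stands in for papers that would be retrieved by querying DKEs.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Assumed tokenizer: lowercase alphanumeric runs.
    return re.findall(r"[a-z0-9]+", text.lower())


def tfidf_vectors(docs):
    # Build sparse TF-IDF vectors (dict: term -> weight) for each document.
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    doc_freq = Counter()
    for tokens in tokenized:
        doc_freq.update(set(tokens))
    # Smoothed IDF (an assumption; other variants exist).
    idf = {t: math.log((1 + n_docs) / (1 + c)) + 1.0 for t, c in doc_freq.items()}
    return [{t: f * idf[t] for t, f in Counter(tokens).items()} for tokens in tokenized]


def cosine(u, v):
    # Cosine similarity between two sparse vectors; 0.0 if either is empty.
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def rank_candidates(news_text, candidates):
    # candidates: list of (paper_id, text) pairs, e.g. papers retrieved per DKE.
    vectors = tfidf_vectors([news_text] + [text for _, text in candidates])
    news_vec, paper_vecs = vectors[0], vectors[1:]
    scored = [(pid, cosine(news_vec, vec)) for (pid, _), vec in zip(candidates, paper_vecs)]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical news snippet and candidate abstracts, for illustration only.
news = "Physicists detect gravitational waves from merging black holes"
candidates = [
    ("paper-b", "Deep convolutional networks for image classification"),
    ("paper-a", "Observation of gravitational waves from a binary black hole merger"),
    ("paper-c", "Dark matter halos and galaxy formation"),
]
ranking = rank_candidates(news, candidates)
best_paper_id = ranking[0][0]
```

Here the top-ranked candidate is the one sharing the most distinctive terms with the news text. Candidates with no overlapping terms score zero.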