Authors
Eli Cortez, Altigran S da Silva, Marcos André Gonçalves, Filipe Mesquita, Edleno S de Moura
Publication date
2009/6
Journal
Journal of the American Society for Information Science and Technology
Volume
60
Issue
6
Pages
1144-1158
Publisher
Wiley Subscription Services, Inc., A Wiley Company
Description
In this article we present FLUX‐CiM, a novel method for extracting components (e.g., author names, article titles, venues, page numbers) from bibliographic citations. Our method does not rely on patterns encoding specific delimiters used in a particular citation style. This feature yields a high degree of automation and flexibility, and allows FLUX‐CiM to extract from citations in any given format. Differently from previous methods that are based on models learned from user‐driven training, our method relies on a knowledge base automatically constructed from an existing set of sample metadata records from a given field (e.g., computer science, health sciences, social sciences, etc.). These records are usually available on the Web or other public data repositories. To demonstrate the effectiveness and applicability of our proposed method, we present a series of experiments in which we apply it to extract bibliographic …
Total citations
[Chart: citations per year, 2009 to 2024]
Scholar articles
E Cortez, AS da Silva, MA Gonçalves, F Mesquita… - Journal of the American Society for Information Science …, 2009