作者
Wei Wang, Yue Ning, Huzefa Rangwala, Naren Ramakrishnan
发表日期
2016/10/24
图书
Proceedings of the 25th ACM International on Conference on Information and Knowledge Management
页码范围
509-518
简介
State-of-the-art event encoding approaches rely on sentence or phrase level labeling, which are both time consuming and infeasible to extend to large scale text corpora and emerging domains. Using a multiple instance learning approach, we take advantage of the fact that while labels at the sentence level are difficult to obtain, they are relatively easy to gather at the document level. This enables us to view the problems of event detection and extraction in a unified manner. Using distributed representations of text, we develop a multiple instance formulation that simultaneously classifies news articles and extracts sentences indicative of events without any engineered features. We evaluate our model in its ability to detect news articles about civil unrest events (from Spanish text) across ten Latin American countries and identify the key sentences pertaining to these events. Our model, trained without annotated …
引用总数
201820192020202120222023202467511672
学术搜索中的文章
W Wang, Y Ning, H Rangwala, N Ramakrishnan - Proceedings of the 25th ACM International on …, 2016