Precedents, progress, and prospects in political event data

PA Schrodt - International Interactions, 2012 - Taylor & Francis
The past decade has seen a renaissance in the development of political event data sets.
This has been due to at least three sets of factors. First, there have been technological …

Text preprocessing for unsupervised learning: Why it matters, when it misleads, and what to do about it

MJ Denny, A Spirling - Political analysis, 2018 - cambridge.org
Despite the popularity of unsupervised techniques for political science text-as-data research,
the importance and implications of preprocessing decisions in this domain have received …

The MID4 dataset, 2002–2010: Procedures, coding rules and description

G Palmer, V d'Orazio, M Kenwick… - … and Peace Science, 2015 - journals.sagepub.com
Understanding the causes of interstate conflict continues to be a primary goal of the field of
international relations. To that end, scholars continue to rely on large datasets of conflict in …

ThunderSVM: A fast SVM library on GPUs and CPUs

Z Wen, J Shi, Q Li, B He, J Chen - Journal of Machine Learning Research, 2018 - jmlr.org
Support Vector Machines (SVMs) are classic supervised learning models for classification,
regression and distribution estimation. A survey conducted by Kaggle in 2017 shows that …

Automatic learning path creation using OER: a systematic literature mapping

A Siren, V Tzerpos - IEEE Transactions on Learning …, 2022 - ieeexplore.ieee.org
Learning paths are curated sequences of resources organized in a way that a learner has all
the prerequisite knowledge needed to achieve their learning goals. In this article, we …

The MID5 Dataset, 2011–2014: Procedures, coding rules, and description

G Palmer, RW McManus, V D'orazio… - … and Peace Science, 2022 - journals.sagepub.com
This article introduces the latest iteration of the most widely used dataset on interstate
conflicts, the Militarized Interstate Dispute (MID) 5 dataset. We begin by outlining the data …

[图书][B] Text as data: A new framework for machine learning and the social sciences

J Grimmer, ME Roberts, BM Stewart - 2022 - books.google.com
A guide for using computational text analysis to learn about the social world From social
media posts and text messages to digital government documents and archives, researchers …

[图书][B] Automated data collection with R: A practical guide to web scraping and text mining

S Munzert, C Rubba, P Meißner, D Nyhuis - 2014 - books.google.com
A hands on guide to web scraping and text mining for both beginners and experienced
users of R Introduces fundamental concepts of the main architecture of the web and …

Computer‐assisted keyword and document set discovery from unstructured text

G King, P Lam, ME Roberts - American Journal of Political …, 2017 - Wiley Online Library
The (unheralded) first step in many applications of automated text analysis involves
selecting keywords to choose documents from a large text corpus for further study. Although …

Cross-lingual classification of political texts using multilingual sentence embeddings

H Licht - Political Analysis, 2023 - cambridge.org
Established approaches to analyze multilingual text corpora require either a duplication of
analysts' efforts or high-quality machine translation (MT). In this paper, I argue that …