Interpretable machine learning for discovery: Statistical challenges and opportunities

GI Allen, L Gan, L Zheng - Annual Review of Statistics and Its …, 2023 - annualreviews.org
New technologies have led to vast troves of large and complex data sets across many
scientific domains and industries. People routinely use machine learning techniques not …

A scaling model for estimating time‐series party positions from texts

JB Slapin, SO Proksch - American Journal of Political Science, 2008 - Wiley Online Library
Recent advances in computational content analysis have provided scholars promising new
ways for estimating party positions. However, existing text‐based methods face challenges …

[PDF] Retrospectives: Who invented instrumental variable regression?

JH Stock, F Trebbi - Journal of Economic Perspectives, 2003 - pubs.aeaweb.org
The instrumental variables estimator first appeared explicitly in Appendix B of The Tariff on
Animal and Vegetable Oils by Philip G. Wright (1928). It has been suggested that this …

Feature evaluation by filter, wrapper, and embedded approaches

U Stańczyk - Feature selection for data and pattern recognition, 2015 - Springer
The choice of particular variables for construction of a set of characteristic features relevant
to classification can be executed in a kind of external process with respect to a classification …

Humanities data in R

T Arnold, L Tilton - Exploring networks, geospatial data, images, and text, 2015 - Springer
There has been a rapid increase in the application of computational methods to humanities
data in recent years. Numerous workshops, lectures, bootcamps, blogs, and texts have …

Vocabulary richness measure in genres

M Kubát, J Milička - Journal of Quantitative Linguistics, 2013 - Taylor & Francis
This article deals with one of the oldest and most traditional fields in quantitative
linguistics, the concept of vocabulary richness. Although there are several methods for …

[BOOK] Constructions: A new approach to formularity, discourse, and syntax in Homer

C Bozzone - 2015 - search.proquest.com
This dissertation argues that formulaic phenomena in Homer are best described by using
the linguistic concept of construction (borrowed from Construction Grammar). Through a …

A comparative study of language models for book and author recognition

Ö Uzuner, B Katz - International conference on natural language …, 2005 - Springer
Linguistic information can help improve evaluation of similarity between documents;
however, the kind of linguistic information to be used depends on the task. In this paper, we …

[PDF] Using syntactic information to identify plagiarism

O Uzuner, B Katz, T Nahnsen - Proceedings of the second …, 2005 - aclanthology.org
Proceedings of the 2nd Workshop on Building Educational Applications Using NLP,
pages 37–44, Ann Arbor, June 2005. © Association for Computational …

[PDF] Machine learning approach to authorship attribution of literary texts

U Stańczyk, KA Cyran - International journal of applied mathematics and …, 2007 - wseas.us
Machine learning approaches are employed in a variety of feature extraction and
classification tasks because of their efficiency in dealing with huge amounts of data. The …