[PDF][PDF] Statistical comparisons of classifiers over multiple data sets

J Demšar - The Journal of Machine learning research, 2006 - jmlr.org
While methods for comparing two learning algorithms on a single data set have been
scrutinized for quite some time already, the issue of statistical tests for comparisons of more …

Predictive data mining in clinical medicine: current issues and guidelines

R Bellazzi, B Zupan - International journal of medical informatics, 2008 - Elsevier
BACKGROUND: The widespread availability of new computational methods and tools for
data analysis and predictive modeling requires medical informatics researchers and …

Interpreting interpretability: understanding data scientists' use of interpretability tools for machine learning

H Kaur, H Nori, S Jenkins, R Caruana… - Proceedings of the …, 2020 - dl.acm.org
Machine learning (ML) models are now routinely deployed in domains ranging from criminal
justice to healthcare. With this newfound ubiquity, ML has moved beyond academia and …

Rethinking explainability as a dialogue: A practitioner's perspective

H Lakkaraju, D Slack, Y Chen, C Tan… - arXiv preprint arXiv …, 2022 - arxiv.org
As practitioners increasingly deploy machine learning models in critical domains such as
health care, finance, and policy, it becomes vital to ensure that domain experts function …

Label-free spectroscopic SARS-CoV-2 detection on versatile nanoimprinted substrates

D Paria, KS Kwok, P Raj, P Zheng, DH Gracias… - Nano …, 2022 - ACS Publications
Widespread testing and isolation of infected patients is a cornerstone of viral outbreak
management, as underscored during the ongoing COVID-19 pandemic. Here, we report a …

API design for machine learning software: experiences from the scikit-learn project

L Buitinck, G Louppe, M Blondel, F Pedregosa… - arXiv preprint arXiv …, 2013 - arxiv.org
Scikit-learn is an increasingly popular machine learning li-brary. Written in Python, it is
designed to be simple and efficient, accessible to non-experts, and reusable in various …

[PDF][PDF] Turn on, tune in, drop out: Anticipating student dropouts in massive open online courses

D Yang, T Sinha, D Adamson… - Proceedings of the 2013 …, 2013 - www-cs.stanford.edu
In this paper, we explore student dropout behavior in Massive Open Online Courses
(MOOC). We use as a case study a recent Coursera class from which we develop a survival …

A user's guide to support vector machines

A Ben-Hur, J Weston - Data mining techniques for the life sciences, 2010 - Springer
Abstract The Support Vector Machine (SVM) is a widely used classifier in bioinformatics.
Obtaining the best results with SVMs requires an understanding of their workings and the …

[PDF][PDF] Pattern for python

T De Smedt, W Daelemans - The Journal of Machine Learning Research, 2012 - jmlr.org
Pattern is a package for Python 2.4+ with functionality for web mining (Google+ Twitter+
Wikipedia, web spider, HTML DOM parser), natural language processing (tagger/chunker, n …

KEEL: a software tool to assess evolutionary algorithms for data mining problems

J Alcalá-Fdez, L Sanchez, S Garcia, MJ del Jesus… - Soft Computing, 2009 - Springer
This paper introduces a software tool named KEEL which is a software tool to assess
evolutionary algorithms for Data Mining problems of various kinds including as regression …