Incorporating prior knowledge in support vector machines for classification: A review

F Lauer, G Bloch - Neurocomputing, 2008 - Elsevier
For classification, support vector machines (SVMs) have recently been introduced and
quickly became the state of the art. Now, the incorporation of prior knowledge into SVMs is …

Anomaly detection using one-class neural networks

R Chalapathy, AK Menon, S Chawla - arXiv preprint arXiv:1802.06360, 2018 - arxiv.org
We propose a one-class neural network (OC-NN) model to detect anomalies in complex
data sets. OC-NN combines the ability of deep networks to extract a progressively rich …

SimpleMKL

A Rakotomamonjy, F Bach, S Canu… - Journal of Machine …, 2008 - hal.science
Multiple kernel learning aims at simultaneously learning a kernel and the associated
predictor in supervised learning settings. For the support vector machine, an efficient and …

[PDF][PDF] Core vector machines: Fast SVM training on very large data sets.

IW Tsang, JT Kwok, PM Cheung, N Cristianini - Journal of Machine …, 2005 - jmlr.org
Standard SVM training has O (m3) time and O (m2) space complexities, where m is the
training set size. It is thus computationally infeasible on very large data sets. By observing …

Support vector machines in R

A Karatzoglou, D Meyer, K Hornik - Journal of statistical software, 2006 - jstatsoft.org
Being among the most popular and efficient classification and regression methods currently
available, implementations of support vector machines exist in almost every popular …

[PDF][PDF] Fast kernel classifiers with online and active learning.

A Bordes, S Ertekin, J Weston, L Botton… - Journal of machine …, 2005 - jmlr.org
Very high dimensional learning systems become theoretically possible when training
examples are abundant. The computing cost then becomes the limiting factor. Any efficient …

[PDF][PDF] Nonparametric quantile estimation

I Takeuchi, Q Le, T Sears, A Smola - 2006 - jmlr.org
In regression, the desired estimate of y| x is not always given by a conditional mean,
although this is most common. Sometimes one wants to obtain a good estimate that satisfies …

[PDF][PDF] Breaking the curse of kernelization: Budgeted stochastic gradient descent for large-scale svm training

Z Wang, K Crammer, S Vucetic - The Journal of Machine Learning …, 2012 - jmlr.org
Online algorithms that process one example at a time are advantageous when dealing with
very large data or with data streams. Stochastic Gradient Descent (SGD) is such an …

Parallelizing support vector machines on distributed computers

K Zhu, H Wang, H Bai, J Li, Z Qiu… - Advances in neural …, 2007 - proceedings.neurips.cc
Abstract Support Vector Machines (SVMs) suffer from a widely recognized scalability
problem in both memory use and computational time. To improve scalability, we have …

Learning relevant image features with multiple-kernel classification

D Tuia, G Camps-Valls, G Matasci… - IEEE Transactions on …, 2010 - ieeexplore.ieee.org
The increase in spatial and spectral resolution of the satellite sensors, along with the
shortening of the time-revisiting periods, has provided high-quality data for remote sensing …