The dual-sparse topic model: mining focused topics and focused terms in short text

T Lin, W Tian, Q Mei, H Cheng - … of the 23rd international conference on …, 2014 - dl.acm.org
Topic modeling has been proved to be an effective method for exploratory text mining. It is a
common assumption of most topic models that a document is generated from a mixture of …

Comparing subspace clusterings

A Patrikainen, M Meila - IEEE Transactions on Knowledge and …, 2006 - ieeexplore.ieee.org
We present the first framework for comparing subspace clusterings. We propose several
distance measures for subspace clusterings, including generalizations of well-known …

Decoupling sparsity and smoothness in the discrete hierarchical dirichlet process

C Wang, D Blei - Advances in neural information …, 2009 - proceedings.neurips.cc
We present a nonparametric hierarchical Bayesian model of document collections that
decouples sparsity and smoothness in the component distributions (ie, the``topics). In the …

The discrete basis problem

P Miettinen, T Mielikäinen, A Gionis, G Das… - Knowledge Discovery in …, 2006 - Springer
Matrix decomposition methods represent a data matrix as a product of two smaller matrices:
one containing basis vectors that represent meaningful concepts in the data, and another …

Generalized feature embedding for supervised, unsupervised, and online learning tasks

E Golinko, X Zhu - Information Systems Frontiers, 2019 - Springer
Feature embedding is an emerging research area which intends to transform features from
the original space into a new space to support effective learning. Many feature embedding …

The aspect Bernoulli model: multiple causes of presences and absences

E Bingham, A Kabán, M Fortelius - Pattern Analysis and Applications, 2009 - Springer
We present a probabilistic multiple cause model for the analysis of binary (0–1) data. A
distinctive feature of the aspect Bernoulli (AB) model is its ability to automatically detect and …

BeSOM: Bernoulli on self-organizing map

M Lebbah, N Rogovschi… - 2007 International joint …, 2007 - ieeexplore.ieee.org
This paper introduces a probabilistic self-organizing map for clustering, analysis and
visualization of multivariate binary data. We propose a probabilistic formalism dedicated to …

Co-occurrence models in music genre classification

P Ahrendt, J Larsen, C Goutte - 2005 IEEE Workshop on …, 2005 - ieeexplore.ieee.org
Music genre classification has been investigated using many different methods, but most of
them build on probabilistic models of feature vectors xr which only represent the short time …

A nonnegative blind source separation model for binary test data

R Schachtner, G Poppel… - IEEE Transactions on …, 2010 - ieeexplore.ieee.org
A novel method called binNMF is introduced which aimed to extract hidden information from
multivariate binary data sets. The method treats the problem in the spirit of blind source …

Comparison of component models in analysing the distribution of dialectal features

A Leino, S Hyvönen - … Journal of Humanities and Arts Computing, 2008 - euppublishing.com
Component models such as factor analysis can be used to analyse spatial distributions of a
large number of different features–for instance the isogloss data in a dialect atlas, or the …