作者
Ian EH Yen, Xiangru Huang, Wei Dai, Pradeep Ravikumar, Inderjit Dhillon, Eric Xing
发表日期
2017/8/4
图书
Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
页码范围
545-553
简介
Extreme Classification comprises multi-class or multi-label prediction where there is a large number of classes, and is increasingly relevant to many real-world applications such as text and image tagging. In this setting, standard classification methods, with complexity linear in the number of classes, become intractable, while enforcing structural constraints among classes (such as low-rank or tree-structure) to reduce complexity often sacrifices accuracy for efficiency. The recent PD-Sparse method addresses this via an algorithm that is sub-linear in the number of variables, by exploiting primal-dual sparsity inherent in a specific loss function, namely the max-margin loss. In this work, we extend PD-Sparse to be efficiently parallelized in large-scale distributed settings. By introducing separable loss functions, we can scale out the training, with network communication and space efficiency comparable to those in one …
引用总数
2017201820192020202120222023202421431233228217
学术搜索中的文章
IEH Yen, X Huang, W Dai, P Ravikumar, I Dhillon… - Proceedings of the 23rd ACM SIGKDD International …, 2017