Hedging your bets: Optimizing accuracy-specificity trade-offs in large scale visual recognition

S Bates, A Angelopoulos, L Lei, J Malik… - Journal of the ACM …, 2021 - dl.acm.org

While improving prediction accuracy has been the focus of machine learning in recent years,
this alone does not suffice for reliable decision-making. Deploying learning systems in …

Cited by 176 Related articles All 3 versions

Exploring video captioning techniques: A comprehensive survey on deep learning methods

S Islam, A Dash, A Seum, AH Raj, T Hossain… - SN Computer …, 2021 - Springer

Video captioning is an automated collection of natural language phrases that explains the
contents in video frames. Because of the incomparable performance of deep learning in the …

Cited by 33 Related articles All 6 versions

Im2Calories: towards an automated mobile vision food diary

A Meyers, N Johnston, V Rathod… - Proceedings of the …, 2015 - openaccess.thecvf.com

We present a system which can recognize the contents of your meal from a single image,
and then predict its nutritional contents, such as calories. The simplest version assumes that …

Cited by 539 Related articles All 12 versions

NBDT: Neural-backed decision trees

A Wan, L Dunlap, D Ho, J Yin, S Lee, H Jin… - arXiv preprint arXiv …, 2020 - arxiv.org

Machine learning applications such as finance and medicine demand accurate and
justifiable predictions, barring most deep learning methods from use. In response, previous …

Cited by 177 Related articles All 8 versions

HD-CNN: hierarchical deep convolutional neural networks for large scale visual recognition

Z Yan, H Zhang, R Piramuthu… - Proceedings of the …, 2015 - openaccess.thecvf.com

In image classification, visual separability between different object categories is highly
uneven, and some categories are more difficult to distinguish than others. Such difficult …

Cited by 502 Related articles All 20 versions

Deeptype: multilingual entity linking by neural type system evolution

J Raiman, O Raiman - Proceedings of the AAAI Conference on Artificial …, 2018 - ojs.aaai.org

The wealth of structured (e.g. Wikidata) and unstructured data about the world available today
presents an incredible opportunity for tomorrow's Artificial Intelligence. So far, integration of …

Cited by 213 Related articles All 11 versions

Youtube2text: Recognizing and describing arbitrary activities using semantic hierarchies and zero-shot recognition

S Guadarrama, N Krishnamoorthy… - Proceedings of the …, 2013 - cv-foundation.org

Despite a recent push towards large-scale object recognition, activity recognition remains
limited to narrow domains and small vocabularies of actions. In this paper, we tackle the …

Cited by 602 Related articles All 14 versions

Dynamic deep neural networks: Optimizing accuracy-efficiency trade-offs by selective execution

L Liu, J Deng - Proceedings of the AAAI Conference on Artificial …, 2018 - ojs.aaai.org

We introduce Dynamic Deep Neural Networks (D2NN), a new type of feed-forward
deep neural network that allows selective execution. Given an input, only a subset of D2NN …

Cited by 221 Related articles All 9 versions

TreeTalk: Composition and Compression of Trees for Image Descriptions

P Kuznetsova, V Ordonez, TL Berg… - Transactions of the …, 2014 - direct.mit.edu

We present a new tree based approach to composing expressive image descriptions that
makes use of naturally occurring web images with captions. We investigate two related tasks …

Cited by 298 Related articles All 10 versions

Reasoning about object affordances in a knowledge base representation

Y Zhu, A Fathi, L Fei-Fei - Computer Vision–ECCV 2014: 13th European …, 2014 - Springer

Reasoning about objects and their affordances is a fundamental problem for visual
intelligence. Most of the previous work casts this problem as a classification task where …

Cited by 299 Related articles All 7 versions
