On calibration of scene-text recognition models

XY Zhang, GS Xie, X Li, T Mei… - Proceedings of the IEEE, 2023 - ieeexplore.ieee.org

Learning to reject is a special kind of self-awareness (the ability to know what you do not
know), which is an essential factor for humans to become smarter. Although machine …

Cited by 27  Related articles

Document understanding dataset and evaluation (dude)

J Van Landeghem, R Tito… - Proceedings of the …, 2023 - openaccess.thecvf.com

We call on the Document AI (DocAI) community to re-evaluate current methodologies and
embrace the challenge of creating more practically-oriented benchmarks. Document …

Cited by 20  Related articles  All 9 versions

Training uncertainty-aware classifiers with conformalized deep learning

BS Einbinder, Y Romano, M Sesia… - Advances in Neural …, 2022 - proceedings.neurips.cc

Deep neural networks are powerful tools to detect hidden patterns in data and leverage
them to make predictions, but they are not designed to understand uncertainty and estimate …

Cited by 36  Related articles  All 8 versions

Out-of-vocabulary challenge report

S Garcia-Bordils, A Mafla, AF Biten, O Nuriel… - … on Computer Vision, 2022 - Springer

This paper presents final results of the Out-Of-Vocabulary 2022 (OOV) challenge. The OOV
contest introduces an important aspect that is not commonly studied by Optical Character …

Cited by 17  Related articles  All 8 versions

Clipter: Looking at the bigger picture in scene text recognition

A Aberdam, D Bensaïd, A Golts… - Proceedings of the …, 2023 - openaccess.thecvf.com

Reading text in real-world scenarios often requires understanding the context surrounding it,
especially when dealing with poor-quality text. However, current scene text recognizers are …

Cited by 12  Related articles  All 8 versions

Textual alchemy: Coformer for scene text understanding

G Deshmukh, O Susladkar… - Proceedings of the …, 2024 - openaccess.thecvf.com

The paper presents CoFormer (Convolutional Fourier Transformer), a robust and
adaptable transformer architecture designed for a range of scene text tasks. CoFormer …

Cited by 2  Related articles  All 3 versions

[BOOK][B] Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part XX

S Avidan, G Brostow, M Cissé, GM Farinella, T Hassner - 2022 - books.google.com

The 39-volume set, comprising the LNCS books 13661 until 13699, constitutes the refereed
proceedings of the 17th European Conference on Computer Vision, ECCV 2022, held in Tel …

Cited by 6  Related articles  All 6 versions

Context-aware selective label smoothing for calibrating sequence recognition model

S Huang, Y Luo, Z Zhuang, JG Yu, M He… - Proceedings of the 29th …, 2021 - dl.acm.org

Despite the success of deep neural network (DNN) on sequential data (i.e., scene text and
speech) recognition, it suffers from the over-confidence problem mainly due to overfitting in …

Cited by 7  Related articles  All 3 versions

Textadain: Paying attention to shortcut learning in text recognizers

O Nuriel, S Fogel, R Litman - European Conference on Computer Vision, 2022 - Springer

Leveraging the characteristics of convolutional layers, neural networks are extremely
effective for pattern recognition tasks. However, in some cases, their decisions are based on …

Cited by 6  Related articles  All 7 versions

Perception and Semantic Aware Regularization for Sequential Confidence Calibration

Z Peng, Y Luo, T Chen, K Xu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Deep sequence recognition (DSR) models receive increasing attention due to their superior
application to various applications. Most DSR models use merely the target sequences as …

Cited by 1  Related articles  All 5 versions
