Unambiguous scene text segmentation with referring expression comprehension

J Chen, Y Huang, T Lv, L Cui… - Advances in Neural …, 2024 - proceedings.neurips.cc

Diffusion models have gained increasing attention for their impressive generation abilities
but currently struggle with rendering accurate and coherent text. To address this issue, we …

被引用次数：93 相关文章所有 5 个版本

[PDF] ieee.org

Vehicle-damage-detection segmentation algorithm based on improved mask RCNN

Q Zhang, X Chang, SB Bian - IEEE Access, 2020 - ieeexplore.ieee.org

Traffic congestion due to vehicular accidents seriously affects normal travel, and accurate
and effective mitigating measures and methods must be studied. To resolve traffic accident …

被引用次数：151 相关文章所有 3 个版本

BRPPNet: Balanced privacy protection network for referring personal image privacy protection

J Lin, X Dai, K Nai, J Yuan, Z Li, X Zhang, S Li - Expert Systems with …, 2023 - Elsevier

Traditional personal image privacy protection usually suffers from the overprotection
problem, where one or more undesired persons in an image may be inevitably shielded …

被引用次数：9 相关文章所有 2 个版本

[PDF] acm.org

Tpsnet: Reverse thinking of thin plate splines for arbitrary shape scene text representation

W Wang, Y Zhou, J Lv, D Wu, G Zhao, N Jiang… - Proceedings of the 30th …, 2022 - dl.acm.org

The research focus of scene text detection and recognition has shifted to arbitrary shape text
in recent years, where the text shape representation is a fundamental problem. An ideal …

被引用次数：35 相关文章所有 3 个版本

A new DCT-PCM method for license plate number detection in drone images

H Mokayed, P Shivakumara, HH Woon… - Pattern Recognition …, 2021 - Elsevier

License plate number detection in drone images is a complex problem because the images
are generally captured at oblique angles and pose several challenges like perspective …

被引用次数：32 相关文章所有 6 个版本

Piglet: Pixel-level grounding of language expressions with transformers

C González, N Ayobi, I Hernández… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org

This paper proposes Panoptic Narrative Grounding, a spatially fine and general formulation
of the natural language visual grounding problem. We establish an experimental framework …

被引用次数：8 相关文章所有 5 个版本

[PDF] thecvf.com

Panoptic narrative grounding

C González, N Ayobi, I Hernández… - Proceedings of the …, 2021 - openaccess.thecvf.com

Abstract This paper proposes Panoptic Narrative Grounding, a spatially fine and general
formulation of the natural language visual grounding problem. We establish an experimental …

被引用次数：23 相关文章所有 8 个版本

Geometry sensitive cross-modal reasoning for composed query based image retrieval

F Zhang, M Xu, C Xu - IEEE Transactions on Image Processing, 2021 - ieeexplore.ieee.org

Composed Query Based Image Retrieval (CQBIR) aims at retrieving images relevant to a
composed query containing a reference image with a requested modification expressed via …

被引用次数：19 相关文章所有 5 个版本

[PDF] bonviewpress.com

License plate number detection in drone images

H Mokayed, P Shivakumara… - Artificial Intelligence …, 2025 - ojs.bonviewpress.com

This work aims to figure out a way to accurately identify license plate numbers in photos
taken by drones. This technology is used in practical applications like managing parking and …

被引用次数：14 相关文章

[PDF] thecvf.com

ScanFormer: Referring Expression Comprehension by Iteratively Scanning

W Su, P Miao, H Dou, X Li - … of the IEEE/CVF Conference on …, 2024 - openaccess.thecvf.com

Abstract Referring Expression Comprehension (REC) aims to localize the target objects
specified by free-form natural language descriptions in images. While state-of-the-art …

被引用次数：5 相关文章

高级搜索

QQ 群