Robust small-scale pedestrian detection with cued recall via memory learning

G Cheng, X Yuan, X Yao, K Yan, Q Zeng… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org

With the rise of deep convolutional neural networks, object detection has achieved
prominent advances in past years. However, such prosperity could not camouflage the …

被引用次数：207 相关文章所有 7 个版本

[PDF] thecvf.com

Small object detection via coarse-to-fine proposal generation and imitation learning

X Yuan, G Cheng, K Yan, Q Zeng… - Proceedings of the …, 2023 - openaccess.thecvf.com

The past few years have witnessed the immense success of object detection, while current
excellent detectors struggle on tackling size-limited instances. Concretely, the well-known …

被引用次数：23 相关文章所有 5 个版本

[PDF] wiley.com Full View

Deep learning serves traffic safety analysis: A forward‐looking review

A Razi, X Chen, H Li, H Wang, B Russo… - IET Intelligent …, 2023 - Wiley Online Library

This paper explores deep learning (DL) methods that are used or have the potential to be
used for traffic video analysis, emphasising driving safety for both autonomous vehicles and …

被引用次数：22 相关文章所有 9 个版本

[PDF] aaai.org

Synctalkface: Talking face generation with precise lip-syncing via audio-lip memory

SJ Park, M Kim, J Hong, J Choi, YM Ro - Proceedings of the AAAI …, 2022 - ojs.aaai.org

The challenge of talking face generation from speech lies in aligning two different modal
information, audio and video, such that the mouth region corresponds to input audio …

被引用次数：62 相关文章所有 8 个版本

[PDF] thecvf.com

Black-box unsupervised domain adaptation with bi-directional atkinson-shiffrin memory

J Zhang, J Huang, X Jiang, S Lu - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Black-box unsupervised domain adaptation (UDA) learns with source predictions of target
data without accessing either source data or source models during training, and it has clear …

被引用次数：8 相关文章所有 5 个版本

[PDF] thecvf.com

Squid: Deep feature in-painting for unsupervised anomaly detection

T Xiang, Y Zhang, Y Lu, AL Yuille… - Proceedings of the …, 2023 - openaccess.thecvf.com

Radiography imaging protocols focus on particular body regions, therefore producing
images of great similarity and yielding recurrent anatomical structures across patients. To …

被引用次数：33 相关文章所有 8 个版本

[PDF] arxiv.org

Pedestrian detection in low-light conditions: A comprehensive survey

B Ghari, A Tourani, A Shahbahrami… - Image and Vision …, 2024 - Elsevier

Pedestrian detection remains a critical problem in various domains, such as computer
vision, surveillance, and autonomous driving. In particular, accurate and instant detection of …

被引用次数：1 相关文章所有 6 个版本

Cromm-vsr: Cross-modal memory augmented visual speech recognition

M Kim, J Hong, SJ Park, YM Ro - IEEE Transactions on …, 2021 - ieeexplore.ieee.org

Visual Speech Recognition (VSR) is a task that recognizes speech from external
appearances of the face (, lips) into text. Since the information from the visual lip movements …

被引用次数：27 相关文章所有 3 个版本

OTP-NMS: toward optimal threshold prediction of NMS for crowded pedestrian detection

Y Tang, M Liu, B Li, Y Wang… - IEEE transactions on …, 2023 - ieeexplore.ieee.org

Pedestrian detection is still a challenging task for computer vision, especially in crowded
scenes where the overlaps between pedestrians tend to be large. The non-maximum …

被引用次数：10 相关文章所有 6 个版本

[PDF] aaai.org

Towards versatile pedestrian detector with multisensory-matching and multispectral recalling memory

JU Kim, S Park, YM Ro - Proceedings of the AAAI Conference on …, 2022 - ojs.aaai.org

Recently, automated surveillance cameras can change a visible sensor and a thermal
sensor for all-day operation. However, existing single-modal pedestrian detectors mainly …

被引用次数：20 相关文章所有 6 个版本

高级搜索

QQ 群