Towards large-scale small object detection: Survey and benchmarks

G Cheng, X Yuan, X Yao, K Yan, Q Zeng… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
With the rise of deep convolutional neural networks, object detection has achieved
prominent advances in past years. However, such prosperity could not camouflage the …

Small object detection via coarse-to-fine proposal generation and imitation learning

X Yuan, G Cheng, K Yan, Q Zeng… - Proceedings of the …, 2023 - openaccess.thecvf.com
The past few years have witnessed the immense success of object detection, while current
excellent detectors struggle on tackling size-limited instances. Concretely, the well-known …

Deep learning serves traffic safety analysis: A forward‐looking review

A Razi, X Chen, H Li, H Wang, B Russo… - IET Intelligent …, 2023 - Wiley Online Library
This paper explores deep learning (DL) methods that are used or have the potential to be
used for traffic video analysis, emphasising driving safety for both autonomous vehicles and …

Synctalkface: Talking face generation with precise lip-syncing via audio-lip memory

SJ Park, M Kim, J Hong, J Choi, YM Ro - Proceedings of the AAAI …, 2022 - ojs.aaai.org
The challenge of talking face generation from speech lies in aligning two different modal
information, audio and video, such that the mouth region corresponds to input audio …

Black-box unsupervised domain adaptation with bi-directional atkinson-shiffrin memory

J Zhang, J Huang, X Jiang, S Lu - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Black-box unsupervised domain adaptation (UDA) learns with source predictions of target
data without accessing either source data or source models during training, and it has clear …

Squid: Deep feature in-painting for unsupervised anomaly detection

T Xiang, Y Zhang, Y Lu, AL Yuille… - Proceedings of the …, 2023 - openaccess.thecvf.com
Radiography imaging protocols focus on particular body regions, therefore producing
images of great similarity and yielding recurrent anatomical structures across patients. To …

Pedestrian detection in low-light conditions: A comprehensive survey

B Ghari, A Tourani, A Shahbahrami… - Image and Vision …, 2024 - Elsevier
Pedestrian detection remains a critical problem in various domains, such as computer
vision, surveillance, and autonomous driving. In particular, accurate and instant detection of …

Cromm-vsr: Cross-modal memory augmented visual speech recognition

M Kim, J Hong, SJ Park, YM Ro - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Visual Speech Recognition (VSR) is a task that recognizes speech from external
appearances of the face (, lips) into text. Since the information from the visual lip movements …

OTP-NMS: toward optimal threshold prediction of NMS for crowded pedestrian detection

Y Tang, M Liu, B Li, Y Wang… - IEEE transactions on …, 2023 - ieeexplore.ieee.org
Pedestrian detection is still a challenging task for computer vision, especially in crowded
scenes where the overlaps between pedestrians tend to be large. The non-maximum …

Towards versatile pedestrian detector with multisensory-matching and multispectral recalling memory

JU Kim, S Park, YM Ro - Proceedings of the AAAI Conference on …, 2022 - ojs.aaai.org
Recently, automated surveillance cameras can change a visible sensor and a thermal
sensor for all-day operation. However, existing single-modal pedestrian detectors mainly …