SeeSR: Towards semantics-aware real-world image super-resolution

R Wu, T Yang, L Sun, Z Zhang, S Li… - Proceedings of the …, 2024 - openaccess.thecvf.com
Owing to their powerful generative priors, pre-trained text-to-image (T2I) diffusion models
have become increasingly popular in solving the real-world image super-resolution …

YOLOv10 to its genesis: A decadal and comprehensive review of the You Only Look Once series

R Sapkota, R Qureshi, M Flores-Calero… - Available at SSRN …, 2024 - papers.ssrn.com
This review systematically examines the progression of the You Only Look Once (YOLO)
object detection algorithms from YOLOv1 to the recently unveiled YOLOv10. Employing a …

UniVS: Unified and universal video segmentation with prompts as queries

M Li, S Li, X Zhang, L Zhang - Proceedings of the IEEE/CVF …, 2024 - openaccess.thecvf.com
Despite the recent advances in unified image segmentation (IS), developing a unified video
segmentation (VS) model remains a challenge. This is mainly because generic category …

YOLOv10 to Its Genesis: A Decadal and Comprehensive Review of The You Only Look Once (YOLO) Series

R Sapkota, R Qureshi, MF Calero, C Badjugar… - arXiv preprint arXiv …, 2024 - arxiv.org
This review systematically examines the progression of the You Only Look Once (YOLO)
object detection algorithms from YOLOv1 to the recently unveiled YOLOv10. Employing a …

Dual memory networks: A versatile adaptation approach for vision-language models

Y Zhang, W Zhu, H Tang, Z Ma… - Proceedings of the …, 2024 - openaccess.thecvf.com
With the emergence of pre-trained vision-language models like CLIP, how to adapt them to
various downstream classification tasks has garnered significant attention in recent …

Position-based anchor optimization for point supervised dense nuclei detection

J Yao, L Han, G Guo, Z Zheng, R Cong, X Huang… - Neural Networks, 2024 - Elsevier
Nuclei detection is one of the most fundamental and challenging problems in
histopathological image analysis, which can localize nuclei to provide effective computer …

End-to-end semi-supervised approach with modulated object queries for table detection in documents

I Ehsan, T Shehzadi, D Stricker, MZ Afzal - International Journal on …, 2024 - Springer
Table detection, a pivotal task in document analysis, aims to precisely recognize and locate
tables within document images. Although deep learning has shown remarkable progress in …

Ranking-based adaptive query generation for DETRs in crowded pedestrian detection

F Gao, J Leng, J Gan, X Gao - Neurocomputing, 2025 - Elsevier
Variants of DEtection TRansformer (DETRs) have shown promising performance in
crowded pedestrian detection. However, we observe that DETRs are sensitive to the hyper …

RC-DETR: Improving DETRs in crowded pedestrian detection via rank-based contrastive learning

F Gao, J Leng, J Gan, X Gao - Neural Networks, 2025 - Elsevier
The variants of DEtection TRansformer (DETRs) have achieved impressive
performance in general object detection. However, they suffer notable performance …

Deep Omni-supervised Learning for Rib Fracture Detection from Chest Radiology Images

Z Chai, L Luo, H Lin, PA Heng… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Deep learning (DL)-based rib fracture detection has shown promise of playing an important
role in preventing mortality and improving patient outcome. Normally, developing DL-based …