- 学术资源搜索

[HTML][HTML] A comprehensive review of yolo architectures in computer vision: From yolov1 to yolov8 and yolo-nas

J Terven, DM Córdova-Esparza… - Machine Learning and …, 2023 - mdpi.com

YOLO has become a central real-time object detection system for robotics, driverless cars,
and video monitoring applications. We present a comprehensive analysis of YOLO's …

被引用次数：921 相关文章所有 6 个版本

[PDF] arxiv.org

A comprehensive survey on pretrained foundation models: A history from bert to chatgpt

C Zhou, Q Li, C Li, J Yu, Y Liu, G Wang… - arXiv preprint arXiv …, 2023 - arxiv.org

Pretrained Foundation Models (PFMs) are regarded as the foundation for various
downstream tasks with different data modalities. A PFM (eg, BERT, ChatGPT, and GPT-4) is …

被引用次数：415 相关文章所有 2 个版本

[PDF] neurips.cc

Segnext: Rethinking convolutional attention design for semantic segmentation

MH Guo, CZ Lu, Q Hou, Z Liu… - Advances in Neural …, 2022 - proceedings.neurips.cc

We present SegNeXt, a simple convolutional network architecture for semantic
segmentation. Recent transformer-based models have dominated the field of se-mantic …

被引用次数：431 相关文章所有 6 个版本

[PDF] neurips.cc

Convolutions die hard: Open-vocabulary segmentation with single frozen convolutional clip

Q Yu, J He, X Deng, X Shen… - Advances in Neural …, 2024 - proceedings.neurips.cc

Open-vocabulary segmentation is a challenging task requiring segmenting and recognizing
objects from an open set of categories in diverse environments. One way to address this …

被引用次数：69 相关文章所有 5 个版本

[PDF] thecvf.com

Oneformer: One transformer to rule universal image segmentation

J Jain, J Li, MT Chiu, A Hassani… - Proceedings of the …, 2023 - openaccess.thecvf.com

Abstract Universal Image Segmentation is not a new concept. Past attempts to unify image
segmentation include scene parsing, panoptic segmentation, and, more recently, new …

被引用次数：222 相关文章所有 8 个版本

[PDF] nature.com

SLEAP: A deep learning system for multi-animal pose tracking

TD Pereira, N Tabris, A Matsliah, DM Turner, J Li… - Nature …, 2022 - nature.com

The desire to understand how the brain generates and patterns behavior has driven rapid
methodological innovation in tools to quantify natural animal behavior. While advances in …

被引用次数：315 相关文章所有 10 个版本

[PDF] springer.com

Visual attention network

MH Guo, CZ Lu, ZN Liu, MM Cheng, SM Hu - Computational Visual Media, 2023 - Springer

While originally designed for natural language processing tasks, the self-attention
mechanism has recently taken various computer vision areas by storm. However, the 2D …

被引用次数：544 相关文章所有 8 个版本

[PDF] thecvf.com

PIDNet: A real-time semantic segmentation network inspired by PID controllers

J Xu, Z Xiong, SP Bhattacharyya - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Two-branch network architecture has shown its efficiency and effectiveness in real-time
semantic segmentation tasks. However, direct fusion of high-resolution details and low …

被引用次数：220 相关文章所有 8 个版本

[PDF] thecvf.com

Daformer: Improving network architectures and training strategies for domain-adaptive semantic segmentation

L Hoyer, D Dai, L Van Gool - Proceedings of the IEEE/CVF …, 2022 - openaccess.thecvf.com

As acquiring pixel-wise annotations of real-world images for semantic segmentation is a
costly process, a model can instead be trained with more accessible synthetic data and …

被引用次数：413 相关文章所有 9 个版本

Review the state-of-the-art technologies of semantic segmentation based on deep learning

Y Mo, Y Wu, X Yang, F Liu, Y Liao - Neurocomputing, 2022 - Elsevier

The goal of semantic segmentation is to segment the input image according to semantic
information and predict the semantic category of each pixel from a given label set. With the …

被引用次数：349 相关文章所有 3 个版本

高级搜索

QQ 群

[HTML][HTML] A comprehensive review of yolo architectures in computer vision: From yolov1 to yolov8 and yolo-nas

A comprehensive survey on pretrained foundation models: A history from bert to chatgpt

Segnext: Rethinking convolutional attention design for semantic segmentation

Convolutions die hard: Open-vocabulary segmentation with single frozen convolutional clip

Oneformer: One transformer to rule universal image segmentation

SLEAP: A deep learning system for multi-animal pose tracking

Visual attention network

PIDNet: A real-time semantic segmentation network inspired by PID controllers

Daformer: Improving network architectures and training strategies for domain-adaptive semantic segmentation

Review the state-of-the-art technologies of semantic segmentation based on deep learning

引用