Acnet: Strengthening the kernel skeletons for powerful cnn via asymmetric convolution blocks

Z Chen, Z Wu, E Zamfir, K Zhang… - Proceedings of the …, 2024 - openaccess.thecvf.com

This paper reviews the NTIRE 2024 challenge on image super-resolution (x4) highlighting
the solutions proposed and the outcomes obtained. The challenge involves generating …

被引用次数：53 相关文章所有 3 个版本

[PDF] thecvf.com

NTIRE 2023 challenge on efficient super-resolution: Methods and results

Y Li, Y Zhang, R Timofte, L Van Gool… - Proceedings of the …, 2023 - openaccess.thecvf.com

This paper reviews the NTIRE 2023 challenge on efficient single-image super-resolution
with a focus on the proposed solutions and results. The aim of this challenge is to devise a …

被引用次数：117 相关文章所有 12 个版本

[PDF] thecvf.com

YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

CY Wang, A Bochkovskiy… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Real-time object detection is one of the most important research topics in computer vision.
As new approaches regarding architecture optimization and training optimization are …

被引用次数：6201 相关文章所有 10 个版本

[PDF] thecvf.com

Scaling up your kernels to 31x31: Revisiting large kernel design in cnns

X Ding, X Zhang, J Han, G Ding - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com

We revisit large kernel design in modern convolutional neural networks (CNNs). Inspired by
recent advances in vision transformers (ViTs), in this paper, we demonstrate that using a few …

被引用次数：771 相关文章所有 10 个版本

[PDF] thecvf.com

Repvit: Revisiting mobile cnn from vit perspective

A Wang, H Chen, Z Lin, J Han… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com

Abstract Recently lightweight Vision Transformers (ViTs) demonstrate superior performance
and lower latency compared with lightweight Convolutional Neural Networks (CNNs) on …

被引用次数：45 相关文章所有 4 个版本

[PDF] arxiv.org

Hrda: Context-aware high-resolution domain-adaptive semantic segmentation

L Hoyer, D Dai, L Van Gool - European conference on computer vision, 2022 - Springer

Unsupervised domain adaptation (UDA) aims to adapt a model trained on the source
domain (eg synthetic data) to the target domain (eg real-world data) without requiring further …

被引用次数：178 相关文章所有 11 个版本

[PDF] thecvf.com

FastViT: A fast hybrid vision transformer using structural reparameterization

PKA Vasu, J Gabriel, J Zhu, O Tuzel… - Proceedings of the …, 2023 - openaccess.thecvf.com

The recent amalgamation of transformer and convolutional designs has led to steady
improvements in accuracy and efficiency of the models. In this work, we introduce FastViT, a …

被引用次数：76 相关文章所有 5 个版本

[PDF] thecvf.com

Mobileone: An improved one millisecond mobile backbone

PKA Vasu, J Gabriel, J Zhu, O Tuzel… - Proceedings of the …, 2023 - openaccess.thecvf.com

Efficient neural network backbones for mobile devices are often optimized for metrics such
as FLOPs or parameter count. However, these metrics may not correlate well with latency of …

被引用次数：118 相关文章所有 4 个版本

[PDF] thecvf.com

Repvgg: Making vgg-style convnets great again

X Ding, X Zhang, N Ma, J Han… - Proceedings of the …, 2021 - openaccess.thecvf.com

We present a simple but powerful architecture of convolutional neural network, which has a
VGG-like inference-time body composed of nothing but a stack of 3x3 convolution and …

被引用次数：1684 相关文章所有 15 个版本

[PDF] thecvf.com

Cylindrical and asymmetrical 3d convolution networks for lidar segmentation

X Zhu, H Zhou, T Wang, F Hong, Y Ma… - Proceedings of the …, 2021 - openaccess.thecvf.com

State-of-the-art methods for large-scale driving-scene LiDAR segmentation often project the
point clouds to 2D space and then process them via 2D convolution. Although this …

被引用次数：518 相关文章所有 5 个版本

高级搜索

QQ 群