Ntire 2024 challenge on image super-resolution (x4): Methods and results

Z Chen, Z Wu, E Zamfir, K Zhang… - Proceedings of the …, 2024 - openaccess.thecvf.com
This paper reviews the NTIRE 2024 challenge on image super-resolution (x4) highlighting
the solutions proposed and the outcomes obtained. The challenge involves generating …

NTIRE 2023 challenge on efficient super-resolution: Methods and results

Y Li, Y Zhang, R Timofte, L Van Gool… - Proceedings of the …, 2023 - openaccess.thecvf.com
This paper reviews the NTIRE 2023 challenge on efficient single-image super-resolution
with a focus on the proposed solutions and results. The aim of this challenge is to devise a …

YOLOv7: Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors

CY Wang, A Bochkovskiy… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Real-time object detection is one of the most important research topics in computer vision.
As new approaches regarding architecture optimization and training optimization are …

Scaling up your kernels to 31x31: Revisiting large kernel design in cnns

X Ding, X Zhang, J Han, G Ding - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
We revisit large kernel design in modern convolutional neural networks (CNNs). Inspired by
recent advances in vision transformers (ViTs), in this paper, we demonstrate that using a few …

Repvit: Revisiting mobile cnn from vit perspective

A Wang, H Chen, Z Lin, J Han… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Abstract Recently lightweight Vision Transformers (ViTs) demonstrate superior performance
and lower latency compared with lightweight Convolutional Neural Networks (CNNs) on …

Hrda: Context-aware high-resolution domain-adaptive semantic segmentation

L Hoyer, D Dai, L Van Gool - European conference on computer vision, 2022 - Springer
Unsupervised domain adaptation (UDA) aims to adapt a model trained on the source
domain (eg synthetic data) to the target domain (eg real-world data) without requiring further …

FastViT: A fast hybrid vision transformer using structural reparameterization

PKA Vasu, J Gabriel, J Zhu, O Tuzel… - Proceedings of the …, 2023 - openaccess.thecvf.com
The recent amalgamation of transformer and convolutional designs has led to steady
improvements in accuracy and efficiency of the models. In this work, we introduce FastViT, a …

Mobileone: An improved one millisecond mobile backbone

PKA Vasu, J Gabriel, J Zhu, O Tuzel… - Proceedings of the …, 2023 - openaccess.thecvf.com
Efficient neural network backbones for mobile devices are often optimized for metrics such
as FLOPs or parameter count. However, these metrics may not correlate well with latency of …

Repvgg: Making vgg-style convnets great again

X Ding, X Zhang, N Ma, J Han… - Proceedings of the …, 2021 - openaccess.thecvf.com
We present a simple but powerful architecture of convolutional neural network, which has a
VGG-like inference-time body composed of nothing but a stack of 3x3 convolution and …

Cylindrical and asymmetrical 3d convolution networks for lidar segmentation

X Zhu, H Zhou, T Wang, F Hong, Y Ma… - Proceedings of the …, 2021 - openaccess.thecvf.com
State-of-the-art methods for large-scale driving-scene LiDAR segmentation often project the
point clouds to 2D space and then process them via 2D convolution. Although this …