Reinventing 2d convolutions for 3d images

J Yang, X Huang, Y He, J Xu, C Yang… - IEEE Journal of …, 2021 - ieeexplore.ieee.org
There have been considerable debates over 2D and 3D representation learning on 3D
medical images. 2D approaches could benefit from large-scale 2D pretraining, whereas they …

3d depthwise convolution: Reducing model parameters in 3d vision tasks

R Ye, F Liu, L Zhang - Advances in Artificial Intelligence: 32nd Canadian …, 2019 - Springer
Standard 3D convolution operations usually require larger amounts of memory and
computation cost than 2D convolution operations. The fact increases the difficulty of the …

Resource efficient 3d convolutional neural networks

O Kopuklu, N Kose, A Gunduz… - Proceedings of the …, 2019 - openaccess.thecvf.com
Recently, convolutional neural networks with 3D kernels (3D CNNs) have been very popular
in computer vision community as a result of their superior ability of extracting spatio-temporal …

[PDF][PDF] Scaling up kernels in 3d cnns

Y Chen, J Liu, X Qi, X Zhang, J Sun, J Jia - arXiv preprint arXiv …, 2022 - arxiv.org
Recent advances in 2D CNNs and vision transformers (ViTs) reveal that large kernels are
essential for enough receptive fields and high performance. Inspired by this literature, we …

Convolutional networks with oriented 1d kernels

A Kirchmeyer, J Deng - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
In computer vision, 2D convolution is arguably the most important operation performed by a
ConvNet. Unsurprisingly, it has been the focus of intense software and hardware …

Learning from 2d: Contrastive pixel-to-point knowledge transfer for 3d pretraining

YC Liu, YK Huang, HY Chiang, HT Su, ZY Liu… - arXiv preprint arXiv …, 2021 - arxiv.org
Most 3D neural networks are trained from scratch owing to the lack of large-scale labeled 3D
datasets. In this paper, we present a novel 3D pretraining method by leveraging 2D …

QTTNet: Quantized tensor train neural networks for 3D object and video recognition

D Lee, D Wang, Y Yang, L Deng, G Zhao, G Li - Neural Networks, 2021 - Elsevier
Relying on the rapidly increasing capacity of computing clusters and hardware,
convolutional neural networks (CNNs) have been successfully applied in various fields and …

T3d: Towards 3d medical image understanding through vision-language pre-training

C Liu, C Ouyang, Y Chen, CC Quilodrán-Casas… - arXiv preprint arXiv …, 2023 - arxiv.org
Expert annotation of 3D medical image for downstream analysis is resource-intensive,
posing challenges in clinical applications. Visual self-supervised learning (vSSL), though …

Med3d: Transfer learning for 3d medical image analysis

S Chen, K Ma, Y Zheng - arXiv preprint arXiv:1904.00625, 2019 - arxiv.org
The performance on deep learning is significantly affected by volume of training data.
Models pre-trained from massive dataset such as ImageNet become a powerful weapon for …

[HTML][HTML] 3D deep learning on medical images: a review

SP Singh, L Wang, S Gupta, H Goli, P Padmanabhan… - Sensors, 2020 - mdpi.com
The rapid advancements in machine learning, graphics processing technologies and the
availability of medical imaging data have led to a rapid increase in the use of deep learning …