The AI gambit: leveraging artificial intelligence to combat climate change—opportunities, challenges, and recommendations

J Cowls, A Tsamados, M Taddeo, L Floridi - Ai & Society, 2023 - Springer
In this article, we analyse the role that artificial intelligence (AI) could play, and is playing, to
combat global climate change. We identify two crucial opportunities that AI offers in this …

Ds-transunet: Dual swin transformer u-net for medical image segmentation

A Lin, B Chen, J Xu, Z Zhang, G Lu… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Automatic medical image segmentation has made great progress owing to powerful deep
representation learning. Inspired by the success of self-attention mechanism in transformer …

Multiscale vision transformers

H Fan, B Xiong, K Mangalam, Y Li… - Proceedings of the …, 2021 - openaccess.thecvf.com
Abstract We present Multiscale Vision Transformers (MViT) for video and image recognition,
by connecting the seminal idea of multiscale feature hierarchies with transformer models …

Crossvit: Cross-attention multi-scale vision transformer for image classification

CFR Chen, Q Fan, R Panda - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com
The recently developed vision transformer (ViT) has achieved promising results on image
classification compared to convolutional neural networks. Inspired by this, in this paper, we …

[HTML][HTML] Comparison of CNN-based deep learning architectures for rice diseases classification

MT Ahad, Y Li, B Song, T Bhuiyan - Artificial Intelligence in Agriculture, 2023 - Elsevier
Although convolutional neural network (CNN) paradigms have expanded to transfer
learning and ensemble models from original individual CNN architectures, few studies have …

[PDF][PDF] The computational limits of deep learning

NC Thompson, K Greenewald, K Lee… - arXiv preprint arXiv …, 2020 - assets.pubpub.org
Deep learning's recent history has been one of achievement: from triumphing over humans
in the game of Go to world-leading performance in image classification, voice recognition …

Movinets: Mobile video networks for efficient video recognition

D Kondratyuk, L Yuan, Y Li, L Zhang… - Proceedings of the …, 2021 - openaccess.thecvf.com
Abstract We present Mobile Video Networks (MoViNets), a family of computation and
memory efficient video networks that can operate on streaming video for online inference …

Res2net: A new multi-scale backbone architecture

SH Gao, MM Cheng, K Zhao, XY Zhang… - IEEE transactions on …, 2019 - ieeexplore.ieee.org
Representing features at multiple scales is of great importance for numerous vision tasks.
Recent advances in backbone convolutional neural networks (CNNs) continually …

Drop an octave: Reducing spatial redundancy in convolutional neural networks with octave convolution

Y Chen, H Fan, B Xu, Z Yan… - Proceedings of the …, 2019 - openaccess.thecvf.com
In natural images, information is conveyed at different frequencies where higher frequencies
are usually encoded with fine details and lower frequencies are usually encoded with global …

Mood: Multi-level out-of-distribution detection

Z Lin, SD Roy, Y Li - … of the IEEE/CVF conference on …, 2021 - openaccess.thecvf.com
Abstract Out-of-distribution (OOD) detection is essential to prevent anomalous inputs from
causing a model to fail during deployment. While improved OOD detection methods have …