A survey on Vision Mamba: Models, applications and challenges

R Xu, S Yang, Y Wang, B Du, H Chen - arXiv preprint arXiv:2404.18861, 2024 - arxiv.org
Mamba, a recent selective structured state space model, performs excellently on long
sequence modeling tasks. Mamba mitigates the modeling constraints of convolutional …

ZigMa: A DiT-style zigzag Mamba diffusion model

VT Hu, SA Baumann, M Gui, O Grebenkova… - … on Computer Vision, 2024 - Springer
The diffusion model has long been plagued by scalability and quadratic complexity issues,
especially within transformer-based structures. In this study, we aim to leverage the long …

A survey of Mamba

H Qu, L Ning, R An, W Fan, T Derr, H Liu, X Xu… - arXiv preprint arXiv …, 2024 - arxiv.org
As one of the most representative DL techniques, Transformer architecture has empowered
numerous advanced models, especially the large language models (LLMs) that comprise …

Mamba in vision: A comprehensive survey of techniques and applications

MM Rahman, AA Tutul, A Nath, L Laishram… - arXiv preprint arXiv …, 2024 - arxiv.org
Mamba is emerging as a novel approach to overcome the challenges faced by
Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) in computer vision …

KMM: Key Frame Mask Mamba for Extended Motion Generation

Z Zhang, H Gao, A Liu, Q Chen, F Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
Human motion generation is a cutting-edge area of research in generative computer vision, with
promising applications in video creation, game development, and robotic manipulation. The …

3DET-Mamba: Causal Sequence Modelling for End-to-End 3D Object Detection

M Li, J Yuan, S Chen, L Zhang, A Zhu, X Chen… - The Thirty-eighth Annual … - openreview.net
Transformer-based architectures have been proven successful in detecting 3D objects from
point clouds. However, the quadratic complexity of the attention mechanism struggles to …