A fine-grained visual attention approach for fingerspelling recognition in the wild

M Al-Qurishi, T Khalid, R Souissi - IEEE Access, 2021 - ieeexplore.ieee.org

People with hearing impairments are found worldwide; therefore, the development of
effective local level sign language recognition (SLR) tools is essential. We conducted a …

被引用次数：123 相关文章所有 3 个版本

[PDF] arxiv.org

Aggregating global features into local vision transformer

K Patel, AM Bur, F Li, G Wang - 2022 26th International …, 2022 - ieeexplore.ieee.org

Local Transformer-based classification models have recently achieved promising results
with relatively low computational costs. However, the effect of aggregating spatial global …

被引用次数：47 相关文章所有 6 个版本

[PDF] aclanthology.org

Contrastive token-wise meta-learning for unseen performer visual temporal-aligned translation

L Li, T Jin, X Cheng, Y Wang, W Lin… - Findings of the …, 2023 - aclanthology.org

Visual temporal-aligned translation aims to transform the visual sequence into natural
words, including important applicable tasks such as lipreading and fingerspelling …

被引用次数：6 相关文章

[PDF] thecvf.com

Accumulated trivial attention matters in vision transformers on small datasets

X Chen, Q Hu, K Li, C Zhong… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Vision Transformers has demonstrated competitive performance on computer vision tasks
benefiting from their ability to capture long-range dependencies with multi-head self …

被引用次数：13 相关文章所有 5 个版本

[PDF] google.com

[PDF][PDF] Fuzzynet: A fuzzy attention module for polyp segmentation

KB Patel, F Li, G Wang - NeurIPS'22 Workshop on All Things …, 2022 - drive.google.com

Polyp segmentation is essential for accelerating the diagnosis of colon cancer. However, it is
challenging because of the diverse color, texture, and varying lighting effects of the polyps …

被引用次数：13 相关文章所有 2 个版本

[PDF] arxiv.org

Miti-detr: Object detection based on transformers with mitigatory self-attention convergence

W Ma, T Zhang, G Wang - arXiv preprint arXiv:2112.13310, 2021 - arxiv.org

Object Detection with Transformers (DETR) and related works reach or even surpass the
highly-optimized Faster-RCNN baseline with self-attention network architectures. Inspired by …

被引用次数：19 相关文章所有 3 个版本

[PDF] researchgate.net

FingerSpeller: Camera-Free Text Entry Using Smart Rings for American Sign Language Fingerspelling Recognition

D Martin, Z Leng, T Gemicioglu, J Womack… - Proceedings of the 25th …, 2023 - dl.acm.org

Camera-based text entry using American Sign Language (ASL) fingerspelling has become
more feasible due to recent advancements in recognition technology. However, there are …

被引用次数：3 相关文章所有 2 个版本

[PDF] isca-archive.org

[PDF][PDF] Multimodal Continuous Fingerspelling Recognition via Visual Alignment Learning

K Papadimitriou, G Potamianos - Proceedings of the Interspeech, 2024 - isca-archive.org

Continuous fingerspelling recognition from videos is paramount for real-time sign language
(SL) interpretation, enhancing accessibility. Despite deep learning progress, challenges …

被引用次数：1 相关文章所有 2 个版本

[PDF] thecvf.com

Fingerspelling PoseNet: Enhancing Fingerspelling Translation with Pose-Based Transformer Models

P Fayyazsanavi, N Nejatishahidin… - Proceedings of the …, 2024 - openaccess.thecvf.com

We address the task of American Sign Language fingerspelling translation using videos in
the wild. We exploit advances in more accurate hand pose estimation and propose a novel …

被引用次数：2 相关文章所有 5 个版本

Machine Learning in ASL Fingerspelling Recognition: A Literature Review

J Pinnington, A Souag… - 2024 IEEE 24th …, 2024 - ieeexplore.ieee.org

The recognition of American Sign Language (ASL) fingerspelling through machine learning
has seen significant advancements over the past few years. This literature review explores …

高级搜索

QQ 群