Deep learning for sign language recognition: Current techniques, benchmarks, and open issues

M Al-Qurishi, T Khalid, R Souissi - IEEE Access, 2021 - ieeexplore.ieee.org
People with hearing impairments are found worldwide; therefore, the development of
effective local level sign language recognition (SLR) tools is essential. We conducted a …

Aggregating global features into local vision transformer

K Patel, AM Bur, F Li, G Wang - 2022 26th International …, 2022 - ieeexplore.ieee.org
Local Transformer-based classification models have recently achieved promising results
with relatively low computational costs. However, the effect of aggregating spatial global …

Contrastive token-wise meta-learning for unseen performer visual temporal-aligned translation

L Li, T Jin, X Cheng, Y Wang, W Lin… - Findings of the …, 2023 - aclanthology.org
Visual temporal-aligned translation aims to transform the visual sequence into natural
words, including important applicable tasks such as lipreading and fingerspelling …

Accumulated trivial attention matters in vision transformers on small datasets

X Chen, Q Hu, K Li, C Zhong… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Vision Transformers have demonstrated competitive performance on computer vision tasks
benefiting from their ability to capture long-range dependencies with multi-head self …

FuzzyNet: A fuzzy attention module for polyp segmentation

KB Patel, F Li, G Wang - NeurIPS'22 Workshop on All Things …, 2022 - drive.google.com
Polyp segmentation is essential for accelerating the diagnosis of colon cancer. However, it is
challenging because of the diverse color, texture, and varying lighting effects of the polyps …

Miti-DETR: Object detection based on transformers with mitigatory self-attention convergence

W Ma, T Zhang, G Wang - arXiv preprint arXiv:2112.13310, 2021 - arxiv.org
Object Detection with Transformers (DETR) and related works reach or even surpass the
highly-optimized Faster-RCNN baseline with self-attention network architectures. Inspired by …

FingerSpeller: Camera-Free Text Entry Using Smart Rings for American Sign Language Fingerspelling Recognition

D Martin, Z Leng, T Gemicioglu, J Womack… - Proceedings of the 25th …, 2023 - dl.acm.org
Camera-based text entry using American Sign Language (ASL) fingerspelling has become
more feasible due to recent advancements in recognition technology. However, there are …

Multimodal Continuous Fingerspelling Recognition via Visual Alignment Learning

K Papadimitriou, G Potamianos - Proceedings of the Interspeech, 2024 - isca-archive.org
Continuous fingerspelling recognition from videos is paramount for real-time sign language
(SL) interpretation, enhancing accessibility. Despite deep learning progress, challenges …

Fingerspelling PoseNet: Enhancing Fingerspelling Translation with Pose-Based Transformer Models

P Fayyazsanavi, N Nejatishahidin… - Proceedings of the …, 2024 - openaccess.thecvf.com
We address the task of American Sign Language fingerspelling translation using videos in
the wild. We exploit advances in more accurate hand pose estimation and propose a novel …

Machine Learning in ASL Fingerspelling Recognition: A Literature Review

J Pinnington, A Souag… - 2024 IEEE 24th …, 2024 - ieeexplore.ieee.org
The recognition of American Sign Language (ASL) fingerspelling through machine learning
has seen significant advancements over the past few years. This literature review explores …