Local Transformer-based classification models have recently achieved promising results with relatively low computational costs. However, the effect of aggregating spatial global …
L Li, T Jin, X Cheng, Y Wang, W Lin… - Findings of the …, 2023 - aclanthology.org
Visual temporal-aligned translation aims to transform the visual sequence into natural words, including important applicable tasks such as lipreading and fingerspelling …
X Chen, Q Hu, K Li, C Zhong… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Vision Transformers has demonstrated competitive performance on computer vision tasks benefiting from their ability to capture long-range dependencies with multi-head self …
KB Patel, F Li, G Wang - NeurIPS'22 Workshop on All Things …, 2022 - drive.google.com
Polyp segmentation is essential for accelerating the diagnosis of colon cancer. However, it is challenging because of the diverse color, texture, and varying lighting effects of the polyps …
W Ma, T Zhang, G Wang - arXiv preprint arXiv:2112.13310, 2021 - arxiv.org
Object Detection with Transformers (DETR) and related works reach or even surpass the highly-optimized Faster-RCNN baseline with self-attention network architectures. Inspired by …
Camera-based text entry using American Sign Language (ASL) fingerspelling has become more feasible due to recent advancements in recognition technology. However, there are …
Continuous fingerspelling recognition from videos is paramount for real-time sign language (SL) interpretation, enhancing accessibility. Despite deep learning progress, challenges …
We address the task of American Sign Language fingerspelling translation using videos in the wild. We exploit advances in more accurate hand pose estimation and propose a novel …
The recognition of American Sign Language (ASL) fingerspelling through machine learning has seen significant advancements over the past few years. This literature review explores …