Recent advances in natural language processing, primarily propelled by Large Language Models (LLMs), have showcased their remarkable capabilities grounded in in-context …
Tensor networks (TNs) and neural networks (NNs) are two fundamental data modeling approaches. TNs were introduced to solve the curse of dimensionality in large-scale tensors …
Tensor network (TN) has demonstrated remarkable efficacy in the compact representation of high-order data. In contrast to the TN methods with pre-determined structures, the recently …
J Xiong, C Li, M Yang, X Hu, B Hu - … of the 45th International ACM SIGIR …, 2022 - dl.acm.org
Math Word Problems (MWP) aims to automatically solve mathematical questions given in texts. Previous studies tend to design complex models to capture additional information in …
P Zhen, X Yan, W Wang, H Wei… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Deep learning-based action recognition has become ubiquitous in the video analysis area; however, large neural networks require enormous computations to achieve high …
The growing interest in satellite imagery has triggered the need for efficient mechanisms to extract valuable information from these vast data sources, providing deeper insights. Even …
Z Wu, Z Xu, D Zeng, Q Wang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Federated learning (FL) has surged in prominence due to its capability of collaborative model training without direct data sharing. However, the vast disparity in local data …
X Ou, Z Chen, C Zhu, Y Liu - Journal of Systems Engineering …, 2023 - ieeexplore.ieee.org
Deep neural networks (DNNs) have achieved great success in many data processing applications. However, high computational complexity and storage cost make deep learning …
Dense linear layers are the dominant computational bottleneck in foundation models. Identifying more efficient alternatives to dense matrices has enormous potential for building …