With the widespread adoption of deep learning (DL) applications in recent years, training DL models has become increasingly prevalent. Nevertheless, training these models is typically …
HD TIAN, MZ ZHANG, R CHANG - ZTE technology journal, 2024 - zte.com.cn
Achieving efficient training has become one of the key factors affecting the popularization of large model applications. The main technologies of efficient training of large models are …
Y Wang, Y Liu, Z Chen - 2024 4th International Conference on …, 2024 - ieeexplore.ieee.org
With the development of emerging technologies such as cloud computing and large AI models (such as LLM), many applications have placed higher demands on the intensive …
S Lin, Q Yang, Z Yang, Y Wang, S Zhao - 2024 - jhc.sjtu.edu.cn
Recent years have witnessed a wide adoption of Remote Direct Memory Access (RDMA) to accelerate distributed systems. As the scale of distributed applications keeps increasing …
Deep neural networks (DNNs) have achieved unparalleled performance in numerous fields, including computer vision, natural language processing, and recommendation systems …
Training large language models (LLMs) demands increasingly larger datasets for optimal performance [13]. In practice, these datasets may include hundreds of terabytes (TB) or even …