Z Jiang, H Lin, Y Zhong, Q Huang, Y Chen… - … USENIX Symposium on …, 2024 - usenix.org
We present the design, implementation and engineering experience in building and deploying MegaScale, a production system for training large language models (LLMs) at the …
Intra-host networks, including heterogeneous devices and interconnect fabrics, have become increasingly complex and crucial. However, intra-host networks today do not …
R Zhuang, J Han, K Xue, J Li, Q Sun… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
With the development of datacenter networks (DCNs) towards high bandwidth and low latency, the demands of high-level datacenter applications are heading towards high …
Z Jiang, H Lin, Y Zhong, Q Huang, Y Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
We present the design, implementation and engineering experience in building and deploying MegaScale, a production system for training large language models (LLMs) at the …