An adaptive placement and parallelism framework for accelerating rlhf training

J Duan, S Zhang, Z Wang, L Jiang, W Qu, Q Hu… - arXiv preprint arXiv …, 2024 - arxiv.org

Large Language Models (LLMs) like GPT and LLaMA are revolutionizing the AI industry with
their sophisticated capabilities. Training these models requires vast GPU clusters and …

被引用次数：5 相关文章所有 5 个版本

[PDF] arxiv.org

Hybridflow: A flexible and efficient rlhf framework

G Sheng, C Zhang, Z Ye, X Wu, W Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org

Reinforcement Learning from Human Feedback (RLHF) is widely used in Large Language
Model (LLM) alignment. Traditional RL can be modeled as a dataflow, where each node …

被引用次数：1 相关文章所有 4 个版本

[PDF] arxiv.org

AntDT: A Self-Adaptive Distributed Training Framework for Leader and Straggler Nodes

Y Xiao, L Ju, Z Zhou, S Li, Z Huan, D Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org

Many distributed training techniques like Parameter Server and AllReduce have been
proposed to take advantage of the increasingly large data and rich features. However …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

Understanding and Alleviating Memory Consumption in RLHF for LLMs

J Zhou, H Yang, M Xiang, H Guan, T Liu - arXiv preprint arXiv …, 2024 - arxiv.org

Fine-tuning with Reinforcement Learning with Human Feedback (RLHF) is essential for
aligning large language models (LLMs). However, RLHF often encounters significant …

[PDF][PDF] Dynamic Fast Device Placement Strategies for Real-Time Resource Allocation

H Zhang, Z Chen, XLY Liu, J Wu, L Wang - researchgate.net

Large-scale distributed systems are increasingly reliant on efficient resource allocation to
meet the demands of real-time applications. However, the challenges of maintaining low …

[PDF] researchgate.net

[PDF][PDF] Leveraging Machine Learning Techniques for Efficient Fast Device Placement

S Volkov, J Wang, D Ivanov, A Petrov, J Smith, D Zhao - researchgate.net

Modern device placement poses significant challenges due to the increasing complexity of
environments and user demands. Our study introduces a method that leverages advanced …

[引用][C] Adaptive Placement Algorithms for Energy-Efficient ML Graph Execution

Y Shen, M Wang, Y Liu, M Johnson, J Smith, E Brown

高级搜索

QQ 群