Splitwise: Efficient generative LLM inference using phase splitting

P Patel, E Choukse, C Zhang, A Shah, Í Goiri… - Power, 2023 - homes.cs.washington.edu
Generative large language model (LLM) applications are growing rapidly, leading to large-
scale deployments of expensive and power-hungry GPUs. Our characterization of LLM …

Hybrid Heterogeneous Clusters Can Lower the Energy Consumption of LLM Inference Workloads

G Wilkins, S Keshav, R Mortier - Proceedings of the 15th ACM …, 2024 - dl.acm.org
Both the training and use of Large Language Models (LLMs) require large amounts of
energy. Their increasing popularity, therefore, raises critical concerns regarding the energy …

An agile pathway towards carbon-aware clouds

P Patel, T Gregersen, T Anderson - … of the 2nd Workshop on Sustainable …, 2023 - dl.acm.org
Climate change is a pressing threat to planetary well-being that can be addressed only by
rapid near-term actions across all sectors. Yet, the cloud computing sector, with its …
