Ceil: Generalized contextual imitation learning

J Liu, L He, Y Kang, Z Zhuang… - Advances in Neural …, 2023 - proceedings.neurips.cc
In this paper, we present ContExtual Imitation Learning (CEIL), a general and broadly
applicable algorithm for imitation learning (IL). Inspired by the formulation of hindsight …

Design from policies: Conservative test-time adaptation for offline policy optimization

J Liu, H Zhang, Z Zhuang, Y Kang… - Advances in Neural …, 2024 - proceedings.neurips.cc
In this work, we decouple the iterative bi-level offline RL (value estimation and policy
extraction) from the offline training phase, forming a non-iterative bi-level paradigm and …

Macro placement by wire-mask-guided black-box optimization

Y Shi, K Xue, S Lei, C Qian - Advances in Neural …, 2024 - proceedings.neurips.cc
The development of very large-scale integration (VLSI) technology has posed new
challenges for electronic design automation (EDA) techniques in chip floorplanning. During …

Beyond ood state actions: Supported cross-domain offline reinforcement learning

J Liu, Z Zhang, Z Wei, Z Zhuang, Y Kang… - Proceedings of the …, 2024 - ojs.aaai.org
Offline reinforcement learning (RL) aims to learn a policy using only pre-collected and fixed
data. Although avoiding the time-consuming online interactions in RL, it poses challenges …

Clue: Calibrated latent guidance for offline reinforcement learning

J Liu, L Zu, L He, D Wang - Conference on Robot Learning, 2023 - proceedings.mlr.press
Offline reinforcement learning (RL) aims to learn an optimal policy from pre-collected and
labeled datasets, which eliminates the time-consuming data collection in online RL …

Offline imitation learning with variational counterfactual reasoning

Z Sun, B He, J Liu, X Chen, C Ma… - Advances in Neural …, 2023 - proceedings.neurips.cc
In offline imitation learning (IL), an agent aims to learn an optimal expert behavior policy
without additional online environment interactions. However, in many real-world scenarios …

Reinforcement Learning within Tree Search for Fast Macro Placement

Z Geng, J Wang, Z Liu, S Xu, Z Tang… - … on Machine Learning, 2024 - openreview.net
Macro placement is a crucial step in modern chip design, and reinforcement learning (RL)
has recently emerged as a promising technique for improving the placement quality …

Addendum: A graph placement methodology for fast chip design

A Goldie, A Mirhoseini, M Yazgan, JW Jiang… - Nature, 2024 - nature.com
AlphaChip was one of the first RL methods deployed to solve a real-world engineering
problem, and its publication triggered an explosion of work on AI for chip design 2–16 …

Benchmarking end-to-end performance of ai-based chip placement algorithms

Z Wang, Z Geng, Z Tu, J Wang, Y Qian, Z Xu… - arXiv preprint arXiv …, 2024 - arxiv.org
The increasing complexity of modern very-large-scale integration (VLSI) design highlights
the significance of Electronic Design Automation (EDA) technologies. Chip placement is a …

Hardware phi-1.5 b: A large language model encodes hardware domain specific knowledge

W Fu, S Li, Y Zhao, H Ma, R Dutta… - 2024 29th Asia and …, 2024 - ieeexplore.ieee.org
In the rapidly evolving semiconductor industry, where research, design, verification, and
manufacturing are intricately linked, the potential of Large Language Models to revolutionize …