Topologies of reasoning: Demystifying chains, trees, and graphs of thoughts

M Besta, F Memedi, Z Zhang, R Gerstenberger… - arXiv preprint arXiv …, 2024 - arxiv.org
The field of natural language processing (NLP) has witnessed significant progress in recent
years, with a notable focus on improving large language models'(LLM) performance through …

A Heterogeneous Chiplet Architecture for Accelerating End-to-End Transformer Models

H Sharma, P Dhingra, JR Doppa, U Ogras… - arXiv preprint arXiv …, 2023 - arxiv.org
Transformers have revolutionized deep learning and generative modeling, enabling
unprecedented advancements in natural language processing tasks. However, the size of …

The Survey of Chiplet-based Integrated Architecture: An EDA perspective

S Chen, H Zhang, Z Ling, J Zhai, B Yu - arXiv preprint arXiv:2411.04410, 2024 - arxiv.org
Enhancing performance while reducing costs is the fundamental design philosophy of
integrated circuits (ICs). With advancements in packaging technology, interposer-based …

A high scalability memory noc with shared-inside hierarchical-groupings for triplet-based many-core architecture

C Li, F Shi, F Yin, K Soliman, J Wei - ACM Transactions on Architecture …, 2024 - dl.acm.org
Innovative processor architecture designs are shifting towards Many-Core Architectures
(MCAs) to meet the future demands of high-performance computing as the limits of Moore's …

NxtSPR: A deadlock-free shortest path routing dedicated to relaying for Triplet-Based many-core Architecture

C Li, K Soliman, F Yin, J Wei, F Shi - Parallel Computing, 2024 - Elsevier
Deadlock-free routing is a significant challenge in Network-on-Chip (NoC) design as it
affects the network's latency, power consumption, and load balance, impacting the …

Low-Depth Spatial Tree Algorithms

Y Baumann, T Ben-Nun, M Besta, L Gianinazzi… - arXiv preprint arXiv …, 2024 - arxiv.org
Contemporary accelerator designs exhibit a high degree of spatial localization, wherein two-
dimensional physical distance determines communication costs between processing …

Switch-Less Dragonfly on Wafers: A Scalable Interconnection Architecture based on Wafer-Scale Integration

Y Feng, K Ma - SC24: International Conference for High …, 2024 - ieeexplore.ieee.org
Existing high-performance computing (HPC) interconnection architectures are based on
high-radix switches, which limits the injection/local performance and introduces …

RapidChiplet: A Toolchain for Rapid Design Space Exploration of Chiplet Architectures

P Iff, B Bruggmann, M Besta, L Benini… - arXiv preprint arXiv …, 2023 - arxiv.org
Chiplet architectures are a promising paradigm to overcome the scaling challenges of
monolithic chips. Chiplets offer heterogeneity, modularity, and cost-effectiveness. The …

[PDF][PDF] Demystifying Chains, Trees, and Graphs of Thoughts

M Besta, F Memedi, Z Zhang… - arXiv preprint arXiv …, 2024 - aegjcef.unixer.de
The field of natural language processing (NLP) has witnessed significant progress in recent
years, with a notable focus on improving large language models'(LLM) performance through …

Ring Road: A Scalable Polar-Coordinate-based 2D Network-on-Chip Architecture

Y Feng, W Li, K Ma - 2024 57th IEEE/ACM International …, 2024 - ieeexplore.ieee.org
Networks-on-chip (NoCs) are scaled out to build large-scale multi-chip networks to meet the
growing demand for computing. However, the traditional router-based NoC architecture has …