Flattened butterfly: a cost-efficient topology for high-radix networks

J Kim, WJ Dally, D Abts - Proceedings of the 34th annual international …, 2007 - dl.acm.org
Increasing integrated-circuit pin bandwidth has motivateda corresponding increase in the
degree or radix of interconnection networksand their routers. This paper introduces the …

On application-level approaches to avoiding TCP throughput collapse in cluster-based storage systems

E Krevat, V Vasudevan, A Phanishayee… - Proceedings of the 2nd …, 2007 - dl.acm.org
TCP Incast plagues scalable cluster-based storage built atop standard TCP/IP-over-
Ethernet, often resulting in much lower client read bandwidth than can be provided by the …

Job scheduling and data replication on data grids

RS Chang, JS Chang, SY Lin - Future Generation Computer Systems, 2007 - Elsevier
In data grids, many distributed scientific and engineering applications often require access
to a large amount of data (terabytes or petabytes). Data access time depends on bandwidth …

Testing network-on-chip communication fabrics

C Grecu, A Ivanov, R Saleh… - IEEE Transactions on …, 2007 - ieeexplore.ieee.org
Network-on-chip (NoC) communication fabrics will be increasingly used in many large
multicore system-on-chip designs in the near future. A relevant challenge that arises from …

Tightly-coupled multi-layer topologies for 3-D NoCs

H Matsutani, M Koibuchi… - … Conference on Parallel …, 2007 - ieeexplore.ieee.org
Three-dimensional network-on-chip (3-D NoC) is an emerging research topic exploring the
network architecture of 3-D ICs that stack several smaller wafers for reducing wire length …

Node-disjoint paths in hierarchical hypercube networks

RY Wu, GH Chen, YL Kuo, GJ Chang - Information Sciences, 2007 - Elsevier
The hierarchical hypercube network is suitable for massively parallel systems. One of its
appealing properties is the low number of connections per processor, which can facilitate …

[图书][B] Handbook of parallel computing: models, algorithms and applications

S Rajasekaran, J Reif - 2007 - books.google.com
The ability of parallel computing to process large data sets and handle time-consuming
operations has resulted in unprecedented advances in biological and scientific computing …

Performance, cost, and energy evaluation of fat h-tree: A cost-efficient tree-based on-chip network

H Matsutani, M Koibuchi… - 2007 IEEE International …, 2007 - ieeexplore.ieee.org
Fat H-Tree is a novel tree-based interconnection network providing a torus structure, which
is formed by combining two folded H-Tree networks, and is an attractive alternative to tree …

Hardware supported multicast in fat-tree-based InfiniBand networks

J Zhou, XY Lin, YC Chung - The Journal of Supercomputing, 2007 - Springer
The multicast operation is a very commonly used operation in parallel applications. It can be
used to implement many collective communication operations as well. Therefore, its …

Investigating solution convergence in a global ocean model using a 2048‐processor cluster of distributed shared memory machines

C Hill, D Menemenlis, B Ciotti… - Scientific …, 2007 - Wiley Online Library
Up to 1920 processors of a cluster of distributed shared memory machines at the NASA
Ames Research Center are being used to simulate ocean circulation globally at horizontal …