Why globally re-shuffle? Revisiting data shuffling in large scale deep learning TT Nguyen, F Trahay, J Domke, A Drozd, E Vatai, J Liao, M Wahib, ... 2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2022 | 30 | 2022 |
Scaling distributed deep learning workloads beyond the memory capacity with KARMA M Wahib, H Zhang, TT Nguyen, A Drozd, J Domke, L Zhang, R Takano, ... SC20: International Conference for High Performance Computing, Networking …, 2020 | 22 | 2020 |
Hierarchical distributed-memory multi-leader mpi-allreduce for deep learning workloads TT Nguyen, M Wahib, R Takano 2018 Sixth International Symposium on Computing and Networking Workshops …, 2018 | 19 | 2018 |
Efficient MPI‐AllReduce for large‐scale deep learning on GPU‐clusters T Thao Nguyen, M Wahib, R Takano Concurrency and Computation: Practice and Experience 33 (12), e5574, 2021 | 16 | 2021 |
Feddrl: Deep reinforcement learning-based adaptive aggregation for non-iid data in federated learning NH Nguyen, PL Nguyen, TD Nguyen, TT Nguyen, DL Nguyen, ... Proceedings of the 51st International Conference on Parallel Processing, 1-11, 2022 | 13 | 2022 |
An oracle for guiding large-scale model/hybrid parallel training of convolutional neural networks AN Kahira, TT Nguyen, LB Gomez, R Takano, RM Badia, M Wahib Proceedings of the 30th International Symposium on High-Performance Parallel …, 2021 | 11 | 2021 |
Spatial-temporal coverage maximization in vehicle-based mobile crowdsensing for air quality monitoring TAN Dinh, AD Nguyen, TT Nguyen, TH Nguyen, P Le Nguyen 2022 IEEE Wireless Communications and Networking Conference (WCNC), 1449-1454, 2022 | 10 | 2022 |
Distributed shortcut networks: Low-latency low-degree non-random topologies targeting the diameter and cable length trade-off NT Truong, I Fujiwara, M Koibuchi, KV Nguyen IEEE Transactions on Parallel and Distributed Systems 28 (4), 989-1001, 2016 | 9 | 2016 |
Hybrid electrical/optical switch architectures for training distributed deep learning in large-scale TN Truong, R Takano IEICE TRANSACTIONS on Information and Systems 104 (8), 1332-1339, 2021 | 8 | 2021 |
An Allreduce Algorithm and Network Co-design for Large-Scale Training of Distributed Deep Learning TT Nguyen, M Wahib 2021 IEEE/ACM 21st International Symposium on Cluster, Cloud and Internet …, 2021 | 8 | 2021 |
On the feasibility of hybrid electrical/optical switch architecture for large-scale training of distributed deep learning TT Nguyen, R Takano 2019 IEEE/ACM Workshop on Photonics-Optics Technology Oriented Networking …, 2019 | 8 | 2019 |
Topology-aware sparse allreduce for large-scale deep learning TT Nguyen, M Wahib, R Takano 2019 IEEE 38th International Performance Computing and Communications …, 2019 | 8 | 2019 |
Low-reliable low-latency networks optimized for HPC parallel applications TT Nguyen, H Matsutani, M Koibuchi 2018 IEEE 17th International Symposium on Network Computing and Applications …, 2018 | 8 | 2018 |
Fuzzy q-learning-based opportunistic communication for mec-enhanced vehicular crowdsensing TT Nguyen, TT Nguyen, TH Nguyen, P Le Nguyen IEEE Transactions on Network and Service Management 19 (4), 5021-5033, 2022 | 6 | 2022 |
Deep reinforcement learning-based offloading for latency minimization in 3-tier v2x networks H Dinh, NH Nguyen, TT Nguyen, TH Nguyen, TT Nguyen, P Le Nguyen 2022 IEEE Wireless Communications and Networking Conference (WCNC), 1803-1808, 2022 | 6 | 2022 |
An interconnection network exploiting trade-off between routing table size and path length TC Kieu, KV Nguyen, NT Truong, I Fujiwara, M Koibuchi 2016 Fourth International Symposium on Computing and Networking (CANDAR …, 2016 | 6 | 2016 |
FedDCT: Federated Learning of Large Convolutional Neural Networks on Resource Constrained Devices using Divide and Collaborative Training Q Nguyen, HH Pham, KS Wong, P Le Nguyen, TT Nguyen, MN Do IEEE Transactions on Network and Service Management, 2023 | 3 | 2023 |
Simeuro: A hybrid CPU-GPU parallel simulator for neuromorphic computing chips H Zhang, NM Ho, DY Polat, P Chen, M Wahib, TT Nguyen, J Meng, ... IEEE Transactions on Parallel and Distributed Systems 34 (10), 2767-2782, 2023 | 3 | 2023 |
Cadis: Handling cluster-skewed non-iid data in federated learning with clustered aggregation and knowledge distilled regularization NH Nguyen, DL Nguyen, TB Nguyen, TH Nguyen, HH Pham, TT Nguyen, ... 2023 IEEE/ACM 23rd International Symposium on Cluster, Cloud and Internet …, 2023 | 3 | 2023 |
Q-learning-based opportunistic communication for real-time mobile air quality monitoring systems TT Nguyen, TT Nguyen, TAN Dinh, TH Nguyen, P Le Nguyen 2021 IEEE International Performance, Computing, and Communications …, 2021 | 3 | 2021 |