The memory-bounded speedup model and its impacts in computing

XH Sun, X Lu - Journal of Computer Science and Technology, 2023 - Springer
With the surge of big data applications and the worsening of the memory-wall problem, the
memory system, instead of the computing unit, becomes the commonly recognized major …

Dlio: A data-centric benchmark for scientific deep learning applications

H Devarajan, H Zheng, A Kougkas… - 2021 IEEE/ACM 21st …, 2021 - ieeexplore.ieee.org
Deep learning has been shown as a successful method for various tasks, and its popularity
results in numerous open-source deep learning software tools. Deep learning has been …

Characterizing machine learning i/o workloads on leadership scale hpc systems

AK Paul, AM Karimi, F Wang - 2021 29th International …, 2021 - ieeexplore.ieee.org
High performance computing (HPC) is no longer solely limited to traditional workloads such
as simulation and modeling. With the increase in the popularity of machine learning (ML) …

Study on tiered storage algorithm based on heat correlation of astronomical data

XC Ye, HL Zhang, J Wang, YZ Zhang, X Du… - Frontiers in Astronomy …, 2024 - frontiersin.org
With the surge in astronomical data volume, modern astronomical research faces significant
challenges in data storage, processing, and access. The I/O bottleneck issue in astronomical …

Data Flow Lifecycles for Optimizing Workflow Coordination

H Lee, L Guo, M Tang, J Firoz, N Tallent… - Proceedings of the …, 2023 - dl.acm.org
A critical performance challenge in distributed scientific workflows is coordinating tasks and
data flows on distributed resources. To guide these decisions, this paper introduces data …

HRFP: Highly Relevant Frequent Patterns-Based Prefetching and Caching Algorithms for Distributed File Systems

A Nalajala, T Ragunathan, R Naha, SK Battula - Electronics, 2023 - mdpi.com
Data-intensive applications are generating massive amounts of data which is stored on
cloud computing platforms where distributed file systems are utilized for storage at the back …

I/O performance analysis of machine learning workloads on leadership scale supercomputer

AM Karimi, AK Paul, F Wang - Performance Evaluation, 2022 - Elsevier
The popularity of machine learning technologies and frameworks has led to an increasingly
large number of machine learning workloads running on high-performance computing …

Application and user-specific data prefetching and parallel read algorithms for distributed file systems

A Nalajala, T Ragunathan, R Naha, SK Battula - Cluster Computing, 2024 - Springer
Cloud computing systems are widely used to deploy big data-based applications because of
their high storage and computation capacity. The key component for storage in cloud …

Efficient Prefetching and Client-Side Caching Algorithms for Improving the Performance of Read Operations in Distributed File Systems

A Nalajala, T Ragunathan, R Naha - IEEE Access, 2022 - ieeexplore.ieee.org
Modern web applications are deployed in cloud computing systems because they support
unlimited storage and computing power. One of the main back-end storage components of …

Distributed file systembased optimization algorithm

UL Soundharya, G Vadivu, GK Chaitanya - Wireless Networks, 2024 - Springer
Database engines and file systems have been using prefetching and caching technologies
for decades to enhance the performance of I/O-intensive applications. When future data …