MAP: A visual analytics system for job monitoring and analysis

A Pal, P Malakar - 2020 IEEE International Conference on …, 2020 - ieeexplore.ieee.org
A Pal, P Malakar
2020 IEEE International Conference on Cluster Computing (CLUSTER), 2020ieeexplore.ieee.org
High-performance computing systems are used for compute-intensive jobs by multiple
users. They submit jobs to batch queues where the jobs are queued for an unknown amount
of time until the required resources are available. A large amount of data is collected by the
resource managers regarding the jobs (submit time, start time, end time, resource
requirements, etc.). Analyzing this data may help identify causes of problems that may have
occurred in the past and better optimize the system. Analyzing complex and huge logs may …
High-performance computing systems are used for compute-intensive jobs by multiple users. They submit jobs to batch queues where the jobs are queued for an unknown amount of time until the required resources are available. A large amount of data is collected by the resource managers regarding the jobs (submit time, start time, end time, resource requirements, etc.). Analyzing this data may help identify causes of problems that may have occurred in the past and better optimize the system. Analyzing complex and huge logs may be cumbersome. We have developed a unified job monitoring, analysis, and prediction system using which users can monitor current state, analyze past job logs, and predict wait-times of future jobs. In this paper, we have focused on the job monitoring and analysis modules.
ieeexplore.ieee.org
以上显示的是最相近的搜索结果。 查看全部搜索结果