[PDF][PDF] A reinforcement learning-informed pattern mining framework for multivariate time series classification

G Gao, Q Gao, X Yang, M Pajic, M Chi - 31st International Joint …, 2022 - par.nsf.gov
Multivariate time series (MTS) classification is a challenging and important task in various
domains and real-world applications. Much of prior work on MTS can be roughly divided into …

Get a head start: On-demand pedagogical policy selection in intelligent tutoring

G Gao, X Yang, M Chi - Proceedings of the AAAI Conference on …, 2024 - ojs.aaai.org
Reinforcement learning (RL) is broadly employed in human-involved systems to enhance
human outcomes. Off-policy evaluation (OPE) has been pivotal for RL in those realms since …

Hope: Human-centric off-policy evaluation for e-learning and healthcare

G Gao, S Ju, MS Ausin, M Chi - arXiv preprint arXiv:2302.09212, 2023 - arxiv.org
Reinforcement learning (RL) has been extensively researched for enhancing human-
environment interactions in various human-centric tasks, including e-learning and …

On trajectory augmentations for off-policy evaluation

G Gao, Q Gao, X Yang, S Ju, M Pajic… - The Twelfth International …, 2024 - openreview.net
In the realm of reinforcement learning (RL), off-policy evaluation (OPE) holds a pivotal
position, especially in high-stake human-involved scenarios such as e-learning and …

[PDF][PDF] Hierarchical Apprenticeship Learning for Disease Progression Modeling.

X Yang, G Gao, M Chi - IJCAI, 2023 - ijcai.org
Disease progression modeling (DPM) plays an essential role in characterizing patients'
historical pathways and predicting their future risks. Apprenticeship learning (AL) aims to …

An offline time-aware apprenticeship learning framework for evolving reward functions

X Yang, G Gao, M Chi - arXiv preprint arXiv:2305.09070, 2023 - arxiv.org
Apprenticeship learning (AL) is a process of inducing effective decision-making policies via
observing and imitating experts' demonstrations. Most existing AL approaches, however, are …

TERTIAN: Clinical Endpoint Prediction in ICU via Time‐Aware Transformer‐Based Hierarchical Attention Network

Y An, Y Liu, X Chen, Y Sheng - Computational Intelligence and …, 2022 - Wiley Online Library
Accurately predicting the clinical endpoint in ICU based on the patient's electronic medical
records (EMRs) is essential for the timely treatment of critically ill patients and allocation of …

Optimizing IT FinOps and Sustainability through Unsupervised Workload Characterization

X Yang, RR Arora, S Jha, C Narayanaswami… - Proceedings of the …, 2024 - ojs.aaai.org
The widespread adoption of public and hybrid clouds, along with elastic resources and
various automation tools for dynamic deployment, has accelerated the rapid provisioning of …

Multi-temporal abstraction with time-aware deep q-learning for septic shock prevention

YJ Kim, MS Ausin, M Chi - … Conference on Big Data (Big Data), 2021 - ieeexplore.ieee.org
Sepsis is a life-threatening organ dysfunction and a disease of astronomical burden. Septic
shock, the most severe complication of sepsis, leads to a mortality rate as high as 50 …

Focusing on Driving Modes Rather Than Drivers: Toward More Precise and Efficient Car-Following Behavior Modeling

D Zhang, H Rao, J Wang, J Sun, L Yue - Applied Sciences, 2023 - mdpi.com
Car-following (CF) behavior is one of the most important driving behaviors. Accurately
understanding and modeling CF behavior is essential for traffic flow simulation and user …