observing and imitating experts' demonstrations. Most existing AL approaches, however, are
not designed to cope with the evolving reward functions commonly found in human-centric
tasks such as healthcare, where offline learning is required. In this paper, we propose an
offline Time-aware Hierarchical EM Energy-based Sub-trajectory (THEMES) AL framework
to tackle the evolving reward functions in such tasks. The effectiveness of THEMES is …