查看文章

aaai.org 中的 [PDF]

Maximizing the Probability of Arriving on Time: A Practical Q-Learning Method

作者

Zhiguang Cao, Hongliang Guo, Jie Zhang, Frans Oliehoek, Ulrich Fastenrath

发表日期

2017

研讨会论文

31th AAAI Conference on Artificial Intelligence (AAAI)

页码范围

4481-4487

简介

The stochastic shortest path problem is of crucial importance for the development of sustainable transportation systems. Existing methods based on the probability tail model seek for the path that maximizes the probability of arriving at the destination before a deadline. However, they suffer from low accuracy and/or high computational cost. We design a novel Q-learning method where the converged Q-values have the practical meaning as the actual probabilities of arriving on time so as to improve accuracy. By further adopting dynamic neural networks to learn the value function, our method can scale well to large road networks with arbitrary deadlines. Experimental results on real road networks demonstrate the significant advantages of our method over other counterparts.

引用总数

被引用次数：57

201720182019202020212022202320248 3 12 11 9 7 4 1

学术搜索中的文章

Maximizing the probability of arriving on time: A practical q-learning method

Z Cao, H Guo, J Zhang, F Oliehoek, U Fastenrath - Proceedings of the AAAI conference on artificial …, 2017

被引用次数：57 相关文章所有 12 个版本