作者
Rongye Shi, Peter Steenkiste, Manuela Veloso
发表日期
2017/12/5
研讨会论文
Proceedings of the Fourth IEEE/ACM International Conference on Big Data Computing, Applications and Technologies
页码范围
255-264
出版商
ACM
简介
Automated data collection in urban transportation systems produces a large volume of passenger data. However, quite a few of the data are still incomplete, limiting the insight into passenger mobility. The unavailability of destination information in entry-only passenger data is a very common issue. Traditional approaches for estimating passenger destinations rely on heuristics that can recover only some of the missing destinations. To deal with the remaining incomplete data, this paper, for the first time, proposes a second-order inference methodology to leverage semi-supervised self-training to infer the missing destinations. The methodology involves the design of a base learner to predict the missing destinations based on the statistics of a selected similarity-based "training set", and the design of a selection strategy to select new data with high prediction confidence to update the training set. To further improve the …
引用总数
201820192020202120222023202413111
学术搜索中的文章