time series of aggregate communication volume A i, j (t)> 0 between cells i and j, for all pairs
of cells in a city over a month. We develop a methodology for predicting the future (in
particular whether two cells will talk to each other A i, j (t)> 0) based on past activity. Our data
set is sparse, with 80% of the values being zero, which makes prediction challenging. We
formulate the problem as binary classification and, using decision trees and random forests …