minimizing the manual labelling effort while achieving good-quality predictions. To
accomplish this, we devise a two-stage technique that employs hierarchical clustering using
a combination of graph clustering (community finding) and topic modelling as first stage,
followed by either another round of hierarchical clustering or an active learning approach as
second stage. We evaluate the performance of our method in terms of manual labelling …