overlap in the papers they cite. We introduce a new clustering algorithm, Streemer, which
finds cohesive foreground clusters embedded in a diffuse background, and use it to identify
knowledge communities as foreground clusters of papers which share common citations. To
analyze the evolution of these communities over time, we build predictive models with
features based on the citation structure, the vocabulary of the papers, and the affiliations and …