RC Steorts - arXiv preprint arXiv:1409.0643, 2014 - scholar.archive.org
Databases often contain corrupted, degraded, and noisy data with duplicate entries across
and within each database. Such problems arise in citations, medical databases, genetics …