SQL database with information from unstructured data sources including emails, webpages,
and PDF reports. KBC is a long-standing problem in industry and research that encompasses
problems of data extraction, cleaning, and integration. We describe DeepDive, a system that
combines database and machine learning ideas to help develop KBC systems. The central idea
in DeepDive is that statistical inference and machine learning are key tools to attack …