发明者
Vinaya Sathyanarayana, Peeta Basa Pati, Salaka Sivananda, TR Rajarajan
发表日期
2014/3/18
专利局
US
专利号
8676731
专利申请号
13180068
简介
(57) ABSTRACT A data extraction system for receiving and Scanning docu ments to generate ordered input for storage in a database employs a non-linear statistical model for a data extraction sequence having a plurality of transformations. Each trans formation transitions an extracted data value in various forms from a raw data image to a computed data value. For each transformation, a confidence model learns a confidence com ponent for the particular transformation. The learned confi dence components, generated from a control set of documents having known values, are employed in a production mode with actual raw data. The confidence component corresponds to a likelihood of transformation accuracy, and the confidence model aggregates the confidence components to compute a confidence for the extracted data value. A database stores the extracted data value labeled with the computed confidence …
引用总数
201420152016201720182019202020212022202320241122134353
学术搜索中的文章
V Sathyanarayana, PB Pati, S Sivananda… - US Patent 8,676,731, 2014