Inventors
Ameya Prabhu, Charles Dognin, Maneesh Kumar Singh
Publication date
2021/1/7
Patent office
US
Application number
16919898
Description
Machine learning systems and methods for evaluating sampling bias in deep active classification are provided. The system generates an acquisition function based on an uncertainty based query strategy. The system utilizes the Least Confidence and the Entropy uncertainty based query strategies. The system acquires at least one data sample from the input data based on the acquisition function. The input data can include, but is not limited to, large datasets widely utilized for text classification. The system labels the data sample via an oracle and generates a training dataset with the labeled data sample. The system generates a sequence of training datasets by sampling b queries from the input data, each of size K. The system evaluates an efficiency and bias of sample datasets obtained by different query strategies. The system also trains a network with the generated training dataset(s).
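The two uncertainty based query strategies named in the description can be illustrated with a minimal sketch. It is not taken from the patent: the function names (`least_confidence`, `entropy`, `acquire`), the sample probabilities, and the choice of natural logarithm are assumptions made for illustration. Least Confidence is taken here as one minus the highest predicted class probability, and Entropy as the Shannon entropy of the predicted class distribution; a query of size K then selects the K unlabeled samples with the highest score.

```python
import math


def least_confidence(probs):
    """Uncertainty score: 1 minus the highest predicted class probability."""
    return 1.0 - max(probs)


def entropy(probs):
    """Shannon entropy (natural log) of the predicted class distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)


def acquire(pool_probs, k, score_fn):
    """Return the indices of the k pool samples with the highest score."""
    ranked = sorted(
        range(len(pool_probs)),
        key=lambda i: score_fn(pool_probs[i]),
        reverse=True,
    )
    return ranked[:k]


# Hypothetical model outputs for four unlabeled samples over three classes.
pool_probs = [
    [0.98, 0.01, 0.01],
    [0.40, 0.35, 0.25],
    [0.70, 0.20, 0.10],
    [0.34, 0.33, 0.33],
]

# One query of size K = 2 under each strategy; the selected samples
# would then be sent to the oracle for labeling.
lc_picks = acquire(pool_probs, 2, least_confidence)
ent_picks = acquire(pool_probs, 2, entropy)
```

In this toy pool both strategies select the two samples whose predictions are closest to uniform. On real data the two rankings can differ, which is one reason the selected sets from different query strategies are worth comparing.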