作者
Thomas Weikert, Ivan Nesic, Joshy Cyriac, Jens Bremerich, Alexander W Sauter, Gregor Sommer, Bram Stieltjes
发表日期
2020/4/1
期刊
European journal of radiology
卷号
125
页码范围
108862
出版商
Elsevier
简介
Purpose
To design and evaluate a self-trainable natural language processing (NLP)-based procedure to classify unstructured radiology reports. The method enabling the generation of curated datasets is exemplified on CT pulmonary angiogram (CTPA) reports.
Method
We extracted the impressions of CTPA reports created at our institution from 2016 to 2018 (n = 4397; language: German). The status (pulmonary embolism: yes/no) was manually labelled for all exams. Data from 2016/2017 (n = 2801) served as a ground truth to train three NLP architectures that only require a subset of reference datasets for training to be operative. The three architectures were as follows: a convolutional neural network (CNN), a support vector machine (SVM) and a random forest (RF) classifier. Impressions of 2018 (n = 1377) were kept aside and used for general performance measurements. Furthermore, we investigated the …
引用总数
2020202120222023202423623