作者
Danielle S Bitterman, Eli Goldner, Sean Finan, David Harris, Eric B Durbin, Harry Hochheiser, Jeremy L Warner, Raymond H Mak, Timothy Miller, Guergana K Savova
发表日期
2023/9/1
期刊
International Journal of Radiation Oncology* Biology* Physics
卷号
117
期号
1
页码范围
262-273
出版商
Elsevier
简介
Purpose
Real-world evidence for radiation therapy (RT) is limited because it is often documented only in the clinical narrative. We developed a natural language processing system for automated extraction of detailed RT events from text to support clinical phenotyping.
Methods and Materials
A multi-institutional data set of 96 clinician notes, 129 North American Association of Central Cancer Registries cancer abstracts, and 270 RT prescriptions from HemOnc.org was used and divided into train, development, and test sets. Documents were annotated for RT events and associated properties: dose, fraction frequency, fraction number, date, treatment site, and boost. Named entity recognition models for properties were developed by fine-tuning BioClinicalBERT and RoBERTa transformer models. A multiclass RoBERTa-based relation extraction model was developed to link each dose mention with each property in the …
引用总数
学术搜索中的文章
DS Bitterman, E Goldner, S Finan, D Harris, EB Durbin… - International Journal of Radiation Oncology* Biology …, 2023