Reducing negative effects of the biases of language models in zero-shot setting

X Wang, Y Xiong, B Kang, Y Zhang, PS Yu… - Proceedings of the …, 2023 - dl.acm.org
Pre-trained language models (PLMs) such as GPTs have been revealed to be biased
towards certain target classes because of the prompt and the model's intrinsic biases. In …

Schema-driven information extraction from heterogeneous tables

F Bai, J Kang, G Stanovsky, D Freitag, M Dredze… - arXiv preprint arXiv …, 2023 - arxiv.org
In this paper, we explore the question of whether large language models can support cost-
efficient information extraction from tables. We introduce schema-driven information …

Learning structural co-occurrences for structured web data extraction in low-resource settings

Z Zhang, B Yu, T Liu, T Liu, Y Wang, L Guo - Proceedings of the acm web …, 2023 - dl.acm.org
Extracting structured information from all manner of webpages is an important problem with
the potential to automate many real-world applications. Recent work has shown the …

COTER: Conditional Optimal Transport meets Table Retrieval

X Yao, Z Zhang, X Hu, J Yang, Y Guo… - Proceedings of the 17th …, 2024 - dl.acm.org
Ad hoc table retrieval refers to the task of performing semantic matching between given
queries and candidate tables. In recent years, the approach to addressing this retrieval task …

[PDF][PDF] OpenTRIAGE: Entity Linkage for Detail Webpages.

R Voyat, V Crescenzi, P Merialdo - SEBD, 2022 - ceur-ws.org
We present OpenTriage, a system for extracting structured entities from detail Web pages of
several sites and finding linkages between the extracted data. The system builds an …

EDREW-Enhanced Data Representation for Extraction in Web

MC Nunes, CF Dorneles - Proceedings of the 29th Brazilian Symposium …, 2023 - dl.acm.org
Extracting data from Web sites is still a challenge since pages have a complex and
changeable structure, and the reason is simple: Web pages are designed to be visually user …

[PDF][PDF] Information Extraction on Scientific Literature under Limited Supervision

F Bai - 2023 - bflashcp3f.github.io
As with all the Ph. D. stories, my Ph. D. journey has its ups and downs, and will be a forever
cherished memory for the rest of my life. First and foremost, I would like to give special …

[PDF][PDF] Extracting Structured Web Content using Deep Neural Language Models

G Hendriksen, A de Vries, J Dalton, F Hasibi - 2022 - cs.ru.nl
Structured information extraction from the Web plays an important role in many large-scale
automated systems, required to enable varying downstream applications. In this thesis, we …

[引用][C] NAEWI-Non-rendering Approach to Extract Web Information

MC Nunes, CF Dorneles - Anais Estendidos do XXXVII Simpósio Brasileiro de …, 2022 - SBC