In this paper, we explore the question of whether large language models can support cost- efficient information extraction from tables. We introduce schema-driven information …
Z Zhang, B Yu, T Liu, T Liu, Y Wang, L Guo - Proceedings of the acm web …, 2023 - dl.acm.org
Extracting structured information from all manner of webpages is an important problem with the potential to automate many real-world applications. Recent work has shown the …
X Yao, Z Zhang, X Hu, J Yang, Y Guo… - Proceedings of the 17th …, 2024 - dl.acm.org
Ad hoc table retrieval refers to the task of performing semantic matching between given queries and candidate tables. In recent years, the approach to addressing this retrieval task …
We present OpenTriage, a system for extracting structured entities from detail Web pages of several sites and finding linkages between the extracted data. The system builds an …
MC Nunes, CF Dorneles - Proceedings of the 29th Brazilian Symposium …, 2023 - dl.acm.org
Extracting data from Web sites is still a challenge since pages have a complex and changeable structure, and the reason is simple: Web pages are designed to be visually user …
As with all the Ph. D. stories, my Ph. D. journey has its ups and downs, and will be a forever cherished memory for the rest of my life. First and foremost, I would like to give special …
Structured information extraction from the Web plays an important role in many large-scale automated systems, required to enable varying downstream applications. In this thesis, we …