Recent LLMs have demonstrated remarkable performance in solving exam-like math word problems. However, the degree to which these numerical reasoning skills are effective in …
To facilitate conversational question answering (CQA) over hybrid contexts in finance, we present a new dataset, named PACIFIC. Compared with existing CQA datasets, PACIFIC …
We introduce KnowledgeFMath, a novel benchmark designed to evaluate LLMs' capabilities in solving knowledge-intensive math reasoning problems. Compared to prior works, this …
F Lei, S He, X Li, J Zhao, K Liu - arXiv preprint arXiv:2209.07692, 2022 - arxiv.org
In the real-world question answering scenarios, hybrid form combining both tabular and textual contents has attracted more and more attention, among which numerical reasoning …
Document Visual Question Answering (VQA) aims to answer questions over visually-rich documents. In this work, we introduce a new Document VQA dataset, named TAT-DQA …
Y Wei, Y Su, H Ma, X Yu, F Lei, Y Zhang, J Zhao… - arXiv preprint arXiv …, 2023 - arxiv.org
Large language models (LLMs) have shown nearly saturated performance on many natural language processing (NLP) tasks. As a result, it is natural for people to believe that LLMs …
Reasoning over tabular data requires both table structure understanding and a broad set of table reasoning skills. Current models with table-specific architectures and pre-training …
Recent LLMs have demonstrated remarkable performance in solving exam-like math word problems. However, the degree to which these numerical reasoning skills are effective in …
In this work, we address question answering (QA) over a hybrid of tabular and textual data that are very common content on the Web (eg SEC filings), where discrete reasoning …