Table pre-training: A survey on model architectures, pre-training objectives, and downstream tasks

H Dong, Z Cheng, X He, M Zhou, A Zhou… - arXiv preprint arXiv …, 2022 - arxiv.org
Since a vast number of tables can be easily collected from web pages, spreadsheets, PDFs,
and various other document types, a flurry of table pre-training frameworks have been …

MultiHiertt: Numerical reasoning over multi hierarchical tabular and textual data

Y Zhao, Y Li, C Li, R Zhang - arXiv preprint arXiv:2206.01347, 2022 - arxiv.org
Numerical reasoning over hybrid data containing both textual and tabular content (e.g.,
financial reports) has recently attracted much attention in the NLP community. However …

A survey on llm-generated text detection: Necessity, methods, and future directions

J Wu, S Yang, R Zhan, Y Yuan, DF Wong… - arXiv preprint arXiv …, 2023 - arxiv.org
The powerful ability to understand, follow, and generate complex language emerging from
large language models (LLMs) makes LLM-generated text flood many areas of our daily …

Large language models on tabular data--a survey

X Fang, W Xu, F Anting Tan, J Zhang, Z Hu… - arXiv e …, 2024 - ui.adsabs.harvard.edu
Recent breakthroughs in large language modeling have facilitated rigorous exploration of
their application in diverse tasks related to tabular data modeling, such as prediction, tabular …

Hitab: A hierarchical table dataset for question answering and natural language generation

Z Cheng, H Dong, Z Wang, R Jia, J Guo, Y Gao… - arXiv preprint arXiv …, 2021 - arxiv.org
Tables are often created with hierarchies, but existing works on table reasoning mainly focus
on flat tables and neglect hierarchical tables. Hierarchical tables challenge existing methods …

Towards explainable evaluation metrics for machine translation

C Leiter, P Lertvittayakumjorn, M Fomicheva… - Journal of Machine …, 2024 - jmlr.org
Unlike classical lexical overlap metrics such as BLEU, most current evaluation metrics for
machine translation (for example, COMET or BERTScore) are based on black-box large …

Coherence boosting: When your pretrained language model is not paying enough attention

N Malkin, Z Wang, N Jojic - arXiv preprint arXiv:2110.08294, 2021 - arxiv.org
Long-range semantic coherence remains a challenge in automatic language generation
and understanding. We demonstrate that large language models have insufficiently learned …

Transformers go for the LOLs: Generating (humourous) titles from scientific abstracts end-to-end

Y Chen, S Eger - arXiv preprint arXiv:2212.10522, 2022 - arxiv.org
We consider the end-to-end abstract-to-title generation problem, exploring seven recent
transformer-based models (including ChatGPT) fine-tuned on more than 30k abstract-title …

Effective distillation of table-based reasoning ability from llms

B Yang, C Tang, K Zhao, C Xiao, C Lin - arXiv preprint arXiv:2309.13182, 2023 - arxiv.org
Large Language Models (LLMs) have demonstrated remarkable performance across a wide
range of natural language processing tasks. However, their remarkable parameter size and …

Scitab: A challenging benchmark for compositional reasoning and claim verification on scientific tables

X Lu, L Pan, Q Liu, P Nakov, MY Kan - arXiv preprint arXiv:2305.13186, 2023 - arxiv.org
Current scientific fact-checking benchmarks exhibit several shortcomings, such as biases
arising from crowd-sourced claims and an over-reliance on text-based evidence. We present …