USB: A unified semi-supervised learning benchmark for classification

Y Wang, H Chen, Y Fan, W Sun… - Advances in …, 2022 - proceedings.neurips.cc
Semi-supervised learning (SSL) improves model generalization by leveraging massive
unlabeled data to augment limited labeled samples. However, currently, popular SSL …

PandaLM: An automatic evaluation benchmark for LLM instruction tuning optimization

Y Wang, Z Yu, Z Zeng, L Yang, C Wang, H Chen… - arXiv preprint arXiv …, 2023 - arxiv.org
Instruction tuning large language models (LLMs) remains a challenging task, owing to the
complexity of hyperparameter selection and the difficulty involved in evaluating the tuned …

Supervised knowledge makes large language models better in-context learners

L Yang, S Zhang, Z Yu, G Bao, Y Wang, J Wang… - arXiv preprint arXiv …, 2023 - arxiv.org
Large Language Models (LLMs) exhibit emerging in-context learning abilities through
prompt engineering. The recent progress in large-scale generative models has further …

NovelQA: A benchmark for long-range novel question answering

C Wang, R Ning, B Pan, T Wu, Q Guo, C Deng… - arXiv preprint arXiv …, 2024 - arxiv.org
The rapid advancement of Large Language Models (LLMs) has introduced a new frontier in
natural language processing, particularly in understanding and processing long-context …

A false emotion opinion target extraction model with two stage BERT and background information fusion

ZY Hou, YJ Du, QZ Li, XY Li, XL Chen… - Expert Systems with …, 2024 - Elsevier
As social media has gradually become an indispensable part of people's life, more and
more users begin to express their opinions on social media. These opinions contain rich …