[HTML][HTML] Pre-trained language models and their applications

H Wang, J Li, H Wu, E Hovy, Y Sun - Engineering, 2023 - Elsevier
Pre-trained language models have achieved striking success in natural language
processing (NLP), leading to a paradigm shift from supervised learning to pre-training …

Fingpt: Open-source financial large language models

H Yang, XY Liu, CD Wang - arXiv preprint arXiv:2306.06031, 2023 - arxiv.org
Large language models (LLMs) have shown the potential of revolutionizing natural
language processing tasks in diverse domains, sparking great interest in finance. Accessing …

Long time no see! open-domain conversation with long-term persona memory

X Xu, Z Gou, W Wu, ZY Niu, H Wu, H Wang… - arXiv preprint arXiv …, 2022 - arxiv.org
Most of the open-domain dialogue models tend to perform poorly in the setting of long-term
human-bot conversations. The possible reason is that they lack the capability of …

Godel: Large-scale pre-training for goal-directed dialog

B Peng, M Galley, P He, C Brockett, L Liden… - arXiv preprint arXiv …, 2022 - arxiv.org
We introduce GODEL (Grounded Open Dialogue Language Model), a large pre-trained
language model for dialog. In contrast with earlier models such as DialoGPT, GODEL …

Unified dialog model pre-training for task-oriented dialog understanding and generation

W He, Y Dai, M Yang, J Sun, F Huang, L Si… - Proceedings of the 45th …, 2022 - dl.acm.org
Recently, pre-training methods have shown remarkable success in task-oriented dialog
(TOD) systems. However, most existing pre-trained models for TOD focus on either dialog …

Eva2. 0: Investigating open-domain chinese dialogue systems with large-scale pre-training

Y Gu, J Wen, H Sun, Y Song, P Ke, C Zheng… - Machine Intelligence …, 2023 - Springer
Large-scale pre-training has shown remarkable performance in building open-domain
dialogue systems. However, previous works mainly focus on showing and evaluating the …

Stochastic rag: End-to-end retrieval-augmented generation through expected utility maximization

H Zamani, M Bendersky - Proceedings of the 47th International ACM …, 2024 - dl.acm.org
This paper introduces Stochastic RAG--a novel approach for end-to-end optimization of
retrieval-augmented generation (RAG) models that relaxes the simplifying assumptions of …

Deep learning applications in games: a survey from a data perspective

Z Hu, Y Ding, R Wu, L Li, R Zhang, Y Hu, F Qiu… - Applied …, 2023 - Springer
This paper presents a comprehensive review of deep learning applications in the video
game industry, focusing on how these techniques can be utilized in game development …

Survey of different large language model architectures: Trends, benchmarks, and challenges

M Shao, A Basit, R Karri, M Shafique - IEEE Access, 2024 - ieeexplore.ieee.org
Large Language Models (LLMs) represent a class of deep learning models adept at
understanding natural language and generating coherent responses to various prompts or …

Deep learning for dialogue systems: Chit-chat and beyond

R Yan, J Li, Z Yu - Foundations and Trends® in Information …, 2022 - nowpublishers.com
With the rapid progress of deep neural models and the explosion of available data
resources, dialogue systems that supports extensive topics and chit-chat conversations are …