Pre-trained language models for text generation: A survey

J Li, T Tang, WX Zhao, JY Nie, JR Wen - ACM Computing Surveys, 2024 - dl.acm.org
Text Generation aims to produce plausible and readable text in human language from input
data. The resurgence of deep learning has greatly advanced this field, in particular, with the …

An updated survey of efficient hardware architectures for accelerating deep convolutional neural networks

M Capra, B Bussolino, A Marchisio, M Shafique… - Future Internet, 2020 - mdpi.com
Deep Neural Networks (DNNs) are nowadays a common practice in most of the Artificial
Intelligence (AI) applications. Their ability to go beyond human precision has made these …

From turing to transformers: A comprehensive review and tutorial on the evolution and applications of generative transformer models

EY Zhang, AD Cheok, Z Pan, J Cai, Y Yan - Sci, 2023 - mdpi.com
In recent years, generative transformers have become increasingly prevalent in the field of
artificial intelligence, especially within the scope of natural language processing. This paper …

A mathematical investigation of hallucination and creativity in GPT models

M Lee - Mathematics, 2023 - mdpi.com
In this paper, we present a comprehensive mathematical analysis of the hallucination
phenomenon in generative pretrained transformer (GPT) models. We rigorously define and …

From word embeddings to pre-trained language models: A state-of-the-art walkthrough

M Mars - Applied Sciences, 2022 - mdpi.com
With the recent advances in deep learning, different approaches to improving pre-trained
language models (PLMs) have been proposed. PLMs have advanced state-of-the-art …

End-to-end transformer-based models in textual-based NLP

A Rahali, MA Akhloufi - AI, 2023 - mdpi.com
Transformer architectures are highly expressive because they use self-attention
mechanisms to encode long-range dependencies in the input sequences. In this paper, we …

Social media toxicity classification using deep learning: real-world application UK brexit

H Fan, W Du, A Dahou, AA Ewees, D Yousri, MA Elaziz… - Electronics, 2021 - mdpi.com
Social media has become an essential facet of modern society, wherein people share their
opinions on a wide variety of topics. Social media is quickly becoming indispensable for a …

A new approach to web application security: Utilizing gpt language models for source code inspection

Z Szabó, V Bilicki - Future Internet, 2023 - mdpi.com
Due to the proliferation of large language models (LLMs) and their widespread use in
applications such as ChatGPT, there has been a significant increase in interest in AI over the …

The new version of the ANDDigest tool with improved AI-based short names recognition

TV Ivanisenko, PS Demenkov, NA Kolchanov… - International Journal of …, 2022 - mdpi.com
The body of scientific literature continues to grow annually. Over 1.5 million abstracts of
biomedical publications were added to the PubMed database in 2021. Therefore …

[HTML][HTML] Searching images for consensus: can AI remove observer variability in pathology?

HR Tizhoosh, P Diamandis, CJV Campbell… - The American journal of …, 2021 - Elsevier
One of the major obstacles in reaching diagnostic consensus is observer variability. With the
recent success of artificial intelligence, particularly the deep networks, the question emerges …