Efficiency optimization of large-scale language models based on deep learning in natural language processing tasks

T Mei, Y Zi, X Cheng, Z Gao, Q Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
The internal structure and operation mechanism of large-scale language models are
analyzed theoretically, especially how Transformer and its derivative architectures can …

Optimizing Search Advertising Strategies: Integrating Reinforcement Learning with Generalized Second-Price Auctions for Enhanced Ad Ranking and Bidding

C Zhou, Y Zhao, J Cao, Y Shen, J Gao, X Cui… - arXiv preprint arXiv …, 2024 - arxiv.org
This paper explores the integration of strategic optimization methods in search advertising,
focusing on ad ranking and bidding mechanisms within E-commerce platforms. By …

Research on image classification and semantic segmentation model based on convolutional neural network

M Li, Z Zhu, R Xu, Y Feng, L Xiao - Journal of Computing and Electronic …, 2024 - drpress.org
This paper investigates convolutional neural network (CNN)-based approaches for image
classification and semantic segmentation, with a focus on addressing spatial detail loss and …

Multi-scale image recognition strategy based on convolutional neural network

H Zhang, S Diao, Y Yang, J Zhong, Y Yan - Journal of Computing and …, 2024 - drpress.org
The accurate recognition and interpretation of multi-scale visual information is a critical focus
within contemporary computer vision research. To this end, this study explores and …

Make Scale Invariant Feature Transform “Fly” with CUDA

Y Mo, C Tan, C Wang, H Qin… - International …, 2024 - ijemr.vandanapublications.com
This paper introduces an implementation of scale invariant feature transform (SIFT)
algorithm with CUDA. Primary steps including building the Gaussian pyramid and the …

Enhance Image-to-Image Generation with LLaVA Prompt and Negative Prompt

Z Ding, P Li, Q Yang, S Li - arXiv preprint arXiv:2406.01956, 2024 - arxiv.org
This paper presents a novel approach to enhance image-to-image generation by leveraging
the multimodal capabilities of the Large Language and Vision Assistant (LLaVA). We …

Lidar and Monocular Sensor Fusion Depth Estimation

S He, Y Zhu, Y Dong, H Qin… - Applied Science and …, 2024 - asejar.singhpublication.com
In this project, we present a novel approach to depth perception using a monocular camera
by incorporating information from both RGB and LiDAR modalities. Our primary objective is …

Automatic News Generation and Fact-Checking System Based on Language Processing

X Peng, Q Xu, Z Feng, H Zhao, L Tan, Y Zhou… - arXiv preprint arXiv …, 2024 - arxiv.org
This paper explores an automatic news generation and fact-checking system based on
language processing, aimed at enhancing the efficiency and quality of news production …

Deep Learning-Based Lung Medical Image Recognition

S Chai, X Fei, Y Wang, L Dai… - International Journal of …, 2024 - ijircst.irpublications.org
Pulmonary nodules serve as critical indicators for early lung cancer diagnosis, making their
detection and classification essential. The prevalent use of transfer learning in recognition …

Predict Click-Through Rates with Deep Interest Network Model in E-commerce Advertising

C Zhou, Y Zou, Y Zhao, J Cao, W Fan, Y Zhao… - arXiv preprint arXiv …, 2024 - techrxiv.org
This paper proposes new methods to enhance clickthrough rate (CTR) prediction models
using the Deep Interest Network (DIN) model, specifically applied to the advertising system …