Dynamic large language models on blockchains

Y Gong - arXiv preprint arXiv:2307.10549, 2023 - arxiv.org
Training and deploying the large language models requires a large mount of computational
resource because the language models contain billions of parameters and the text has …

Multilevel large language models for everyone

Y Gong - arXiv preprint arXiv:2307.13221, 2023 - arxiv.org
Large language models have made significant progress in the past few years. However,
they are either generic {\it or} field specific, splitting the community into different groups. In …

Prediction of yield surface of single crystal copper from discrete dislocation dynamics and geometric learning

WR Jian, M Xiao, WC Sun, W Cai - Journal of the Mechanics and Physics of …, 2024 - Elsevier
The yield surface of a material is a criterion at which macroscopic plastic deformation
begins. For crystalline solids, plastic deformation occurs through the motion of dislocations …

Gradient Domain Diffusion Models for Image Synthesis

Y Gong - arXiv preprint arXiv:2309.01875, 2023 - arxiv.org
Diffusion models are getting popular in generative image and video synthesis. However,
due to the diffusion process, they require a large number of steps to converge. To tackle this …

Adaptive weighted total variation expansion and Gaussian curvature guided low-dose CT image denoising network

Z Li, Y Liu, P Zhang, J Lu, S Ren, Z Gui - Biomedical Signal Processing and …, 2024 - Elsevier
The denoising task of low-dose CT images is a highly complex and uncertain inverse
problem. Previous studies have primarily relied on convolutional neural network to reduce …

STL: A Signed and Truncated Logarithm Activation Function for Neural Networks

Y Gong - arXiv preprint arXiv:2307.16389, 2023 - arxiv.org
Activation functions play an essential role in neural networks. They provide the non-linearity
for the networks. Therefore, their properties are important for neural networks' accuracy and …

PLMM: Personal Large Models on Mobile Devices

Y Gong - arXiv preprint arXiv:2309.14726, 2023 - arxiv.org
Inspired by Federated Learning, in this paper, we propose personal large models that are
distilled from traditional large language models but more adaptive to local users' personal …

Start-Tv: A Closed-Form Initialization For Total Variation Models

Y Gong, G Yue - 2024 IEEE International Conference on Image …, 2024 - ieeexplore.ieee.org
Although there are many iterative solvers for total variation models, few attention has been
paid on the fast and effective approximation to their optimal solutions. In this paper, we …

TSSR: A Truncated and Signed Square Root Activation Function for Neural Networks

Y Gong, Z Lin, J Sun - 2023 9th International Conference on …, 2023 - ieeexplore.ieee.org
Activation functions are essential components of neural networks. In this paper, we introduce
a new activation function called the Truncated and Signed Square Root (TSSR) function …

Axis-Aligned Gaussian Splatting for Radiance Fields

Y Gong - Proceedings of the 2024 8th International Conference …, 2024 - dl.acm.org
Although the anisotropic Gaussian kernels in 3D Gaussian splatting methods are effective in
representing the geometry, their arbitrary orientation causes several optimization difficulties …