Fast and robust early-exiting framework for autoregressive language models with synchronized parallel decoding

S Bae, J Ko, H Song, SY Yun - arXiv preprint arXiv:2310.05424, 2023 - arxiv.org
To tackle the high inference latency exhibited by autoregressive language models, previous
studies have proposed an early-exiting framework that allocates adaptive computation paths …

Decorrelation of neutral vector variables: Theory and applications

Z Ma, JH Xue, A Leijon, ZH Tan… - IEEE transactions on …, 2016 - ieeexplore.ieee.org
In this paper, we propose novel strategies for neutral vector variable decorrelation. Two
fundamental invertible transformations, namely, serial nonlinear transformation and parallel …

Security in fog computing: A novel technique to tackle an impersonation attack

S Tu, M Waqas, SU Rehman, M Aamir… - IEEE …, 2018 - ieeexplore.ieee.org
Fog computing is an encouraging technology in the coming generation to pipeline the
breach between cloud data centers and Internet of Things (IoT) devices. Fog computing is …

Variational Bayesian matrix factorization for bounded support data

Z Ma, AE Teschendorff, A Leijon, Y Qiao… - IEEE transactions on …, 2014 - ieeexplore.ieee.org
A novel Bayesian matrix factorization method for bounded support data is presented. Each
entry in the observation matrix is assumed to be beta distributed. As the beta distribution has …

Advanced dropout: A model-free methodology for bayesian dropout optimization

J Xie, Z Ma, J Lei, G Zhang, JH Xue… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Due to lack of data, overfitting ubiquitously exists in real-world applications of deep neural
networks (DNNs). We propose advanced dropout, a model-free methodology, to mitigate …

Variational learning for finite Dirichlet mixture models and applications

W Fan, N Bouguila, D Ziou - IEEE transactions on neural …, 2012 - ieeexplore.ieee.org
In this paper, we focus on the variational learning of finite Dirichlet mixture models.
Compared to other algorithms that are commonly used for mixture models (such as …

Well logging prediction and uncertainty analysis based on recurrent neural network with attention mechanism and Bayesian theory

L Zeng, W Ren, L Shan, F Huo - Journal of Petroleum Science and …, 2022 - Elsevier
Deep learning technology can fit the nonlinear relations between different logging
sequences. It solves the prediction problems that cannot be effectively disposed by …

A hybrid Markov-based model for human mobility prediction

Y Qiao, Z Si, Y Zhang, FB Abdesslem, X Zhang, J Yang - Neurocomputing, 2018 - Elsevier
Human mobility behavior is far from random, and its indicators follow non-Gaussian
distributions. Predicting human mobility has the potential to enhance location-based …

Clustering analysis in the wireless propagation channel with a variational Gaussian mixture model

Y Li, J Zhang, Z Ma, Y Zhang - IEEE Transactions on Big Data, 2018 - ieeexplore.ieee.org
In this paper, the Gaussian mixture model (GMM) is introduced to implement channel
multipath clustering. The GMM incorporates the covariance structure and the mean …

Robust learning by self-transition for handling noisy labels

H Song, M Kim, D Park, Y Shin, JG Lee - Proceedings of the 27th ACM …, 2021 - dl.acm.org
Real-world data inevitably contains noisy labels, which induce the poor generalization of
deep neural networks. It is known that the network typically begins to rapidly memorize false …