Normalization techniques in training dnns: Methodology, analysis and application

L Huang, J Qin, Y Zhou, F Zhu, L Liu… - IEEE transactions on …, 2023 - ieeexplore.ieee.org
Normalization techniques are essential for accelerating the training and improving the
generalization of deep neural networks (DNNs), and have successfully been used in various …

Deep learning for source code modeling and generation: Models, applications, and challenges

THM Le, H Chen, MA Babar - ACM Computing Surveys (CSUR), 2020 - dl.acm.org
Deep Learning (DL) techniques for Natural Language Processing have been evolving
remarkably fast. Recently, the DL advances in language modeling, machine translation, and …

On neural differential equations

P Kidger - arXiv preprint arXiv:2202.02435, 2022 - arxiv.org
The conjoining of dynamical systems and deep learning has become a topic of great
interest. In particular, neural differential equations (NDEs) demonstrate that neural networks …

Argmax flows and multinomial diffusion: Learning categorical distributions

E Hoogeboom, D Nielsen, P Jaini… - Advances in Neural …, 2021 - proceedings.neurips.cc
Generative flows and diffusion models have been predominantly trained on ordinal data, for
example natural images. This paper introduces two extensions of flows and diffusion for …

Enhancing the locality and breaking the memory bottleneck of transformer on time series forecasting

S Li, X Jin, Y Xuan, X Zhou, W Chen… - Advances in neural …, 2019 - proceedings.neurips.cc
Time series forecasting is an important problem across many domains, including predictions
of solar plant energy output, electricity consumption, and traffic jam situation. In this paper …

Deep learning for load forecasting with smart meter data: Online Adaptive Recurrent Neural Network

MN Fekri, H Patel, K Grolinger, V Sharma - Applied Energy, 2021 - Elsevier
Electricity load forecasting has been attracting research and industry attention because of its
importance for energy management, infrastructure planning, and budgeting. In recent years …

[图书][B] Neural networks and deep learning

CC Aggarwal - 2018 - Springer
“Any AI smart enough to pass a Turing test is smart enough to know to fail it.”–*** Ian
McDonald Neural networks were developed to simulate the human nervous system for …

An empirical evaluation of generic convolutional and recurrent networks for sequence modeling

S Bai, JZ Kolter, V Koltun - arXiv preprint arXiv:1803.01271, 2018 - arxiv.org
For most deep learning practitioners, sequence modeling is synonymous with recurrent
networks. Yet recent results indicate that convolutional architectures can outperform …

Independently recurrent neural network (indrnn): Building a longer and deeper rnn

S Li, W Li, C Cook, C Zhu… - Proceedings of the IEEE …, 2018 - openaccess.thecvf.com
Recurrent neural networks (RNNs) have been widely used for processing sequential data.
However, RNNs are commonly difficult to train due to the well-known gradient vanishing and …

Understanding batch normalization

N Bjorck, CP Gomes, B Selman… - Advances in neural …, 2018 - proceedings.neurips.cc
Batch normalization (BN) is a technique to normalize activations in intermediate layers of
deep neural networks. Its tendency to improve accuracy and speed up training have …