Universal transformers

M Dehghani, S Gouws, O Vinyals, J Uszkoreit… - arXiv preprint arXiv …, 2018 - arxiv.org
Recurrent neural networks (RNNs) sequentially process data by updating their state with
each new data point, and have long been the de facto choice for sequence modeling tasks …

AdaFrame: Adaptive frame selection for fast video recognition

Z Wu, C Xiong, CY Ma, R Socher… - Proceedings of the …, 2019 - openaccess.thecvf.com
We present AdaFrame, a framework that adaptively selects relevant frames on a per-input
basis for fast video recognition. AdaFrame contains a Long Short-Term Memory network …

Recurrent neural network attention mechanisms for interpretable system log anomaly detection

A Brown, A Tuor, B Hutchinson, N Nichols - Proceedings of the first …, 2018 - dl.acm.org
Deep learning has recently demonstrated state-of-the-art performance on key tasks related
to the maintenance of computer systems, such as intrusion detection, denial of service attack …

Tree-augmented cross-modal encoding for complex-query video retrieval

X Yang, J Dong, Y Cao, X Wang, M Wang… - Proceedings of the 43rd …, 2020 - dl.acm.org
The rapid growth of user-generated videos on the Internet has intensified the need for text-
based video retrieval systems. Traditional methods mainly favor the concept-based …

LSTMs can learn syntax-sensitive dependencies well, but modeling structure makes them better

A Kuncoro, C Dyer, J Hale, D Yogatama… - Proceedings of the …, 2018 - aclanthology.org
Language exhibits hierarchical structure, but recent work using a subject-verb
agreement diagnostic argued that state-of-the-art language models, LSTMs, fail to learn long …

Transformer grammars: Augmenting transformer language models with syntactic inductive biases at scale

L Sartran, S Barrett, A Kuncoro, M Stanojević… - Transactions of the …, 2022 - direct.mit.edu
We introduce Transformer Grammars (TGs), a novel class of Transformer language
models that combine (i) the expressive power, scalability, and strong performance of …

Improving fake news detection of influential domain via domain- and instance-level transfer

Q Nan, D Wang, Y Zhu, Q Sheng, Y Shi, J Cao… - arXiv preprint arXiv …, 2022 - arxiv.org
Both real and fake news in various domains, such as politics, health, and entertainment are
spread via online social media every day, necessitating fake news detection for multiple …

A dynamic frame selection framework for fast video recognition

Z Wu, H Li, C Xiong, YG Jiang… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
We introduce AdaFrame, a conditional computation framework that adaptively selects
relevant frames on a per-input basis for fast video recognition. AdaFrame, which contains a …

A provably stable neural network Turing Machine with finite precision and time

J Stogin, A Mali, CL Giles - Information Sciences, 2024 - Elsevier
We introduce a neural stack architecture with a differentiable parameterized stack operator
approximating stack push and pop operations. We prove the stability of this stack …

NLSALog: An anomaly detection framework for log sequence in security management

R Yang, D Qu, Y Gao, Y Qian, Y Tang - IEEE Access, 2019 - ieeexplore.ieee.org
For the security defense in the current Intelligent Transportation System (ITS), malware is
often used as the security analysis data source, but only the known attack type can be …