Recent advances and trends in multimodal deep learning: A review

J Summaira, X Li, AM Shoib, S Li, J Abdul - arXiv preprint arXiv …, 2021 - arxiv.org
Deep Learning has implemented a wide range of applications and has become increasingly
popular in recent years. The goal of multimodal deep learning is to create models that can …

Deep neural network-based decision network

AR Johansen, B McCann, J Bradbury… - US Patent …, 2022 - Google Patents
The technology disclosed proposes using a combination of computationally cheap, less-
accurate bag of words (BoW) model and computationally expensive, more-accurate long …

Neural machine translation with latent tree attention

J Bradbury - US Patent 10,565,318, 2020 - Google Patents
We introduce an attentional neural machine translation model for the task of machine
translation that accomplishes the longstanding goal of natural language processing to take …

Generating dual sequence inferences using a neural network model

V Zhong, C Xiong, R Socher - US Patent 11,170,287, 2021 - Google Patents
A computer-implemented method for dual sequence inference using a neural network model
includes generating a codependent representation based on a first input representation of a …

Sentinel gate for modulating auxiliary information in a long short-term memory (lstm) neural network

LU Jiasen, C Xiong, R Socher - US Patent 10,565,306, 2020 - Google Patents
The technology disclosed presents a novel spatial attention model that uses current hidden
state information of a decoder long short-term memory (LSTM) to guide attention and to …

Sequence-to-sequence prediction using a neural network model

NS Keskar, K Ahmed, R Socher - US Patent 11,928,600, 2024 - Google Patents
A method for sequence-to-sequence prediction using a neural network model includes
generating an encoded representation based on an input sequence using an encoder of the …

Spatial attention model for image captioning

LU Jiasen, C Xiong, R Socher - US Patent 10,558,750, 2020 - Google Patents
The technology disclosed presents a novel spatial attention model that uses current hidden
state information of a decoder long short-term memory (LSTM) to guide attention and to …

End-to-end speech recognition with policy learning

Y Zhou, C Xiong - US Patent 10,573,295, 2020 - Google Patents
The disclosed technology teaches a deep end-to-end speech recognition model, including
using multi-objective learning criteria to train a deep end-to-end speech recognition model …

Hierarchical and interpretable skill acquisition in multi-task reinforcement learning

C Xiong, SHU Tianmin, R Socher - US Patent 11,562,287, 2023 - Google Patents
US11562287B2 - Hierarchical and interpretable skill acquisition in multi-task reinforcement
learning - Google Patents US11562287B2 - Hierarchical and interpretable skill acquisition in …

Dense video captioning

Y Zhou, L Zhou, C Xiong, R Socher - US Patent 10,542,270, 2020 - Google Patents
Systems and methods for dense captioning of a video include a multi-layer encoder stack
configured to receive information extracted from a plurality of video frames, a proposal …