Lightweight dense video captioning with cross-modal attention and knowledge-enhanced unbiased scene graph

S Han, J Liu, J Zhang, P Gong, X Zhang… - Complex & Intelligent …, 2023 - Springer
Dense video captioning (DVC) aims at generating a description for each scene in a video.
Despite attractive progress for this task, previous works usually only concentrate on …

Knowledge graph of mobile payment platforms based on deep learning: Risk analysis and policy implications

H Xia, Y Wang, J Gauthier, JZ Zhang - Expert Systems with Applications, 2022 - Elsevier
The Fintech mobile payment platform is expanding rapidly; this expansion, in turn, creates
numerous risks. There is an urgent need to better understand these risks and to spur more …

Trajectory prediction of seagoing ships in dynamic traffic scenes via a gated spatio-temporal graph aggregation network

X Zhang, J Liu, P Gong, C Chen, B Han, Z Wu - Ocean Engineering, 2023 - Elsevier
Accurate ship trajectory prediction is essential in maritime traffic control and safety, requiring
the consideration of complex spatial and temporal dependencies within trajectory data. Most …

Goal-driven long-term marine vessel trajectory prediction with a memory-enhanced network

X Zhang, J Liu, C Chen, L Wei, Z Wu, W Dai - Expert Systems with …, 2025 - Elsevier
Enhancing the precision of marine vessel trajectory prediction (VTP) is crucial for collision
avoidance, intelligent navigation, and crisis alert in maritime safety. Most RNN-based …

Enhancing context representations with part-of-speech information and neighboring signals for question classification

P Gong, J Liu, Y Xie, M Liu, X Zhang - Complex & Intelligent Systems, 2023 - Springer
Question classification is an essential task in question answering (QA) systems. An effective
and efficient question classification model can not only restrict the search space for answers …

SF-CNN: Deep Text Classification and Retrieval for Text Documents

R Sarasu, KK Thyagharajan… - Intelligent Automation & …, 2023 - cdn.techscience.cn
Researchers and scientists need rapid access to text documents such as research papers,
source code and dissertations. Many research documents are available on the Internet and …

Cross-modal knowledge guided model for abstractive summarization

H Wang, J Liu, M Duan, P Gong, Z Wu, J Wang… - Complex & Intelligent …, 2024 - Springer
Abstractive summarization (AS) aims to generate more flexible and informative descriptions
than extractive summarization. Nevertheless, it often distorts or fabricates facts in the original …

MEDMCN: a novel multi-modal EfficientDet with multi-scale CapsNet for object detection

X Li, J Liu, Z Tang, B Han, Z Wu - The Journal of Supercomputing, 2024 - Springer
Object detection in real-world scenarios with multi-modal inputs is crucial for some safety-
critical systems, such as autonomous driving, security monitoring, and traffic management …

A multi-level circulant cross-modal transformer for multimodal speech emotion recognition

P Gong, J Liu, Z Wu, B Han… - … , Materials & Continua, 2023 - cdn.techscience.cn
Speech emotion recognition, as an important component of human-computer interaction
technology, has received increasing attention. Recent studies have treated emotion …

Joint extraction of entity relations from geological reports based on a novel relation graph convolutional network

M Tian, K Ma, Q Wu, Q Qiu, L Tao, Z Xie - Computers & Geosciences, 2024 - Elsevier
Geological reports house a wealth of geological domain knowledge and expert experience
knowledge, and the efficient extraction of geological entity relations from these texts holds …