The cold-start problem has been a long-standing issue in recommendation. Embedding- based recommendation models provide recommendations by learning embeddings for each …
Abstract 2D image understanding is a complex problem within computer vision, but it holds the key to providing human-level scene comprehension. It goes further than identifying the …
Z Xing, Y He - International Journal of Electrical Power & Energy …, 2023 - Elsevier
Fault diagnosis is important to the timely repair of the power transformer. However, machine learning has not been exploited effectively for fault diagnosis due to the limitation of multi …
Abstract Visual Question Answering (VQA) as an important task in understanding vision and language has been proposed and aroused wide interests. In previous VQA methods …
Y Liu, X Zhang, Z Zhao, B Zhang… - IEEE transactions on …, 2020 - ieeexplore.ieee.org
Visual question answering (VQA) has gained increasing attention in both natural language processing and computer vision. The attention mechanism plays a crucial role in relating the …
Network ensemble aims to obtain better results by aggregating the predictions of multiple weak networks, in which how to keep the diversity of different networks plays a critical role in …
Z Lei, G Zhang, L Wu, K Zhang, R Liang - Data Science and Engineering, 2022 - Springer
Visual question answering is a complex multimodal task involving images and text, with broad application prospects in human–computer interaction and medical assistance …
T MeshuWelde, L Liao - Pattern Recognition, 2023 - Elsevier
The counting-based questions play a major part in Visual Question Answering (VQA), the most challenging factor is counting the different objects present in the images. Recently …
Y Liu, X Zhang, F Huang, S Shen, P Tian, L Li, Z Li - Pattern Recognition, 2022 - Elsevier
Abstract Video Question Answering (VideoQA) has gained increasing attention as an important task in understanding the rich spatio-temporal contents, ie, the appearance and …