Beyond narrative description: Generating poetry from images by multi-adversarial training

J Gui, Z Sun, Y Wen, D Tao, J Ye - IEEE transactions on …, 2021 - ieeexplore.ieee.org

Generative adversarial networks (GANs) have recently become a hot research topic;
however, they have been studied since 2014, and a large number of algorithms have been …

被引用次数：987 相关文章所有 13 个版本

[PDF] arxiv.org

A survey on generative adversarial networks: Variants, applications, and training

A Jabbar, X Li, B Omar - ACM Computing Surveys (CSUR), 2021 - dl.acm.org

The Generative Models have gained considerable attention in unsupervised learning via a
new and practical framework called Generative Adversarial Networks (GAN) due to their …

被引用次数：302 相关文章所有 3 个版本

[PDF] thecvf.com

Seeing out of the box: End-to-end pre-training for vision-language representation learning

Z Huang, Z Zeng, Y Huang, B Liu… - Proceedings of the …, 2021 - openaccess.thecvf.com

We study on joint learning of Convolutional Neural Network (CNN) and Transformer for
vision-language pre-training (VLPT) which aims to learn cross-modal alignments from …

被引用次数：269 相关文章所有 6 个版本

[PDF] arxiv.org

GAN computers generate arts? A survey on visual arts, music, and literary text generation using generative adversarial network

S Shahriar - Displays, 2022 - Elsevier

Abstract “Art is the lie that enables us to realize the truth.”–Pablo Picasso. For centuries,
humans have dedicated themselves to producing arts to convey their imagination. The …

被引用次数：95 相关文章所有 5 个版本

A case study of conditional deep convolutional generative adversarial networks in machine fault diagnosis

J Luo, J Huang, H Li - Journal of Intelligent Manufacturing, 2021 - Springer

Due to the real working conditions, the collected mechanical fault datasets are actually
limited and always highly imbalanced, which restricts the diagnosis accuracy and stability …

被引用次数：140 相关文章所有 6 个版本

[PDF] arxiv.org

Mmt-bench: A comprehensive multimodal benchmark for evaluating large vision-language models towards multitask agi

K Ying, F Meng, J Wang, Z Li, H Lin, Y Yang… - arXiv preprint arXiv …, 2024 - arxiv.org

Large Vision-Language Models (LVLMs) show significant strides in general-purpose
multimodal applications such as visual dialogue and embodied navigation. However …

被引用次数：16 相关文章所有 3 个版本

[PDF] hbs.edu

Psychological factors underlying attitudes toward AI tools

J De Freitas, S Agarwal, B Schmitt… - Nature Human Behaviour, 2023 - nature.com

What are the psychological factors driving attitudes toward artificial intelligence (AI) tools,
and how can resistance to AI systems be overcome when they are beneficial? Here we first …

被引用次数：9 相关文章所有 5 个版本

[PDF] neurips.cc

Visual clues: Bridging vision and language foundations for image paragraph captioning

Y Xie, L Zhou, X Dai, L Yuan, N Bach… - Advances in Neural …, 2022 - proceedings.neurips.cc

People say," A picture is worth a thousand words". Then how can we get the rich information
out of the image? We argue that by using visual clues to bridge large pretrained vision …

被引用次数：22 相关文章所有 7 个版本

[PDF] ieee.org

Adversarial training in affective computing and sentiment analysis: Recent advances and perspectives

J Han, Z Zhang, N Cummins… - IEEE Computational …, 2019 - ieeexplore.ieee.org

Over the past few years, adversarial training has become an extremely active research topic
and has been successfully applied to various Artificial Intelligence (AI) domains. As a …

被引用次数：89 相关文章所有 7 个版本

[PDF] koreascience.kr

Generative adversarial networks: A literature review

J Cheng, Y Yang, X Tang, N Xiong… - KSII Transactions on …, 2020 - koreascience.kr

Abstract The Generative Adversarial Networks, as one of the most creative deep learning
models in recent years, has achieved great success in computer vision and natural …

被引用次数：64 相关文章所有 8 个版本

高级搜索

QQ 群