Scene retrieval from multiple resolution generated images based on text-to-image gan

R Yanagi, R Togo, T Ogawa… - 2019 IEEE International …, 2019 - ieeexplore.ieee.org
2019 IEEE International Symposium on Circuits and Systems (ISCAS), 2019ieeexplore.ieee.org
Text-to-image Generative Adversarial Network (GAN) is a deep learning model that
generates an image from an input sentence. It is expressly attracting attentions because of
its applicability of the generated images. However, many existing studies have still focused
on generation of high-quality images, and there are few studies focusing on application of
the generated images since text-to-image GANs still cannot produce visually pleasing
images in the complicated tasks. In this paper, we apply a text-to-image GAN as a generator …
Text-to-image Generative Adversarial Network (GAN) is a deep learning model that generates an image from an input sentence. It is expressly attracting attentions because of its applicability of the generated images. However, many existing studies have still focused on generation of high-quality images, and there are few studies focusing on application of the generated images since text-to-image GANs still cannot produce visually pleasing images in the complicated tasks. In this paper, we apply a text-to-image GAN as a generator of query images for a scene retrieval task to show availability of the visually non-pleasant images. The proposed method utilizes a low-resolution generated image that focuses on a sentence and a high-resolution generated image that focuses on each word of the sentence to retrieve a desired scene. With this mechanism, the proposed method realizes a high-accuracy scene retrieval from a sentence input. Experimental results show the effectiveness of our method.
ieeexplore.ieee.org
以上显示的是最相近的搜索结果。 查看全部搜索结果