Recent advancements in vision-and-language models have opened new possibilities for natural language generation, particularly in generating creative stories from visual input. We …
X Lin, X Chen - arXiv preprint arXiv:2407.02586, 2024 - arxiv.org
Visual storytelling is an emerging field that combines images and narratives to create engaging and contextually rich stories. Despite its potential, generating coherent and …
Visual storytelling, which involves generating coherent and engaging narratives from a sequence of images, is a challenging task that has garnered significant interest due to its …