The advent of large-scale pretrained language models (PLMs) has contributed greatly to the progress in natural language processing (NLP). Despite its recent success and wide …
Video summarization aims to create short, accurate, and cohesive summaries of longer videos. Despite the existence of various video summarization datasets, a notable limitation …
Long-form video content constitutes a significant portion of internet traffic making automated video summarization an essential research problem. However existing video summarization …
In everyday communication, humans frequently use speech and gestures to refer to specific areas or objects, a process known as Referential Dialogue (RD). While prior studies have …
AK Singh, D Srivastava… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
We introduce multimodal story summarization by leveraging TV episode recaps-short video sequences interweaving key story moments from previous episodes to bring viewers up to …
Artistic video portrait generation is a significant and sought-after task in the fields of computer graphics and vision. While various methods have been developed that integrate …
X Liu, P Liu, H He - arXiv preprint arXiv:2406.00050, 2024 - arxiv.org
In this study, we investigate the capabilities and inherent biases of advanced large language models (LLMs) such as GPT-3.5 and GPT-4 in the context of debate evaluation. We discover …
Assessing the artness of AI-generated images continues to be a challenge within the realm of image generation. Most existing metrics cannot be used to perform instance-level and …
Text summarization research has undergone several significant transformations with the advent of deep neural networks, pre-trained language models (PLMs), and recent large …