Z Yang, YJ Zhang, S Rehman, Y Huang - Image and Graphics: 9th …, 2017 - Springer
… of objects in an image. Due to the aforementioned reason, we combine objectdetection with imagecaptioning to … Initially, we use an objectdetection model to detectobjects in the image …
CW Kuo, Z Kira - Proceedings of the IEEE/CVF conference …, 2022 - openaccess.thecvf.com
… In this paper, we validate our proposed method on the VL task of imagecaptioning. By … -trained objectdetector described above, our method improves one of the SoTA imagecaptioning …
K Iwamura, JY Louhi Kasahara, A Moro, A Yamashita… - Sensors, 2021 - mdpi.com
… object region, not all the motion features. Therefore, objectdetection that detects an object region in an image … our proposed method improves imagecaptioning performance. Previous …
… purely as a language modeling problem and presume the availability of a pre-trained fully-supervised objectdetector over all object classes of interest, to the best of our knowledge. …
MA Al-Malla, A Jafar, N Ghneim - Journal of Big Data, 2022 - Springer
… In order to take advantage of the image classification features and the objectdetection features, we add this concatenation step, where we attach the output of the YOLOv4 subsystem as …
… propose CapSal, a salient object detection framework that exploits imagecaptioning to pro- … Inspired by the success of these works, we propose to leverage imagecaptioning as an …
… Dataset Analysis In this section, we compare our nocaps benchmark to COCO Captions [5] in terms of both image content and caption diversity. Based on ground-truth objectdetection …
… Two stage attention models consists of bottom-up attention and top-down attention, where bottom-up attention first uses objectdetection models to detect multiple informative regions in …
… objectdetectors using bounding box annotations for a limited set of object categories, as well as image… problem of knowledge transfer from image-caption pretraining to objectdetection. …