GPT4Tools: Teaching large language model to use tools via self-instruction

R Yang, L Song, Y Li, S Zhao, Y Ge… - Advances in Neural …, 2024 - proceedings.neurips.cc
This paper aims to efficiently enable Large Language Models (LLMs) to use multi-modal
tools. The advanced proprietary LLMs, such as ChatGPT and GPT-4, have shown great …

End-to-end object detection with fully convolutional network

J Wang, L Song, Z Li, H Sun, J Sun… - Proceedings of the …, 2021 - openaccess.thecvf.com
Mainstream object detectors based on the fully convolutional network have achieved
impressive performance. While most of them still need a hand-designed non-maximum …

TS-CAM: Token semantic coupled attention map for weakly supervised object localization

W Gao, F Wan, X Pan, Z Peng, Q Tian… - Proceedings of the …, 2021 - openaccess.thecvf.com
Weakly supervised object localization (WSOL) is a challenging problem: given only image
category labels, it requires learning object localization models. Optimizing a convolutional …

Fully convolutional networks for panoptic segmentation

Y Li, H Zhao, X Qi, L Wang, Z Li… - Proceedings of the …, 2021 - openaccess.thecvf.com
In this paper, we present a conceptually simple, strong, and efficient framework for panoptic
segmentation, called Panoptic FCN. Our approach aims to represent and predict foreground …

Box-supervised instance segmentation with level set evolution

W Li, W Liu, J Zhu, M Cui, XS Hua, L Zhang - European conference on …, 2022 - Springer
In contrast to the fully supervised methods using pixel-wise mask labels, box-supervised
instance segmentation takes advantage of the simple box annotations, which has recently …

Learning dynamic routing for semantic segmentation

Y Li, L Song, Y Chen, Z Li, X Zhang… - Proceedings of the …, 2020 - openaccess.thecvf.com
Recently, numerous handcrafted and searched networks have been applied for semantic
segmentation. However, previous works intend to handle inputs with various scales in pre …

Affinity attention graph neural network for weakly supervised semantic segmentation

B Zhang, J Xiao, J Jiao, Y Wei… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Weakly supervised semantic segmentation is receiving great attention due to its low human
annotation cost. In this paper, we aim to tackle bounding box supervised semantic …

Tree energy loss: Towards sparsely annotated semantic segmentation

Z Liang, T Wang, X Zhang, J Sun… - Proceedings of the …, 2022 - openaccess.thecvf.com
Sparsely annotated semantic segmentation (SASS) aims to train a segmentation network
with coarse-grained (i.e., point-, scribble-, and block-wise) supervisions, where only a small …

Domain-invariant stereo matching networks

F Zhang, X Qi, R Yang, V Prisacariu, B Wah… - Computer Vision–ECCV …, 2020 - Springer
State-of-the-art stereo matching networks have difficulties in generalizing to new unseen
environments due to significant domain differences, such as color, illumination, contrast, and …

Unveiling the potential of structure preserving for weakly supervised object localization

X Pan, Y Gao, Z Lin, F Tang, W Dong… - Proceedings of the …, 2021 - openaccess.thecvf.com
Weakly supervised object localization (WSOL) remains an open problem due to the
deficiency of finding object extent information using a classification network. While prior …