关注
Sitong Wu
Sitong Wu
在 link.cuhk.edu.hk 的电子邮件经过验证
标题
引用次数
引用次数
年份
Pale transformer: A general vision transformer backbone with pale-shaped attention
S Wu, T Wu, H Tan, G Guo
Proceedings of the AAAI Conference on Artificial Intelligence 36 (3), 2731-2739, 2022
602022
Fully transformer networks for semantic image segmentation
S Wu, T Wu, F Lin, S Tian, G Guo
arXiv preprint arXiv:2106.04108, 2021
422021
Structtoken: Rethinking semantic segmentation with structural prior
F Lin, Z Liang, S Wu, J He, K Chen, S Tian
IEEE Transactions on Circuits and Systems for Video Technology 33 (10), 5655 …, 2023
332023
Semantic diffusion network for semantic segmentation
H Tan, S Wu, J Pi
Advances in Neural Information Processing Systems 35, 8702-8716, 2022
242022
Catrans: Context and affinity transformer for few-shot segmentation
S Zhang, T Wu, S Wu, G Guo
arXiv preprint arXiv:2204.12817, 2022
222022
Data pruning via moving-one-sample-out
H Tan, S Wu, F Du, Y Chen, Z Wang, F Wang, X Qi
Advances in Neural Information Processing Systems 36, 2024
132024
Regionblip: A unified multi-modal pre-training framework for holistic and regional comprehension
Q Zhou, C Yu, S Zhang, S Wu, Z Wang, F Wang
arXiv preprint arXiv:2308.02299, 2023
132023
Demystify transformers & convolutions in modern image deep networks
J Dai, M Shi, W Wang, S Wu, L Xing, W Wang, X Zhu, L Lu, J Zhou, ...
arXiv preprint arXiv:2211.05781, 2022
122022
Uninext: Exploring a unified architecture for vision recognition
F Lin, J Yuan, S Wu, F Wang, Z Wang
Proceedings of the 31st ACM International Conference on Multimedia, 3200-3208, 2023
92023
Full-scale selective transformer for semantic segmentation
F Lin, S Wu, Y Ma, S Tian
Proceedings of the Asian Conference on Computer Vision, 2663-2679, 2022
82022
PRSeg: A lightweight patch rotate MLP decoder for semantic segmentation
Y Ma, F Lin, S Wu, S Tian, L Yu
IEEE Transactions on Circuits and Systems for Video Technology 33 (11), 6860 …, 2023
72023
Feature selective transformer for semantic image segmentation
F Lin, T Wu, S Wu, S Tian, G Guo
arXiv preprint arXiv:2203.14124, 2022
52022
Proxy graph matching with proximal matching networks
HR Tan, C Wang, ST Wu, TQ Wang, XY Zhang, CL Liu
Proceedings of the AAAI conference on artificial intelligence 35 (11), 9808-9815, 2021
52021
Ensemble quadratic assignment network for graph matching
H Tan, C Wang, S Wu, XY Zhang, F Yin, CL Liu
International Journal of Computer Vision, 1-23, 2024
22024
Axwin transformer: A context-aware vision transformer backbone with axial windows
F Lin, Y Ma, S Wu, L Yu, S Tian
arXiv preprint arXiv:2305.01280, 2023
22023
RoboCoder: Robotic Learning from Basic Skills to General Tasks with Large Language Models
J Li, P Chen, S Wu, C Zheng, H Xu, J Jia
arXiv preprint arXiv:2406.03757, 2024
12024
SaCo Loss: Sample-wise Affinity Consistency for Vision-Language Pre-training
S Wu, H Tan, Z Tian, Y Chen, X Qi, J Jia
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
2024
Supplementary Material for SaCo Loss: Sample-wise Affinity Consistency for Vision-Language Pre-training
S Wu, H Tan, Z Tian, Y Chen, X Qi, J Jia
系统目前无法执行此操作,请稍后再试。
文章 1–18