Xiaoyi Dong
Shanghai AI Laboratory
Verified email at mail.ustc.edu.cn
Title
Cited by
Year
CSWin transformer: A general vision transformer backbone with cross-shaped windows
X Dong, J Bao, D Chen, W Zhang, N Yu, L Yuan, D Chen, B Guo
IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2022), 2021
Cited by: 891, Year: 2021
Mobile-former: Bridging mobilenet and transformer
Y Chen, X Dai, D Chen, M Liu, X Dong, L Yuan, Z Liu
IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2022), 2021
Cited by: 455, Year: 2021
Peco: Perceptual codebook for bert pre-training of vision transformers
X Dong, J Bao, T Zhang, D Chen, W Zhang, L Yuan, D Chen, F Wen, N Yu
Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI), 2021
Cited by: 211, Year: 2021
Sharegpt4v: Improving large multi-modal models with better captions
L Chen, J Li, X Dong, P Zhang, C He, J Wang, F Zhao, D Lin
arXiv preprint arXiv:2311.12793, 2023
Cited by: 140, Year: 2023
Internlm: A multilingual language model with progressively enhanced capabilities
ILM Team
2023-01-06)[2023-09-27]. https://github.com/InternLM/InternLM, 2023
Cited by: 136, Year: 2023
Protecting Celebrities from DeepFake with Identity Consistency Transformer
X Dong, J Bao, D Chen, T Zhang, W Zhang, N Yu, D Chen, F Wen, B Guo
IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2022), 2022
Cited by: 108, Year: 2022
Lg-gan: Label guided adversarial network for flexible targeted attack of point cloud based deep networks
H Zhou, D Chen, J Liao, K Chen, X Dong, K Liu, W Zhang, G Hua, N Yu
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020
Cited by: 101, Year: 2020
Maskclip: Masked self-distillation advances contrastive language-image pretraining
X Dong, J Bao, Y Zheng, T Zhang, D Chen, H Yang, M Zeng, W Zhang, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 90, Year: 2023
InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition
P Zhang, X Dong, B Wang, Y Cao, C Xu, L Ouyang, Z Zhao, S Ding, ...
arXiv preprint arXiv:2309.15112, 2023
Cited by: 88, Year: 2023
Robust superpixel-guided attentional adversarial attack
X Dong, J Han, D Chen, J Liu, H Bian, Z Ma, H Li, X Wang, W Zhang, N Yu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020
Cited by: 65, Year: 2020
Internlm-xcomposer2: Mastering free-form text-image composition and comprehension in vision-language large model
X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, X Wei, S Zhang, ...
arXiv preprint arXiv:2401.16420, 2024
Cited by: 64, Year: 2024
GreedyFool: Distortion-Aware Sparse Adversarial Attack
X Dong, D Chen, J Bao, C Qin, L Yuan, W Zhang, N Yu, D Chen
Advances in Neural Information Processing Systems 33 (NeurIPS 2020), 2020
Cited by: 62, Year: 2020
Bootstrapped Masked Autoencoders for Vision BERT Pretraining
X Dong, J Bao, T Zhang, D Chen, W Zhang, L Yuan, D Chen, F Wen, N Yu
ECCV 2022, 2022
Cited by: 59, Year: 2022
Shape-invariant 3D adversarial point clouds
Q Huang, X Dong, D Chen, H Zhou, W Zhang, N Yu
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022
Cited by: 55, Year: 2022
Self-robust 3d point recognition via gather-vector guidance
X Dong, D Chen, H Zhou, G Hua, W Zhang, N Yu
2020 IEEE/CVF conference on computer vision and pattern recognition (cvpr …, 2020
Cited by: 52, Year: 2020
Internlm2 technical report
Z Cai, M Cao, H Chen, K Chen, K Chen, X Chen, X Chen, Z Chen, Z Chen, ...
arXiv preprint arXiv:2403.17297, 2024
Cited by: 41, Year: 2024
Local geometric distortions resilient watermarking scheme based on symmetry
Z Ma, W Zhang, H Fang, X Dong, L Geng, N Yu
IEEE Transactions on Circuits and Systems for Video Technology 31 (12), 4826 …, 2021
Cited by: 41, Year: 2021
Diversity-aware meta visual prompting
Q Huang, X Dong, D Chen, W Zhang, F Wang, G Hua, N Yu
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by: 38, Year: 2023
Once a man: Towards multi-target attack via learning multi-target adversarial network once
J Han, X Dong, R Zhang, D Chen, W Zhang, N Yu, P Luo, X Wang
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019
Cited by: 37, Year: 2019
How far are we to gpt-4v? closing the gap to commercial multimodal models with open-source suites
Z Chen, W Wang, H Tian, S Ye, Z Gao, E Cui, W Tong, K Hu, J Luo, Z Ma, ...
arXiv preprint arXiv:2404.16821, 2024
Cited by: 35, Year: 2024