Exploiting transformation invariance and equivariance for self-supervised sound localisation J Liu, C Ju, W Xie, Y Zhang Proceedings of the 30th ACM International Conference on Multimedia, 3742-3753, 2022 | 28 | 2022 |
Annotation-free Audio-Visual Segmentation J Liu, Y Wang, C Ju, C Ma, Y Zhang, W Xie Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, 2024 | 27 | 2024 |
Diffusionseg: Adapting diffusion towards unsupervised object discovery C Ma, Y Yang, C Ju, F Zhang, J Liu, Y Wang, Y Zhang, Y Wang arXiv preprint arXiv:2303.09813, 2023 | 24 | 2023 |
Distilling vision-language pre-training to collaborate with weakly-supervised temporal action localization C Ju, K Zheng, J Liu, P Zhao, Y Zhang, J Chang, Q Tian, Y Wang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 19 | 2023 |
Constraint and union for partially-supervised temporal sentence grounding C Ju, H Wang, J Liu, C Ma, Y Zhang, P Zhao, J Chang, Q Tian arXiv preprint arXiv:2302.09850, 2023 | 13 | 2023 |
Audio-aware query-enhanced transformer for audio-visual segmentation J Liu, C Ju, C Ma, Y Wang, Y Wang, Y Zhang arXiv preprint arXiv:2307.13236, 2023 | 11 | 2023 |
A 3D Mesh-based Lifting-and-Projection Network for Human Pose Transfer J Liu, Y Zhao, S Chen, Y Zhang IEEE Transactions on Multimedia 24, 4314-4327, 2022 | 7 | 2022 |
Audio-Visual Segmentation via Unlabeled Frame Exploitation J Liu, Y Liu, F Zhang, C Ju, Y Zhang, Y Wang Computer Vision and Pattern Recognition (CVPR 2024), 2024 | 2 | 2024 |