MMCosine: Multi-modal cosine loss towards balanced audio-visual fine-grained learning | R Xu, R Feng, SX Zhang, D Hu | ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 21 | 2023 |
Enhancing multimodal cooperation via sample-level modality valuation | Y Wei, R Feng, Z Wang, D Hu | Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 3 | 2024 |
Revisiting pre-training in audio-visual learning | R Feng, W Xia, D Hu | arXiv preprint arXiv:2302.03533, 2023 | 2 | 2023 |
Play to the Score: Stage-Guided Dynamic Multi-Sensory Fusion for Robotic Manipulation | R Feng, D Hu, W Ma, X Li | arXiv preprint arXiv:2408.01366, 2024 | | 2024 |
Diagnosing and Re-learning for Balanced Multimodal Learning | Y Wei, S Li, R Feng, D Hu | arXiv preprint arXiv:2407.09705, 2024 | | 2024 |
Enhancing Multi-modal Cooperation via Fine-grained Modality Valuation | Y Wei, R Feng, Z Wang, D Hu | arXiv preprint arXiv:2309.06255, 2023 | | 2023 |
Supplementary Material for MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning | R Xu, R Feng, SX Zhang, D Hu | | | |
Towards Better Egocentric Action Understanding in a Multi-Input Multi-Output View | W Hou, R Feng, Y Xu, Y Tian, D Hu | | | |