Mm-sap: A comprehensive benchmark for assessing self-awareness of multimodal large language models in perception Y Wang, Y Liao, H Liu, H Liu, Y Wang, Y Wang arXiv preprint arXiv:2401.07529, 2024 | 16 | 2024 |
LibriSQA: Pioneering Free-form and Open-ended Spoken Question Answering with a Novel Dataset and Framework Z Zhao, Y Jiang, H Liu, Y Wang, Y Wang arXiv preprint arXiv:2308.10390, 2023 | 3 | 2023 |
Towards an End-to-End Framework for Invasive Brain Signal Decoding with Large Language Models S Feng, H Liu, Y Wang, Y Wang INTERSPEECH 2024, 0 | 2* | |
MAV: A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset Z Chen, H Liu, W Yu, G Sun, H Liu, J Wu, C Zhang, Y Wang, Y Wang arXiv preprint arXiv:2403.14168, 2024 | 1 | 2024 |
Decoding Linguistic Representations of Human Brain Y Wang, H Liu, Y Wang, C Xuan, Y Hou, S Feng, H Liu, Y Liao, Y Wang arXiv preprint arXiv:2407.20622, 2024 | | 2024 |
LibriSQA: A Novel Dataset and Framework for Spoken Question Answering with Large Language Models Z Zhao, Y Jiang, H Liu, Y Wang, Y Wang IEEE Transactions on Artificial Intelligence, 2024 | | 2024 |
Post-decoder Biasing for End-to-End Speech Recognition of Multi-turn Medical Interview H Liu, Y Wang, Y Wang Proceedings of the 2024 Joint International Conference on Computational …, 2024 | | 2024 |