An Attention-Based Approach to Hierarchical Multi-Label Music Instrument Classification Z Zhong, M Hirano, K Shimada, K Tateishi, S Takahashi, Y Mitsufuji ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 8 | 2023 |
Diffusion-based speech enhancement with joint generative and predictive decoders H Shi, K Shimada, M Hirano, T Shibuya, Y Koyama, Z Zhong, S Takahashi, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 4 | 2024 |
Extending Audio Masked Autoencoders Toward Audio Restoration Z Zhong, H Shi, M Hirano, K Shimada, K Tateishi, T Shibuya, S Takahashi, ... WASPAA 2023-2023 IEEE Workshop on Applications of Signal Processing to Audio …, 2023 | 4 | 2023 |
Assessment of a beamforming implementation developed for surface sound source separation Z Zhong, M Shakeel, K Itoyama, K Nishida, K Nakadai 2021 IEEE/SICE International Symposium on System Integration (SII), 369-374, 2021 | 2 | 2021 |
Design and assessment of a scan-and-sum beamformer for surface sound source separation Z Zhong, K Itoyama, K Nishida, K Nakadai 2020 IEEE/SICE International Symposium on System Integration (SII), 808-813, 2020 | 2 | 2020 |
SoundCTM: Uniting Score-based and Consistency Models for Text-to-Sound Generation K Saito, D Kim, T Shibuya, CH Lai, Z Zhong, Y Takida, Y Mitsufuji arXiv preprint arXiv:2405.18503, 2024 | 1 | 2024 |
Visual Echoes: A Simple Unified Transformer for Audio-Visual Generation S Yang, Z Zhong, M Zhao, S Takahashi, M Ishii, T Shibuya, Y Mitsufuji arXiv preprint arXiv:2405.14598, 2024 | 1 | 2024 |
SpecMaskGIT: Masked Generative Modeling of Audio Spectrograms for Efficient Audio Synthesis and Beyond M Comunita, Z Zhong, A Takahashi, S Yang, M Zhao, K Saito, YI Shibuya, ... arXiv preprint arXiv:2406.17672, 2024 | | 2024 |
On the Language Encoder of Contrastive Cross-modal Models M Zhao, J Ono, Z Zhong, CH Lai, Y Takida, N Murata, WH Liao, T Shibuya, ... arXiv preprint arXiv:2310.13267, 2023 | | 2023 |
Source separation device, source separation method and program KN Kazuhiro Nakadai, Zhi Zhong, Katsutoshi Itoyama JP Patent 特許第7316614号, 2023 | | 2023 |