shengpeng ji 个人学术档案 - 学术资源搜索

引用次数

	总计	2019 年至今
引用	127	127
h 指数	5	5
i10 指数	3	3

0

120

60

30

90

2023202411 116

开放获取的出版物数量

1 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Zhou ZhaoZhejiang University在 zju.edu.cn 的电子邮件经过验证
Ziyue JiangZhejiang University在 zju.edu.cn 的电子邮件经过验证
Qian Chen (陈谦)Alibaba Group在 alibaba-inc.com 的电子邮件经过验证
Wen WangAlibaba DAMO Academy在 alibaba-inc.com 的电子邮件经过验证
Long ZhouMicrosoft Research Asia在 microsoft.com 的电子邮件经过验证
Shujie Liu (刘树杰）Microsoft Research Asia在 microsoft.com 的电子邮件经过验证

shengpeng ji

shengpeng ji

Zhejiang university

在 zju.edu.cn 的电子邮件经过验证 - 首页

Speech NLP Large Language Models


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Mega-tts: Zero-shot text-to-speech at scale with intrinsic inductive bias Z Jiang, Y Ren, Z Ye, J Liu, C Zhang, Q Yang, S Ji, R Huang, C Wang, ... arXiv preprint arXiv:2306.03509, 2023	49	2023
Mega-tts 2: Zero-shot text-to-speech with arbitrary length speech prompts Z Jiang, J Liu, Y Ren, J He, Z Ye, S Ji, Q Yang, C Zhang, P Wei, C Wang, ... ICLR 2024, 2023	34*	2023
Textrolspeech: A text style control speech corpus with codec language text-to-speech models S Ji, J Zuo, M Fang, Z Jiang, F Chen, X Duan, B Huai, Z Zhao ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024	19	2024
Language-codec: Reducing the gaps between discrete codec representation and speech language models S Ji, M Fang, Z Jiang, R Huang, J Zuo, S Wang, Z Zhao arXiv preprint arXiv:2402.12208, 2024	7	2024
MobileSpeech: A Fast and High-Fidelity Framework for Mobile Zero-Shot Text-to-Speech S Ji, Z Jiang, H Wang, J Zuo, Z Zhao ACL 2024 Main, 2024	5	2024
ControlSpeech: Towards Simultaneous Zero-shot Speaker Cloning and Zero-shot Language Style Control With Decoupled Codec S Ji, J Zuo, M Fang, S Zheng, Q Chen, W Wang, Z Jiang, H Huang, ... arXiv preprint arXiv:2406.01205, 2024	3	2024
FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs Alibaba technical report arXiv preprint arXiv:2407.04051, 2024	2	2024
ACE: A Generative Cross-Modal Retrieval Framework with Coarse-To-Fine Semantic Modeling M Fang, S Ji, J Zuo, H Huang, Y Xia, J Zhu, X Cheng, X Yang, W Liu, ... arXiv preprint arXiv:2406.17507, 2024	2	2024
Unlocking the Potential of Multimodal Unified Discrete Representation through Training-Free Codebook Optimization and Hierarchical Alignment H Huang, Y Xia, S Ji, S Wang, H Wang, J Zhu, Z Dong, Z Zhao arXiv preprint arXiv:2403.05168, 2024	2	2024
Generating Neural Networks for Diverse Networking Classification Tasks via Hardware-Aware Neural Architecture Search G Xie, Q Li, Z Shi, H Fang, S Ji, Y Jiang, Z Yuan, L Ma, M Xu IEEE Transactions on Computers, 2023	2	2023
WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling S Ji, Z Jiang, X Cheng, Y Chen, M Fang, J Zuo, Q Yang, R Li, Z Zhang, ... arXiv preprint arXiv:2408.16532, 2024	1	2024
SyncTalklip: Highly Synchronized Lip-Readable Speaker Generation with Multi-Task Learning X Yang, X Cheng, D Fu, M Fang, J Zuo, S Ji, T Jin, Z Zhao ACM Multimedia 2024, 2024	1	2024

系统目前无法执行此操作，请稍后再试。

文章 1–12