Shuming Ma 个人学术档案 - 学术资源搜索

引用次数

	总计	2019 年至今
引用	4231	4050
h 指数	36	35
i10 指数	63	60

1400

700

350

1050

2017201820192020202120222023202423 154 220 294 417 565 1312 1237

开放获取的出版物数量

查看全部

17 篇文章

1 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Furu WeiPartner Research Manager, Microsoft Research在 microsoft.com 的电子邮件经过验证
Xu SunAssociate Professor, Peking University在 pku.edu.cn 的电子邮件经过验证
houfeng wangPeking University在 pku.edu.cn 的电子邮件经过验证
Junyang LinQwen Team, Alibaba Group & Peking University在 alibaba-inc.com 的电子邮件经过验证
Lei CuiMicrosoft Research Asia在 microsoft.com 的电子邮件经过验证
Tianyu LiuAlibaba在 pku.edu.cn 的电子邮件经过验证
Jingjing XuShanghai AI Lab在 pku.edu.cn 的电子邮件经过验证
Wenjie LiThe Hong Kong Polytechnic University在 comp.polyu.edu.hk 的电子邮件经过验证
Sujian LIPeking Univ.在 pku.edu.cn 的电子邮件经过验证
Yizhong WangUniversity of Washington在 cs.washington.edu 的电子邮件经过验证

关注

Shuming Ma

Microsoft Research Asia

在 microsoft.com 的电子邮件经过验证 - 首页

Natural language processing deep learning


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
SGM: sequence generation model for multi-label classification P Yang, X Sun, W Li, S Ma, W Wu, H Wang arXiv preprint arXiv:1806.04822, 2018	433	2018
Language is not all you need: Aligning perception with language models S Huang, L Dong, W Wang, Y Hao, S Singhal, S Ma, T Lv, L Cui, ... Advances in Neural Information Processing Systems 36, 2024	309	2024
Kosmos-2: Grounding multimodal large language models to the world Z Peng, W Wang, L Dong, Y Hao, S Huang, S Ma, F Wei arXiv preprint arXiv:2306.14824, 2023	304	2023
Why can gpt learn in-context? language models implicitly perform gradient descent as meta-optimizers D Dai, Y Sun, L Dong, Y Hao, S Ma, Z Sui, F Wei arXiv preprint arXiv:2212.10559, 2022	240	2022
Global encoding for abstractive summarization J Lin, X Sun, S Ma, Q Su arXiv preprint arXiv:1805.03989, 2018	187	2018
meprop: Sparsified back propagation for accelerated deep learning with reduced overfitting X Sun, X Ren, S Ma, H Wang International Conference on Machine Learning, 3299-3308, 2017	181	2017
Retentive network: A successor to transformer for large language models Y Sun, L Dong, S Huang, S Ma, Y Xia, J Xue, J Wang, F Wei arXiv preprint arXiv:2307.08621, 2023	131	2023
Deepnet: Scaling transformers to 1,000 layers H Wang, S Ma, L Dong, S Huang, D Zhang, F Wei IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024	124	2024
XLM-E: Cross-lingual language model pre-training via ELECTRA Z Chi, S Huang, L Dong, S Ma, B Zheng, S Singhal, P Bajaj, X Song, ... arXiv preprint arXiv:2106.16138, 2021	107	2021
A simple and effective unified encoder for document-level machine translation S Ma, D Zhang, M Zhou Proceedings of the 58th annual meeting of the association for computational …, 2020	94	2020
Language models are general-purpose interfaces Y Hao, H Song, L Dong, S Huang, Z Chi, W Wang, S Ma, F Wei arXiv preprint arXiv:2206.06336, 2022	87	2022
A length-extrapolatable transformer Y Sun, L Dong, B Patra, S Ma, S Huang, A Benhaim, V Chaudhary, ... arXiv preprint arXiv:2212.10554, 2022	84	2022
Improving semantic relevance for sequence-to-sequence learning of chinese social media text summarization S Ma, X Sun, J Xu, H Wang, W Li, Q Su arXiv preprint arXiv:1706.02459, 2017	80	2017
Longnet: Scaling transformers to 1,000,000,000 tokens J Ding, S Ma, L Dong, X Zhang, S Huang, W Wang, N Zheng, F Wei arXiv preprint arXiv:2307.02486, 2023	78	2023
Query and output: Generating words by querying distributed word representations for paraphrase generation S Ma, X Sun, W Li, S Li, W Li, X Ren arXiv preprint arXiv:1803.01465, 2018	77	2018
Alternating language modeling for cross-lingual pre-training J Yang, S Ma, D Zhang, S Wu, Z Li, M Zhou Proceedings of the AAAI Conference on Artificial Intelligence 34 (05), 9386-9393, 2020	76	2020
Bag-of-words as target for neural machine translation S Ma, X Sun, Y Wang, J Lin arXiv preprint arXiv:1805.04871, 2018	73	2018
Semantic-unit-based dilated convolution for multi-label text classification J Lin, Q Su, P Yang, S Ma, X Sun arXiv preprint arXiv:1808.08561, 2018	70	2018
mT6: Multilingual pretrained text-to-text transformer with translation pairs Z Chi, L Dong, S Ma, SHXL Mao, H Huang, F Wei arXiv preprint arXiv:2104.08692, 2021	68	2021
A deep reinforced sequence-to-set model for multi-label classification P Yang, F Luo, S Ma, J Lin, X Sun Proceedings of the 57th Annual Meeting of the Association for Computational …, 2019	64	2019

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用