关注
Marzieh Fadaee
Marzieh Fadaee
Senior Research Scientist, Cohere For AI
在 cohere.com 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Data Augmentation for Low-Resource Neural Machine Translation
M Fadaee, A Bisazza, C Monz
Proceedings of the 55th Annual Meeting of the Association for Computational …, 2017
5742017
Back-translation sampling by targeting difficult words in neural machine translation
M Fadaee, C Monz
arXiv preprint arXiv:1808.09006, 2018
832018
mMARCO: A Multilingual Version of the MS MARCO Passage Ranking Dataset
L Henrique Bonifacio, V Jeronymo, H Queiroz Abonizio, I Campiotti, ...
arXiv preprint arXiv:2108.13897, 2021
772021
Inpars: Data augmentation for information retrieval using large language models
L Bonifacio, H Abonizio, M Fadaee, R Nogueira
arXiv preprint arXiv:2202.05144, 2022
742022
Inpars: Unsupervised dataset generation for information retrieval
L Bonifacio, H Abonizio, M Fadaee, R Nogueira
Proceedings of the 45th International ACM SIGIR Conference on Research and …, 2022
652022
Inpars-v2: Large language models as efficient dataset generators for information retrieval
V Jeronymo, L Bonifacio, H Abonizio, M Fadaee, R Lotufo, J Zavrel, ...
arXiv preprint arXiv:2301.01820, 2023
622023
When less is more: Investigating data pruning for pretraining llms at scale
M Marion, A Üstün, L Pozzobon, A Wang, M Fadaee, S Hooker
arXiv preprint arXiv:2309.04564, 2023
372023
Examining the tip of the iceberg: A data set for idiom translation
M Fadaee, A Bisazza, C Monz
arXiv preprint arXiv:1802.04681, 2018
372018
Aya model: An instruction finetuned open-access multilingual language model
A Üstün, V Aryabumi, ZX Yong, WY Ko, D D'souza, G Onilude, N Bhandari, ...
arXiv preprint arXiv:2402.07827, 2024
362024
No parameter left behind: How distillation and model size affect zero-shot retrieval
GM Rosa, L Bonifacio, V Jeronymo, H Abonizio, M Fadaee, R Lotufo, ...
arXiv preprint arXiv:2206.02873, 2022
282022
Aya dataset: An open-access collection for multilingual instruction tuning
S Singh, F Vargus, D Dsouza, BF Karlsson, A Mahendiran, WY Ko, ...
arXiv preprint arXiv:2402.06619, 2024
232024
In defense of cross-encoders for zero-shot retrieval
G Rosa, L Bonifacio, V Jeronymo, H Abonizio, M Fadaee, R Lotufo, ...
arXiv preprint arXiv:2212.06121, 2022
202022
Learning Topic-Sensitive Word Representations
M Fadaee, A Bisazza, C Monz
Proceedings of the 55th Annual Meeting of the Association for Computational …, 2017
202017
Back to basics: Revisiting reinforce style optimization for learning from human feedback in llms
A Ahmadian, C Cremer, M Gallé, M Fadaee, J Kreutzer, A Üstün, ...
arXiv preprint arXiv:2402.14740, 2024
192024
The unreasonable volatility of neural machine translation models
M Fadaee, C Monz
arXiv preprint arXiv:2005.12398, 2020
152020
Aya 23: Open weight releases to further multilingual progress
V Aryabumi, J Dang, D Talupuru, S Dash, D Cairuz, H Lin, B Venkitesh, ...
arXiv preprint arXiv:2405.15032, 2024
142024
Elo uncovered: Robustness and best practices in language model evaluation
M Boubdir, E Kim, B Ermis, S Hooker, M Fadaee
arXiv preprint arXiv:2311.17295, 2023
112023
Data augmentation for low-resource neural machine translation. arXiv 2017
M Fadaee, A Bisazza, C Monz
arXiv preprint arXiv:1705.00440, 0
9
Automatic WordNet Construction Using Markov Chain Monte Carlo
M Fadaee, H Ghader, H Faili, A Shakery
Polibits, 13-22, 2013
72013
A New Neural Search and Insights Platform for Navigating and Organizing AI Research
M Fadaee, O Gureenkova, F Rejon-Barrera, C Schnober, W Weerkamp, ...
arXiv preprint arXiv:2011.00061, 2020
62020
系统目前无法执行此操作,请稍后再试。
文章 1–20