Follow
Vishrav Chaudhary
Vishrav Chaudhary
Microsoft AI
Verified email at microsoft.com - Homepage
Title
Cited by
Cited by
Year
Unsupervised cross-lingual representation learning at scale
A Conneau
arXiv preprint arXiv:1911.02116, 2019
65712019
Holistic evaluation of language models
P Liang, R Bommasani, T Lee, D Tsipras, D Soylu, M Yasunaga, Y Zhang, ...
arXiv preprint arXiv:2211.09110, 2022
11062022
Beyond english-centric multilingual machine translation
A Fan, S Bhosale, H Schwenk, Z Ma, A El-Kishky, S Goyal, M Baines, ...
Journal of Machine Learning Research 22 (107), 1-48, 2021
8152021
CCNet: Extracting high quality monolingual datasets from web crawl data
G Wenzek, MA Lachaux, A Conneau, V Chaudhary, F Guzmán, A Joulin, ...
arXiv preprint arXiv:1911.00359, 2019
6272019
Phi-3 technical report: A highly capable language model locally on your phone
M Abdin, J Aneja, H Awadalla, A Awadallah, AA Awan, N Bach, A Bahree, ...
arXiv preprint arXiv:2404.14219, 2024
5492024
Language is not all you need: Aligning perception with language models
S Huang, L Dong, W Wang, Y Hao, S Singhal, S Ma, T Lv, L Cui, ...
Advances in Neural Information Processing Systems 36, 72096-72109, 2023
483*2023
The flores-101 evaluation benchmark for low-resource and multilingual machine translation
N Goyal, C Gao, V Chaudhary, PJ Chen, G Wenzek, D Ju, S Krishnan, ...
Transactions of the Association for Computational Linguistics 10, 522-538, 2022
4232022
Multilingual translation with extensible multilingual pretraining and finetuning
Y Tang, C Tran, X Li, PJ Chen, N Goyal, V Chaudhary, J Gu, A Fan
arXiv preprint arXiv:2008.00401, 2020
3962020
Wikimatrix: Mining 135m parallel sentences in 1620 language pairs from wikipedia
H Schwenk, V Chaudhary, S Sun, H Gong, F Guzmán
arXiv preprint arXiv:1907.05791, 2019
3512019
The flores evaluation datasets for low-resource machine translation: Nepali-english and sinhala-english
F Guzmán, PJ Chen, M Ott, J Pino, G Lample, P Koehn, V Chaudhary, ...
arXiv preprint arXiv:1902.01382, 2019
3162019
Findings of the 2021 conference on machine translation (WMT21)
F Akhbardeh, A Arkhangorodsky, M Biesialska, O Bojar, R Chatterjee, ...
Proceedings of the sixth conference on machine translation, 1-88, 2021
1912021
CCAligned: A massive collection of cross-lingual web-document pairs
A El-Kishky, V Chaudhary, F Guzmán, P Koehn
arXiv preprint arXiv:1911.06154, 2019
1792019
Self-training improves pre-training for natural language understanding
J Du, E Grave, B Gunel, V Chaudhary, O Celebi, M Auli, V Stoyanov, ...
arXiv preprint arXiv:2010.02194, 2020
1692020
Unsupervised quality estimation for neural machine translation
M Fomicheva, S Sun, L Yankovskaya, F Blain, F Guzmán, M Fishel, ...
Transactions of the Association for Computational Linguistics 8, 539-555, 2020
1592020
Multilingual translation from denoising pre-training
Y Tang, C Tran, X Li, PJ Chen, N Goyal, V Chaudhary, J Gu, A Fan
Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 …, 2021
1502021
Findings of the WMT 2021 shared task on quality estimation
L Specia, F Blain, M Fomicheva, C Zerva, Z Li, V Chaudhary, AFT Martins
Proceedings of the Sixth Conference on Machine Translation, 684-725, 2021
1462021
A length-extrapolatable transformer
Y Sun, L Dong, B Patra, S Ma, S Huang, A Benhaim, V Chaudhary, ...
arXiv preprint arXiv:2212.10554, 2022
1352022
AmericasNLI: Evaluating zero-shot natural language understanding of pretrained multilingual models in truly low-resource languages
A Ebrahimi, M Mager, A Oncevay, V Chaudhary, L Chiruzzo, A Fan, ...
arXiv preprint arXiv:2104.08726, 2021
852021
Low-Resource Corpus Filtering using Multilingual Sentence Embeddings
V Chaudhary, Y Tang, F Guzmán, H Schwenk, P Koehn
Proceedings of the Fourth Conference on Machine Translation (Volume 3 …, 2019
852019
Findings of the WMT 2019 shared task on parallel corpus filtering for low-resource conditions
P Koehn, F Guzmán, V Chaudhary, J Pino
Proceedings of the Fourth Conference on Machine Translation (Volume 3 …, 2019
842019
The system can't perform the operation now. Try again later.
Articles 1–20