Vishrav Chaudhary
Microsoft AI
Verified email at microsoft.com
Title
Cited by
Year
Unsupervised cross-lingual representation learning at scale
A Conneau, K Khandelwal, N Goyal, V Chaudhary, G Wenzek, F Guzmán, ...
arXiv preprint arXiv:1911.02116, 2019
Cited by: 5751, Year: 2019
Holistic evaluation of language models
P Liang, R Bommasani, T Lee, D Tsipras, D Soylu, M Yasunaga, Y Zhang, ...
arXiv preprint arXiv:2211.09110, 2022
Cited by: 795, Year: 2022
Beyond English-centric multilingual machine translation
A Fan, S Bhosale, H Schwenk, Z Ma, A El-Kishky, S Goyal, M Baines, ...
Journal of Machine Learning Research 22 (107), 1-48, 2021
Cited by: 688, Year: 2021
CCNet: Extracting high quality monolingual datasets from web crawl data
G Wenzek, MA Lachaux, A Conneau, V Chaudhary, F Guzmán, A Joulin, ...
arXiv preprint arXiv:1911.00359, 2019
Cited by: 542, Year: 2019
Language is not all you need: Aligning perception with language models
S Huang, L Dong, W Wang, Y Hao, S Singhal, S Ma, T Lv, L Cui, ...
Advances in Neural Information Processing Systems 36, 2024
Cited by: 356*, Year: 2024
The FLORES-101 evaluation benchmark for low-resource and multilingual machine translation
N Goyal, C Gao, V Chaudhary, PJ Chen, G Wenzek, D Ju, S Krishnan, ...
Transactions of the Association for Computational Linguistics 10, 522-538, 2022
Cited by: 345, Year: 2022
Multilingual translation with extensible multilingual pretraining and finetuning
Y Tang, C Tran, X Li, PJ Chen, N Goyal, V Chaudhary, J Gu, A Fan
arXiv preprint arXiv:2008.00401, 2020
Cited by: 340, Year: 2020
WikiMatrix: Mining 135M parallel sentences in 1620 language pairs from Wikipedia
H Schwenk, V Chaudhary, S Sun, H Gong, F Guzmán
arXiv preprint arXiv:1907.05791, 2019
Cited by: 316, Year: 2019
The FLORES evaluation datasets for low-resource machine translation: Nepali-English and Sinhala-English
F Guzmán, PJ Chen, M Ott, J Pino, G Lample, P Koehn, V Chaudhary, ...
arXiv preprint arXiv:1902.01382, 2019
Cited by: 293, Year: 2019
Findings of the 2021 conference on machine translation (WMT21)
F Akhbardeh, A Arkhangorodsky, M Biesialska, O Bojar, R Chatterjee, V Chaudhary, ...
Proceedings of the Sixth Conference on Machine Translation, 1-88, 2021
Cited by: 166, Year: 2021
CCAligned: A massive collection of cross-lingual web-document pairs
A El-Kishky, V Chaudhary, F Guzmán, P Koehn
arXiv preprint arXiv:1911.06154, 2019
Cited by: 159, Year: 2019
Self-training improves pre-training for natural language understanding
J Du, E Grave, B Gunel, V Chaudhary, O Celebi, M Auli, V Stoyanov, ...
arXiv preprint arXiv:2010.02194, 2020
Cited by: 156, Year: 2020
Unsupervised quality estimation for neural machine translation
M Fomicheva, S Sun, L Yankovskaya, F Blain, F Guzmán, M Fishel, ...
Transactions of the Association for Computational Linguistics 8, 539-555, 2020
Cited by: 139, Year: 2020
Findings of the WMT 2021 shared task on quality estimation
L Specia, F Blain, M Fomicheva, C Zerva, Z Li, V Chaudhary, AFT Martins
Proceedings of the Sixth Conference on Machine Translation, 684-725, 2021
Cited by: 137, Year: 2021
Multilingual translation from denoising pre-training
Y Tang, C Tran, X Li, PJ Chen, N Goyal, V Chaudhary, J Gu, A Fan
Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021 …, 2021
Cited by: 126, Year: 2021
Phi-3 technical report: A highly capable language model locally on your phone
M Abdin, SA Jacobs, AA Awan, J Aneja, A Awadallah, H Awadalla, ...
arXiv preprint arXiv:2404.14219, 2024
Cited by: 113, Year: 2024
A length-extrapolatable transformer
Y Sun, L Dong, B Patra, S Ma, S Huang, A Benhaim, V Chaudhary, ...
arXiv preprint arXiv:2212.10554, 2022
Cited by: 94, Year: 2022
Low-Resource Corpus Filtering using Multilingual Sentence Embeddings
V Chaudhary, Y Tang, F Guzmán, H Schwenk, P Koehn
Proceedings of the Fourth Conference on Machine Translation (Volume 3 …, 2019
Cited by: 81, Year: 2019
Findings of the WMT 2019 shared task on parallel corpus filtering for low-resource conditions
P Koehn, F Guzmán, V Chaudhary, J Pino
Proceedings of the Fourth Conference on Machine Translation (Volume 3 …, 2019
Cited by: 80, Year: 2019
Findings of the AmericasNLP 2021 shared task on open machine translation for indigenous languages of the Americas
M Mager, A Oncevay, A Ebrahimi, J Ortega, AR Gonzales, A Fan, ...
Proceedings of the First Workshop on Natural Language Processing for …, 2021
Cited by: 75, Year: 2021
Articles 1–20