Shuming Ma

Citirano

	Sve	Od 2019.
Citati	3990	3811
H-indeks	34	33
i10-indeks	61	58

1500

750

375

1125

2017201820192020202120222023202424 153 220 294 416 568 1494 813

Javni pristup

Prikaži sve

16 članaka

1 članak

dostupno

nije dostupno

Na temelju uvjeta financiranja

Suautori

Furu WeiPartner Research Manager, Microsoft ResearchPotvrđena adresa e-pošte na microsoft.com
Xu SunAssociate Professor, Peking UniversityPotvrđena adresa e-pošte na pku.edu.cn
houfeng wangPeking UniversityPotvrđena adresa e-pošte na pku.edu.cn
Junyang LinQwen Team, Alibaba Group & Peking UniversityPotvrđena adresa e-pošte na alibaba-inc.com
Lei CuiMicrosoft Research AsiaPotvrđena adresa e-pošte na microsoft.com
Tianyu LiuAlibabaPotvrđena adresa e-pošte na pku.edu.cn
Jingjing XuShanghai AI LabPotvrđena adresa e-pošte na pku.edu.cn
Wenjie LiThe Hong Kong Polytechnic UniversityPotvrđena adresa e-pošte na comp.polyu.edu.hk
Sujian LIPeking Univ.Potvrđena adresa e-pošte na pku.edu.cn
Yizhong WangUniversity of WashingtonPotvrđena adresa e-pošte na cs.washington.edu

Prati

Shuming Ma

Microsoft Research Asia

Potvrđena adresa e-pošte na microsoft.com - Početna stranica

Natural language processing deep learning


Naslov Poredaj po navodima Poredaj po godini Poredaj po naslovu	Citirano Citirano	Godina
SGM: sequence generation model for multi-label classification P Yang, X Sun, W Li, S Ma, W Wu, H Wang arXiv preprint arXiv:1806.04822, 2018	417	2018
Language is not all you need: Aligning perception with language models S Huang, L Dong, W Wang, Y Hao, S Singhal, S Ma, T Lv, L Cui, ... Advances in Neural Information Processing Systems 36, 2024	259	2024
Kosmos-2: Grounding multimodal large language models to the world Z Peng, W Wang, L Dong, Y Hao, S Huang, S Ma, F Wei arXiv preprint arXiv:2306.14824, 2023	224	2023
Graph of thoughts: Solving elaborate problems with large language models M Besta, N Blach, A Kubicek, R Gerstenberger, M Podstawski, ... Proceedings of the AAAI Conference on Artificial Intelligence 38 (16), 17682 …, 2024	218	2024
Why can gpt learn in-context? language models implicitly perform gradient descent as meta-optimizers D Dai, Y Sun, L Dong, Y Hao, S Ma, Z Sui, F Wei arXiv preprint arXiv:2212.10559, 2022	202	2022
Global encoding for abstractive summarization J Lin, X Sun, S Ma, Q Su arXiv preprint arXiv:1805.03989, 2018	184	2018
meprop: Sparsified back propagation for accelerated deep learning with reduced overfitting X Sun, X Ren, S Ma, H Wang International Conference on Machine Learning, 3299-3308, 2017	176	2017
Deepnet: Scaling transformers to 1,000 layers H Wang, S Ma, L Dong, S Huang, D Zhang, F Wei IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024	108	2024
Xlm-e: Cross-lingual language model pre-training via electra Z Chi, S Huang, L Dong, S Ma, B Zheng, S Singhal, P Bajaj, X Song, ... arXiv preprint arXiv:2106.16138, 2021	104	2021
A simple and effective unified encoder for document-level machine translation S Ma, D Zhang, M Zhou Proceedings of the 58th annual meeting of the association for computational …, 2020	91	2020
Language models are general-purpose interfaces Y Hao, H Song, L Dong, S Huang, Z Chi, W Wang, S Ma, F Wei arXiv preprint arXiv:2206.06336, 2022	82	2022
Improving semantic relevance for sequence-to-sequence learning of chinese social media text summarization S Ma, X Sun, J Xu, H Wang, W Li, Q Su arXiv preprint arXiv:1706.02459, 2017	78	2017
Retentive network: A successor to transformer for large language models Y Sun, L Dong, S Huang, S Ma, Y Xia, J Xue, J Wang, F Wei arXiv preprint arXiv:2307.08621, 2023	76	2023
Query and output: Generating words by querying distributed word representations for paraphrase generation S Ma, X Sun, W Li, S Li, W Li, X Ren arXiv preprint arXiv:1803.01465, 2018	75	2018
Bag-of-words as target for neural machine translation S Ma, X Sun, Y Wang, J Lin arXiv preprint arXiv:1805.04871, 2018	73	2018
A length-extrapolatable transformer Y Sun, L Dong, B Patra, S Ma, S Huang, A Benhaim, V Chaudhary, ... arXiv preprint arXiv:2212.10554, 2022	68	2022
Alternating language modeling for cross-lingual pre-training J Yang, S Ma, D Zhang, S Wu, Z Li, M Zhou Proceedings of the AAAI Conference on Artificial Intelligence 34 (05), 9386-9393, 2020	68	2020
Semantic-unit-based dilated convolution for multi-label text classification J Lin, Q Su, P Yang, S Ma, X Sun arXiv preprint arXiv:1808.08561, 2018	65	2018
mT6: Multilingual pretrained text-to-text transformer with translation pairs Z Chi, L Dong, S Ma, SHXL Mao, H Huang, F Wei arXiv preprint arXiv:2104.08692, 2021	64	2021
Deltalm: Encoder-decoder pre-training for language generation and translation by augmenting pretrained multilingual encoders S Ma, L Dong, S Huang, D Zhang, A Muzio, S Singhal, HH Awadalla, ... arXiv preprint arXiv:2106.13736, 2021	62	2021

Sustav trenutno ne može provesti ovu radnju. Pokušajte ponovo kasnije.

Članci 1–20

Godišnji broj citata

Dvostruki navodi

Spojeni navodi

Dodavanje suautoraSuautori

Prati

Citirano

Suautori