Dota 2 with large scale deep reinforcement learning C Berner, G Brockman, B Chan, V Cheung, P Dębiak, C Dennison, ... arXiv preprint arXiv:1912.06680, 2019 | 1591 | 2019 |
Opt: Open pre-trained transformer language models S Zhang, S Roller, N Goyal, M Artetxe, M Chen, S Chen, C Dewan, ... arXiv preprint arXiv:2205.01068, 2022 | 1367* | 2022 |
Lima: Less is more for alignment C Zhou, P Liu, P Xu, S Iyer, J Sun, Y Mao, X Ma, A Efrat, P Yu, L Yu, ... arXiv preprint arXiv:2305.11206, 2023 | 141 | 2023 |
xformers: A modular and hackable transformer modelling library B Lefaudeux, F Massa, D Liskovich, W Xiong, V Caggiano, S Naren, M Xu, ... | 40 | 2021 |
Openai five, 2018 J Pachocki, G Brockman, J Raiman, S Zhang, H Pondé, J Tang, F Wolski, ... URL https://blog. openai. com/openai-five, 2018 | 27* | 2018 |
Scaling laws for generative mixed-modal language models A Aghajanyan, L Yu, A Conneau, WN Hsu, K Hambardzumyan, S Zhang, ... arXiv preprint arXiv:2301.03728, 2023 | 22 | 2023 |
Scaling autoregressive multi-modal models: Pretraining and instruction tuning L Yu, B Shi, R Pasunuru, B Muller, O Golovneva, T Wang, A Babu, B Tang, ... arXiv preprint arXiv:2309.02591, 2023 | 17 | 2023 |
Long-term planning and situational awareness in openai five J Raiman, S Zhang, F Wolski arXiv preprint arXiv:1912.06721, 2019 | 10 | 2019 |
A theory on adam instability in large-scale machine learning I Molybog, P Albert, M Chen, Z DeVito, D Esiobu, N Goyal, PS Koura, ... arXiv preprint arXiv:2304.09871, 2023 | 7 | 2023 |
Neural network surgery with sets J Raiman, S Zhang, C Dennison arXiv preprint arXiv:1912.06719, 2019 | 6 | 2019 |
Hirsch index and a co-authorship network S Zhang | 1 | 2011 |
Children’s E-Learning Interactions and Perceived Outcomes with Educational Key Opinion Leaders in China S Zhang, J Shen, J Yan Communications of the Association for Information Systems 53 (1), 20, 2023 | | 2023 |
The PlaceIQ Analytic Platform: Location Oriented Approaches to Mobile Audiences JM Huerta, J Lenaghan, S Milton, K Brackney, A Kapila, R Shraga, ... Proceedings of the Eighth International Workshop on Data Mining for Online …, 2014 | | 2014 |
Scale-Free Networks and Proximity Measures S Zhang | | 2011 |
Decomposition of Mathematical Models of Quantum Information Transference Using a Simulated Annealing Technique S Zhang | | 2010 |