Stephen Zhao
Computer Science Student, University of Toronto
Maximum entropy gain exploration for long horizon multi-goal reinforcement learning
S Pitis, H Chan, S Zhao, B Stadie, J Ba
International Conference on Machine Learning, 7750-7761, 2020
Joint energy-based models for semi-supervised classification
S Zhao, JH Jacobsen, W Grathwohl
ICML 2020 Workshop on Uncertainty and Robustness in Deep Learning 1, 2020
Proximal learning with opponent-learning awareness
S Zhao, C Lu, RB Grosse, J Foerster
Advances in Neural Information Processing Systems 35, 26324-26336, 2022
Probabilistic inference in language models via twisted sequential Monte Carlo
S Zhao, R Brekelmans, A Makhzani, R Grosse
arXiv preprint arXiv:2404.17546, 2024
Reproducing "Are Sixteen Heads Really Better than One?"
S Zhao, S Yuan
Layer-Wise Contrastive Unsupervised Representation Learning
S Zhao