Xiang Cheng

Cited by

	All	Since 2019
Citations	1308	1231
h-index	12	12
i10-index	12	12

300

150

225

2017201820192020202120222023202415 58 81 146 192 222 299 281

Public access

View all

7 articles

0 articles

available

not available

Based on funding mandates

Xiang Cheng

Massachusetts Institute of Technology

Verified email at berkeley.edu - Homepage

stochastic processes machine learning theory of deep learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Underdamped Langevin MCMC: A non-asymptotic analysis X Cheng, NS Chatterji, PL Bartlett, MI Jordan Conference on learning theory, 300-323, 2018	325	2018
Convergence of Langevin MCMC in KL-divergence X Cheng, P Bartlett Algorithmic Learning Theory, 186-211, 2018	205	2018
Sharp convergence rates for Langevin dynamics in the nonconvex setting X Cheng, NS Chatterji, Y Abbasi-Yadkori, PL Bartlett, MI Jordan arXiv preprint arXiv:1805.01648, 2018	184	2018
Is there an analog of Nesterov acceleration for gradient-based MCMC? YA Ma, NS Chatterji, X Cheng, N Flammarion, PL Bartlett, MI Jordan	157	2021
Asymptotic behavior of\ell_p-based laplacian regularization in semi-supervised learning A El Alaoui, X Cheng, A Ramdas, MJ Wainwright, MI Jordan Conference on Learning Theory, 879-906, 2016	109	2016
Transformers learn to implement preconditioned gradient descent for in-context learning K Ahn, X Cheng, H Daneshmand, S Sra Advances in Neural Information Processing Systems 36, 2024	86	2024
Optimal dimension dependence of the Metropolis-adjusted Langevin algorithm S Chewi, C Lu, K Ahn, X Cheng, T Le Gouic, P Rigollet Conference on Learning Theory, 1260-1300, 2021	66	2021
Stochastic Gradient and Langevin Processes X Cheng, D Yin, PL Bartlett, MI Jordan arXiv preprint arXiv:1907.03215, 2019	55*	2019
Exploiting optimization for local graph clustering K Fountoulakis, X Cheng, J Shun, F Roosta-Khorasani, MW Mahoney arXiv preprint arXiv:1602.01886, 2016	47*	2016
Restart sampling for improving generative processes Y Xu, M Deng, X Cheng, Y Tian, Z Liu, T Jaakkola Advances in Neural Information Processing Systems 36, 76806-76838, 2023	25	2023
Linear attention is (maybe) all you need (to understand transformer optimization) K Ahn, X Cheng, M Song, C Yun, A Jadbabaie, S Sra arXiv preprint arXiv:2310.01082, 2023	20	2023
Transformers implement functional gradient descent to learn non-linear functions in context X Cheng, Y Chen, S Sra arXiv preprint arXiv:2312.06528, 2023	12	2023
Efficient Sampling on Riemannian Manifolds via Langevin MCMC X Cheng, J Zhang, S Sra Advances in Neural Information Processing Systems, 2022	6	2022
Theory and algorithms for diffusion processes on Riemannian manifolds X Cheng, J Zhang, S Sra arXiv preprint arXiv:2204.13665, 2022	5	2022
The Interplay between Sampling and Optimization X Cheng University of California, Berkeley, 2020	4	2020
Fast conditional mixing of mcmc algorithms for non-log-concave distributions X Cheng, B Wang, J Zhang, Y Zhu Advances in Neural Information Processing Systems 36, 2024	2	2024
Riemannian Bilevel Optimization S Dutta, X Cheng, S Sra arXiv preprint arXiv:2405.15816, 2024		2024
Fast conditional mixing of MCMC algorithms for non-log-concave distributions B Wang, X Cheng, J Zhang, Y Zhu Proceedings of the 37th International Conference on Neural Information …, 2023		2023
FLAG n’FLARE: Fast Linearly-Coupled Adaptive Gradient Methods X Cheng, F Roosta, S Palombo, P Bartlett, M Mahoney International Conference on Artificial Intelligence and Statistics, 404-414, 2018		2018

The system can't perform the operation now. Try again later.

Articles 1–19

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by