Mengdi Wang

Cited by

	All	Since 2019
Citations	6176	5790
h-index	44	43
i10-index	91	86

Citations per year
Year	Citations
2015	22
2016	45
2017	112
2018	154
2019	263
2020	498
2021	916
2022	1108
2023	1438
2024	1563

Public access

Based on funding mandates
47 articles	available
0 articles	not available

Co-authors

Lin F. Yang (杨林)	Assistant Professor, Department of Electrical and Computer Engineering @ UCLA	Verified email at ee.ucla.edu
Alec Koppel	AI Research Lead, JP Morgan AI Research	Verified email at jpmchase.com
Csaba Szepesvari	DeepMind & University of Alberta	Verified email at cs.ualberta.ca
Yinyu Ye	K.T. Li Professor of Engineering, Stanford University	Verified email at stanford.edu
Tuo Zhao	Associate Professor, Georgia Tech	Verified email at gatech.edu
Dimitri Bertsekas	Arizona State University - Massachusetts Institute of Technology	Verified email at mit.edu
Ethan X. Fang	Associate Professor at Duke University	Verified email at duke.edu
Aaron Sidford	Stanford University	Verified email at stanford.edu
Botao Hao	OpenAI	Verified email at openai.com
Anru Zhang	Duke University	Verified email at duke.edu
Michael I. Jordan	Professor of Electrical Engineering and Computer Sciences and Professor of Statistics, UC Berkeley	Verified email at cs.berkeley.edu
Zhaoran Wang	Associate Professor at Northwestern University	Verified email at northwestern.edu
Yu-Xiang Wang	Associate Professor @ UC San Diego	Verified email at ucsd.edu
Prateek Mittal	Professor, Princeton University	Verified email at princeton.edu
Tong Zhang	UIUC	Verified email at tongzhang-ml.org
Saeed Ghadimi	University of Waterloo	Verified email at uwaterloo.ca
Tor Lattimore	DeepMind	Verified email at google.com
Andrzej Ruszczyński	Board of Governors Professor of Rutgers University	Verified email at business.rutgers.edu
Zheng Tracy Ke	Harvard University	Verified email at fas.harvard.edu
Lihong Li (李力鸿)	Amazon	Verified email at amazon.com

Mengdi Wang

Center for Statistics & Machine Learning, ECE, Princeton University

Verified email at princeton.edu

reinforcement learning, optimization, machine learning, data science, control


Title	Authors	Publication	Cited by	Year
Sample-optimal parametric q-learning using linearly additive features	L Yang, M Wang	International conference on machine learning, 6995-7004, 2019	367	2019
Model-based reinforcement learning with value-targeted regression	A Ayoub, Z Jia, C Szepesvari, M Wang, L Yang	International Conference on Machine Learning, 463-474, 2020	329	2020
Reinforcement Learning in Feature Space: Matrix Bandit, Kernels, and Regret Bound	LF Yang, M Wang	International Conference on Machine Learning, 2020, 2019	325	2019
Stochastic compositional gradient descent: algorithms for minimizing compositions of expected-value functions	M Wang, EX Fang, H Liu	Mathematical Programming 161, 419-449, 2017	279	2017
Approximation methods for bilevel programming	S Ghadimi, M Wang	arXiv preprint arXiv:1802.02246, 2018	264	2018
Near-optimal time and sample complexities for solving Markov decision processes with a generative model	A Sidford, M Wang, X Wu, L Yang, Y Ye	Advances in Neural Information Processing Systems 31, 2018	263	2018
Minimax-optimal off-policy evaluation with linear function approximation	Y Duan, Z Jia, M Wang	International Conference on Machine Learning, 2701-2709, 2020	165	2020
Accelerating stochastic composition optimization	M Wang, J Liu, EX Fang	Journal of Machine Learning Research, 2017, 2016	159	2016
Variational policy gradient method for reinforcement learning with general utilities	J Zhang, A Koppel, AS Bedi, C Szepesvari, M Wang	Advances in Neural Information Processing Systems 2020, 2020	145	2020
Variance reduced value iteration and faster algorithms for solving markov decision processes	A Sidford, M Wang, X Wu, Y Ye	Proceedings of the Twenty-Ninth Annual ACM-SIAM Symposium on Discrete …, 2017	141*	2017
Visual adversarial examples jailbreak aligned large language models	X Qi, K Huang, A Panda, P Henderson, M Wang, P Mittal	Proceedings of the AAAI Conference on Artificial Intelligence 38 (19), 21527 …, 2024	137*	2024
A single timescale stochastic approximation method for nested stochastic optimization	S Ghadimi, A Ruszczynski, M Wang	SIAM Journal on Optimization 30 (1), 960-979, 2020	130	2020
Stochastic first-order methods with random constraint projection	M Wang, DP Bertsekas	SIAM Journal on Optimization 26 (1), 681-717, 2016	120*	2016
Towards compact cnns via collaborative compression	Y Li, S Lin, J Liu, Q Ye, M Wang, F Chao, F Yang, J Ma, Q Tian, R Ji	Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021	100	2021
On function approximation in reinforcement learning: Optimism in the face of large state spaces	Z Yang, C Jin, Z Wang, M Wang, MI Jordan	arXiv preprint arXiv:2011.04622, 2020	100*	2020
Finite-sum composition optimization via variance reduced gradient descent	X Lian, M Wang, J Liu	Artificial Intelligence and Statistics. 2017., 2016	97	2016
Score approximation, estimation and distribution recovery of diffusion models on low-dimensional data	M Chen, K Huang, T Zhao, M Wang	International Conference on Machine Learning, 2024	94	2024
Solving discounted stochastic two-player games with near-optimal time and sample complexity	A Sidford, M Wang, L Yang, Y Ye	International Conference on Artificial Intelligence and Statistics, 2992-3002, 2020	86	2020
Randomized linear programming solves the markov decision problem in nearly linear (sometimes sublinear) time	M Wang	Mathematics of Operations Research 45 (2), 517-546, 2020	83*	2020
A distributed tracking algorithm for reconstruction of graph signals	X Wang, M Wang, Y Gu	IEEE Journal of Selected Topics in Signal Processing 9 (4), 728-740, 2015	79	2015


Articles 1–20
