Shipra Agrawal

Cited by

	All	Since 2019
Citations	6521	4593
h-index	28	22
i10-index	40	31

1000

500

250

750

200620072008200920102011201220132014201520162017201820192020202120222023202432 24 39 52 61 68 66 97 144 185 261 337 442 598 811 915 955 991 321

Public access

View all

7 articles

1 article

available

not available

Based on funding mandates

Co-authors

Yinyu YeK.T. Li Professor of Engineering, Stanford UniversityVerified email at stanford.edu
nikhil r. devanurAmazonVerified email at nikhildevanur.com
Jayant HaritsaIndian Institute of ScienceVerified email at iisc.ac.in
Zizhuo WangThe Chinese University of Hong Kong, Shenzhen / Cardinal OperationsVerified email at cuhk.edu.cn
Rajeev RastogiAmazonVerified email at amazon.com
Amin SaberiProfessor, Stanford UniversityVerified email at stanford.edu
Vijay KrishnanIIT Bombay, Stanford University, https://turing.comVerified email at cs.stanford.edu
Yichuan DingDesautels Faculty of Management, McGill UniversityVerified email at mcgill.ca
Lihong Li (李力鸿)AmazonVerified email at amazon.com
Erick DelageProfessor, Department of Decision Sciences, HEC MontréalVerified email at hec.ca
Tomáš KocákUniversity of PotsdamVerified email at uni-potsdam.de
Michal ValkoLlama @ Meta Paris & Inria & MVA - Ex: Gemini and BYOL @ Google DeepMindVerified email at meta.com
Rémi MunosDeepMindVerified email at inria.fr
Supratim DebTechnical Lead @ MetaVerified email at fb.com
B. Aditya PrakashAssociate Professor, Georgia Institute of TechnologyVerified email at cs.cmu.edu
Nimrod MegiddoDistinguished Research Staff Member, IBM Almaden Research CenterVerified email at us.ibm.com

Shipra Agrawal

Columbia university

Verified email at columbia.edu - Homepage

multi-armed bandits reinforcement learning online and stochastic optmization


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Analysis of thompson sampling for the multi-armed bandit problem S Agrawal, N Goyal Conference on learning theory, 39.1-39.26, 2012	1500	2012
Thompson sampling for contextual bandits with linear payoffs S Agrawal, N Goyal International conference on machine learning, 127-135, 2013	1136	2013
Near-optimal regret bounds for thompson sampling S Agrawal, N Goyal Journal of the ACM (JACM) 64 (5), 1-24, 2017	641	2017
A dynamic near-optimal algorithm for online linear programming S Agrawal, Z Wang, Y Ye Operations Research 62 (4), 876-890, 2014	346	2014
A framework for high-accuracy privacy-preserving mining S Agrawal, JR Haritsa 21st International Conference on Data Engineering (ICDE'05), 193-204, 2005	239	2005
Optimistic posterior sampling for reinforcement learning: worst-case regret bounds S Agrawal, R Jia Advances in Neural Information Processing Systems 30, 2017	226	2017
A near-optimal exploration-exploitation approach for assortment selection S Agrawal, V Avadhanula, V Goyal, A Zeevi Proceedings of the 2016 ACM Conference on Economics and Computation, 599-600, 2016	217*	2016
Bandits with concave rewards and convex knapsacks S Agrawal, NR Devanur Proceedings of the fifteenth ACM conference on Economics and computation …, 2014	212	2014
Fast Algorithms for Online Stochastic Convex Programming S Agrawal, NR Devanur SODA 2015, 2015	189	2015
Reinforcement learning for integer programming: Learning to cut Y Tang, S Agrawal, Y Faenza International conference on machine learning, 9367-9376, 2020	187	2020
Price of correlations in stochastic optimization S Agrawal, Y Ding, A Saberi, Y Ye Operations Research 60 (1), 150-162, 2012	163*	2012
Linear contextual bandits with knapsacks S Agrawal, N Devanur Advances in Neural Information Processing Systems 29, 2016	152	2016
Thompson sampling for the mnl-bandit S Agrawal, V Avadhanula, V Goyal, A Zeevi Conference on learning theory, 76-78, 2017	120	2017
Bandits with delayed, aggregated anonymous feedback C Pike-Burke, S Agrawal, C Szepesvari, S Grunewalder International Conference on Machine Learning, 4105-4113, 2018	119	2018
Discretizing continuous action space for on-policy optimization Y Tang, S Agrawal Proceedings of the aaai conference on artificial intelligence 34 (04), 5981-5988, 2020	113	2020
On addressing efficiency concerns in privacy-preserving mining S Agrawal, V Krishnan, JR Haritsa Database Systems for Advanced Applications: 9th International Conference …, 2004	111	2004
An efficient algorithm for contextual bandits with knapsacks, and an extension to concave objectives S Agrawal, NR Devanur, L Li Conference on Learning Theory, 4-18, 2016	99	2016
Efficient detection of distributed constraint violations S Agrawal, S Deb, KVM Naidu, R Rastogi 2007 IEEE 23rd International Conference on Data Engineering, 1320-1324, 2006	75	2006
Learning in structured mdps with convex cost functions: Improved regret bounds for inventory management S Agrawal, R Jia Proceedings of the 2019 ACM Conference on Economics and Computation, 743-744, 2019	71	2019
A unified framework for dynamic prediction market design S Agrawal, E Delage, M Peters, Z Wang, Y Ye Operations research 59 (3), 550-568, 2011	65*	2011

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors