Lihong Li (李力鸿)

Citirano

	Sve	Od 2019.
Citati	25200	17155
H-indeks	65	56
i10-indeks	101	85

3900

1950

975

2925

2008200920102011201220132014201520162017201820192020202120222023202493 189 217 342 437 544 592 831 986 1225 1765 2332 3044 3532 3392 3822 1033

Javni pristup

Prikaži sve

14 članaka

0 članaka

dostupno

nije dostupno

Na temelju uvjeta financiranja

Suautori

John LangfordMicrosoft Research New YorkPotvrđena adresa e-pošte na hunch.net
Michael LittmanBrown UniversityPotvrđena adresa e-pošte na brown.edu
Jianfeng GaoMicrosoft Research, RedmondPotvrđena adresa e-pošte na microsoft.com
Wei Chu（褚崴）InfPotvrđena adresa e-pošte na gatsby.ucl.ac.uk
Li DengChief AI Officer, Citadel (former)Potvrđena adresa e-pošte na ieee.org
Robert SchapireMicrosoft ResearchPotvrđena adresa e-pošte na microsoft.com
Bo DaiGoogle Brain & Georgia TechPotvrđena adresa e-pošte na google.com
Denny ZhouResearch Scientist, Google DeepMindPotvrđena adresa e-pošte na google.com
Jianshu ChenPrincipal Scientist, AmazonPotvrđena adresa e-pošte na ucla.edu
Asli CelikyilmazResearcher @ FAIR at Meta AIPotvrđena adresa e-pošte na ieee.org
Dale SchuurmansUniversity of Alberta, Google DeepMindPotvrđena adresa e-pošte na cs.ualberta.ca
Zachary C. LiptonRaj Reddy Associate Professor of Machine Learning @ Carnegie Mellon University; CTO + CSO @ AbridgePotvrđena adresa e-pošte na cmu.edu
Yun-Nung (Vivian) ChenNational Taiwan UniversityPotvrđena adresa e-pošte na ieee.org
Emma BrunskillAssociate Professor of Computer Science, Stanford UniversityPotvrđena adresa e-pošte na cs.stanford.edu
Faisal Ahmed, PhDMicrosoftPotvrđena adresa e-pošte na microsoft.com
Thomas J. WalshSony AIPotvrđena adresa e-pošte na sony.com
Xiujun LiUniversity of Washington / ApplePotvrđena adresa e-pošte na cs.washington.edu
Chong WangApplePotvrđena adresa e-pošte na cs.princeton.edu
Csaba SzepesvariDeepMind & University of AlbertaPotvrđena adresa e-pošte na cs.ualberta.ca
Ofir NachumOpenAIPotvrđena adresa e-pošte na openai.com

Prati

Lihong Li (李力鸿)

Amazon

Potvrđena adresa e-pošte na amazon.com - Početna stranica

Reinforcement Learning Machine Learning Artificial Intelligence


Naslov Poredaj po navodima Poredaj po godini Poredaj po naslovu	Citirano Citirano	Godina
A contextual-bandit approach to personalized news article recommendation L Li, W Chu, J Langford, RE Schapire Proceedings of the 19th international conference on World wide web, 661-670, 2010	3243	2010
An empirical evaluation of thompson sampling O Chapelle, L Li Advances in neural information processing systems 24, 2011	1721	2011
Parallelized stochastic gradient descent M Zinkevich, M Weimer, L Li, A Smola Advances in neural information processing systems 23, 2010	1706	2010
Contextual bandits with linear payoff functions W Chu, L Li, L Reyzin, R Schapire Proceedings of the Fourteenth International Conference on Artificial …, 2011	1171	2011
Doubly robust policy evaluation and learning M Dudík, J Langford, L Li arXiv preprint arXiv:1103.4601, 2011	887	2011
Doubly Robust Policy Evaluation and Learning M Dudık, J Langford, L Li	887*
Neural approaches to conversational AI J Gao, M Galley, L Li The 41st international ACM SIGIR conference on research & development in …, 2018	870	2018
Doubly robust off-policy value evaluation for reinforcement learning N Jiang, L Li International conference on machine learning, 652-661, 2016	818	2016
Unbiased offline evaluation of contextual-bandit-based news article recommendation algorithms L Li, W Chu, J Langford, X Wang Proceedings of the fourth ACM international conference on Web search and …, 2011	651	2011
PAC model-free reinforcement learning AL Strehl, L Li, E Wiewiora, J Langford, ML Littman Proceedings of the 23rd international conference on Machine learning, 881-888, 2006	622	2006
Sparse Online Learning via Truncated Gradient. J Langford, L Li, T Zhang Journal of Machine Learning Research 10 (3), 2009	591	2009
Towards a unified theory of state abstraction for MDPs. L Li, TJ Walsh, ML Littman AI&M 1 (2), 3, 2006	580	2006
Taming the monster: A fast and simple algorithm for contextual bandits A Agarwal, D Hsu, S Kale, J Langford, L Li, R Schapire International Conference on Machine Learning, 1638-1646, 2014	545	2014
Towards end-to-end reinforcement learning of dialogue agents for information access B Dhingra, L Li, X Li, J Gao, YN Chen, F Ahmed, L Deng arXiv preprint arXiv:1609.00777, 2016	515*	2016
Doubly robust policy evaluation and optimization M Dudík, D Erhan, J Langford, L Li	491	2014
End-to-end task-completion neural dialogue systems X Li, YN Chen, L Li, J Gao, A Celikyilmaz arXiv preprint arXiv:1703.01008, 2017	446	2017
Neuro-symbolic program synthesis E Parisotto, A Mohamed, R Singh, L Li, D Zhou, P Kohli arXiv preprint arXiv:1611.01855, 2016	396	2016
Reinforcement Learning in Finite MDPs: PAC Analysis. AL Strehl, L Li, ML Littman Journal of Machine Learning Research 10 (11), 2009	364	2009
Breaking the curse of horizon: Infinite-horizon off-policy estimation Q Liu, L Li, Z Tang, D Zhou Advances in neural information processing systems 31, 2018	361	2018
Contextual bandit algorithms with supervised learning guarantees A Beygelzimer, J Langford, L Li, L Reyzin, RE Schapire Arxiv preprint arXiv:1002.4058, 2010	341	2010

Sustav trenutno ne može provesti ovu radnju. Pokušajte ponovo kasnije.

Članci 1–20

Godišnji broj citata

Dvostruki navodi

Spojeni navodi

Dodavanje suautoraSuautori

Prati

Citirano

Suautori