Yao Liu

Citirano

	Sve	Od 2019.
Citati	673	666
H-indeks	9	9
i10-indeks	9	9

180

135

20182019202020212022202320246 45 75 165 151 155 75

Javni pristup

Prikaži sve

8 članaka

0 članaka

dostupno

nije dostupno

Na temelju uvjeta financiranja

Suautori

Emma BrunskillAssociate Professor of Computer Science, Stanford UniversityPotvrđena adresa e-pošte na cs.stanford.edu
Omer GottesmanAmazonPotvrđena adresa e-pošte na amazon.com
Finale Doshi-VelezProfessor, HarvardPotvrđena adresa e-pošte na seas.harvard.edu
Alekh AgarwalGooglePotvrđena adresa e-pošte na google.com
Adith SwaminathanMicrosoft ResearchPotvrđena adresa e-pošte na microsoft.com
Pierre-Luc BaconUniversity of MontrealPotvrđena adresa e-pošte na mila.quebec
Zhaohan Daniel GuoDeepMindPotvrđena adresa e-pošte na google.com
Allen NieStanford UniversityPotvrđena adresa e-pošte na stanford.edu
Rasool FakoorAmazon Web ServicesPotvrđena adresa e-pošte na amazon.com
Shoham SabachAssociate Professor, Technion, Faculty of Data and Decision SciencesPotvrđena adresa e-pošte na technion.ac.il
Kavosh AsadiResearch Scientist, Amazon Web ServicesPotvrđena adresa e-pošte na amazon.com
Yannis Flet-BerliacPostdoc, Stanford UniversityPotvrđena adresa e-pošte na stanford.edu
Liwei WangProfessor, Peking UniversityPotvrđena adresa e-pošte na cis.pku.edu.cn
Dipendra MisraMicrosoft Research New YorkPotvrđena adresa e-pošte na microsoft.com
Robert SchapireMicrosoft ResearchPotvrđena adresa e-pošte na microsoft.com
Miroslav DudikMicrosoft ResearchPotvrđena adresa e-pošte na microsoft.com
Zuxin LiuCarnegie Mellon UniversityPotvrđena adresa e-pošte na cs.cmu.edu
Jesse ZhangPhD Student, USCPotvrđena adresa e-pošte na usc.edu
Philip ThomasUniversity of Massachusetts AmherstPotvrđena adresa e-pošte na cs.umass.edu
Pratik ChaudhariUniversity of PennsylvaniaPotvrđena adresa e-pošte na seas.upenn.edu

Prati

Yao Liu

Amazon

Potvrđena adresa e-pošte na stanford.edu - Početna stranica

Reinforcement Learning Machine Learning


Naslov Poredaj po navodima Poredaj po godini Poredaj po naslovu	Citirano Citirano	Godina
Provably good batch reinforcement learning without great exploration Y Liu, A Swaminathan, A Agarwal, E Brunskill Advances in Neural Information Processing Systems 33, 1264–1274, 2020	204	2020
Off-Policy Policy Gradient with Stationary Distribution Correction Y Liu, A Swaminathan, A Agarwal, E Brunskill Proceedings of The 35th Uncertainty in Artificial Intelligence Conference …, 2019	173*	2019
Representation balancing mdps for off-policy policy evaluation Y Liu, O Gottesman, A Raghu, M Komorowski, A Faisal, F Doshi-Velez, ... Advances in Neural Information Processing Systems 31, 2644--2653, 2018	74	2018
Interpretable off-policy evaluation in reinforcement learning by highlighting influential transitions O Gottesman, J Futoma, Y Liu, S Parbhoo, L Celi, E Brunskill, ... International Conference on Machine Learning, 3658-3667, 2020	51	2020
Understanding the curse of horizon in off-policy evaluation via conditional importance sampling Y Liu, PL Bacon, E Brunskill International Conference on Machine Learning, 6184-6193, 2020	39	2020
Behaviour policy estimation in off-policy policy evaluation: Calibration matters A Raghu, O Gottesman, Y Liu, M Komorowski, A Faisal, F Doshi-Velez, ... arXiv preprint arXiv:1807.01066, 2018	39	2018
Combining parametric and nonparametric models for off-policy evaluation O Gottesman, Y Liu, S Sussex, E Brunskill, F Doshi-Velez In International Conference on Machine Learning, 2366-2375, 2019	30	2019
When Simple Exploration is Sample Efficient: Identifying Sufficient Conditions for Random Exploration to Yield PAC RL Algorithms Y Liu, E Brunskill The 14th European Workshop on Reinforcement Learning, 2018	23	2018
Pac continuous state online multitask reinforcement learning with identification Y Liu, Z Guo, E Brunskill Proceedings of the 2016 International Conference on Autonomous Agents …, 2016	19	2016
Reinforcement learning tutor better supported lower performers in a math task S Ruan, A Nie, W Steenbergen, J He, JQ Zhang, M Guo, Y Liu, ... Machine Learning, 1-26, 2024	8	2024
All-action policy gradient methods: A numerical integration approach B Petit, L Amdahl-Culleton, Y Liu, J Smith, PL Bacon arXiv preprint arXiv:1910.09093, 2019	5	2019
Nonlinear Dimensionality Reduction by Local Orthogonality Preserving Alignment T Lin, Y Liu, B Wang, LW Wang, HB Zha Journal of Computer Science and Technology 31 (3), 512-524, 2016	3*	2016
Offline policy optimization with eligible actions Y Liu, Y Flet-Berliac, E Brunskill Uncertainty in Artificial Intelligence, 1253-1263, 2022	2	2022
TAIL: Task-specific Adapters for Imitation Learning with Large Pretrained Models Z Liu, J Zhang, K Asadi, Y Liu, D Zhao, S Sabach, R Fakoor arXiv preprint arXiv:2310.05905, 2023	1	2023
Provably sample-efficient RL with side information about latent dynamics Y Liu, D Misra, M Dudík, RE Schapire Advances in Neural Information Processing Systems 35, 33482-33493, 2022	1	2022
Stitched trajectories for off-policy learning S Sussex, O Gottesman, Y Liu, S Murphy, E Brunskill, F Doshi-Velez ICML Workshop, 2018	1	2018
Budgeting counterfactual for offline RL Y Liu, P Chaudhari, R Fakoor Advances in Neural Information Processing Systems 36, 2024		2024
TD Convergence: An Optimization Perspective K Asadi, S Sabach, Y Liu, O Gottesman, R Fakoor Advances in Neural Information Processing Systems 36, 2024		2024
Model Selection for Off-Policy Policy Evaluation Y Liu, PS Thomas, E Brunskill The Multi-disciplinary Conference on Reinforcement Learning and Decision Making, 2017		2017

Sustav trenutno ne može provesti ovu radnju. Pokušajte ponovo kasnije.

Članci 1–19

Godišnji broj citata

Dvostruki navodi

Spojeni navodi

Dodavanje suautoraSuautori

Prati

Citirano

Suautori