Ching-An Cheng
Microsoft Research
Verified email at microsoft.com
Title / Cited by / Year
Agile autonomous driving using end-to-end deep imitation learning
Y Pan, CA Cheng, K Saigol, K Lee, X Yan, E Theodorou, B Boots
Robotics: science and systems, 2018
Cited by: 357; Year: 2018
Bellman-consistent pessimism for offline reinforcement learning
T Xie, CA Cheng, N Jiang, P Mineiro, A Agarwal
Advances in neural information processing systems 34, 6683-6694, 2021
Cited by: 260; Year: 2021
Truncated back-propagation for bilevel optimization
A Shaban, CA Cheng, N Hatch, B Boots
The 22nd International Conference on Artificial Intelligence and Statistics …, 2019
Cited by: 256; Year: 2019
Imitation learning for agile autonomous driving
Y Pan, CA Cheng, K Saigol, K Lee, X Yan, EA Theodorou, B Boots
The International Journal of Robotics Research 39 (2-3), 286-302, 2020
Cited by: 156; Year: 2020
Adversarially trained actor critic for offline reinforcement learning
CA Cheng, T Xie, N Jiang, A Agarwal
International Conference on Machine Learning, 3852-3878, 2022
Cited by: 112; Year: 2022
Fast policy learning through imitation and reinforcement
CA Cheng, X Yan, N Wagener, B Boots
arXiv preprint arXiv:1805.10413, 2018
Cited by: 90; Year: 2018
Variational inference for Gaussian process models with linear complexity
CA Cheng, B Boots
Advances in Neural Information Processing Systems 30, 2017
Cited by: 88; Year: 2017
RMPflow: A Computational Graph for Automatic Motion Policy Generation
CA Cheng, M Mukadam, J Issac, S Birchfield, D Fox, B Boots, N Ratliff
Algorithmic Foundations of Robotics XIII: Proceedings of the 13th Workshop …, 2020
Cited by: 86; Year: 2020
An online learning approach to model predictive control
N Wagener, CA Cheng, J Sacks, B Boots
arXiv preprint arXiv:1902.08967, 2019
Cited by: 82; Year: 2019
Intra order-preserving functions for calibration of multi-class neural networks
A Rahimi, A Shaban, CA Cheng, R Hartley, B Boots
Advances in Neural Information Processing Systems 33, 13456-13467, 2020
Cited by: 64; Year: 2020
Cautiously optimistic policy optimization and exploration with linear function approximation
A Zanette, CA Cheng, A Agarwal
Conference on Learning Theory, 4473-4525, 2021
Cited by: 57; Year: 2021
Safe reinforcement learning using advantage-based intervention
NC Wagener, B Boots, CA Cheng
International Conference on Machine Learning, 10630-10640, 2021
Cited by: 50; Year: 2021
Heuristic-guided reinforcement learning
CA Cheng, A Kolobov, A Swaminathan
Advances in Neural Information Processing Systems 34, 13550-13563, 2021
Cited by: 49; Year: 2021
Orthogonally decoupled variational Gaussian processes
H Salimbeni, CA Cheng, B Boots, M Deisenroth
Advances in neural information processing systems 31, 2018
Cited by: 49; Year: 2018
Incremental variational sparse Gaussian process regression
CA Cheng, B Boots
Advances in Neural Information Processing Systems 29, 2016
Cited by: 48; Year: 2016
Virtual impedance control for safe human-robot interaction
SY Lo, CA Cheng, HP Huang
Journal of Intelligent & Robotic Systems 82, 3-19, 2016
Cited by: 42; Year: 2016
RMPflow: A Geometric Framework for Generation of Multitask Motion Policies
CA Cheng, M Mukadam, J Issac, S Birchfield, D Fox, B Boots, N Ratliff
IEEE Transactions on Automation Science and Engineering 18 (3), 968-987, 2021
Cited by: 34; Year: 2021
Direct Nash optimization: Teaching language models to self-improve with general preferences
C Rosset, CA Cheng, A Mitra, M Santacroce, A Awadallah, T Xie
arXiv preprint arXiv:2404.03715, 2024
Cited by: 33; Year: 2024
Convergence of value aggregation for imitation learning
CA Cheng, B Boots
International Conference on Artificial Intelligence and Statistics, 1801-1809, 2018
Cited by: 33; Year: 2018
Trajectory-wise control variates for variance reduction in policy gradient methods
CA Cheng, X Yan, B Boots
Conference on Robot Learning, 1379-1394, 2020
Cited by: 29; Year: 2020