Follow
Jing-Cheng Pang
Jing-Cheng Pang
Other namesJingcheng Pang
Ph.D. student, Nanjing University.
Verified email at lamda.nju.edu.cn - Homepage
Title
Cited by
Cited by
Year
Regret minimization experience replay in off-policy reinforcement learning
XH Liu, Z Xue, JC Pang, S Jiang, F Xu, Y Yu
NeurIPS'21, 2021
292021
Offline imitation learning with a misspecified simulator
S Jiang, JC Pang, Y Yu
NeurIPS'20, 2020
222020
Language Model Self-improvement by Reinforcement Learning Contemplation
JC Pang, P Wang, K Li, XH Chen, J Xu, Z Zhang, Y Yu
ICLR'24, 2024
72024
Natural Language Instruction-following with Task-related Language Development and Translation
JC Pang, XY Yang, SH Yang, XH Chen, Y Yu
NeurIPS'23, 2024
4*2024
Improving Fictitious Play Reinforcement Learning with Expanding Models
RJ Qin, JC Pang, Y Yu
arXiv preprint arXiv:1911.11928, 2019
22019
Reinforcement Learning With Sparse-Executing Actions via Sparsity Regularization
JC Pang, T Xu, S Jiang, YR Liu, Y Yu
arXiv preprint arXiv:2105.08666, 2023
1*2023
Model gradient: unified model and policy learning in model-based reinforcement learning
C Jia, F Zhang, T Xu, JC Pang, Z Zhang, Y Yu
Frontiers of Computer Science 18 (4), 184339, 2024
2024
Knowledgeable Agents by Offline Reinforcement Learning from Large Language Model Rollouts
JC Pang, SH Yang, K Li, J Zhang, XH Chen, N Tang, Y Yu
arXiv preprint arXiv:2404.09248, 2024
2024
Empowering Language Models with Active Inquiry for Deeper Understanding
JC Pang, HB Fan, P Wang, JH Xiao, N Tang, SH Yang, C Jia, SJ Huang, ...
arXiv preprint arXiv:2402.03719, 2024
2024
Object-Oriented Option Framework for Robotics Manipulation in Clutter
JC Pang, SH Yang, XH Chen, X Yang, Y Yu, M Ma, Z Guo, H Yang, ...
2023 IEEE/RSJ International Conference on Intelligent Robots and Systems …, 2023
2023
The system can't perform the operation now. Try again later.
Articles 1–10