Prati
Chengqian Gao
Chengqian Gao
MBZUAI
Potvrđena adresa e-pošte na mbzuai.ac.ae
Naslov
Citirano
Citirano
Godina
Value penalized q-learning for recommender systems
C Gao, K Xu, K Zhou, L Li, X Wang, B Yuan, P Zhao
Proceedings of the 45th International ACM SIGIR Conference on Research and …, 2022
162022
Hard-Thresholding Meets Evolution Strategies in Reinforcement Learning
C Gao, W de Vazelhes, H Zhang, B Gu, Z Xu
arXiv preprint arXiv:2405.01615, 2024
2024
Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation
C Gao, K Xu, L Liu, D Ye, P Zhao, Z Xu
arXiv preprint arXiv:2210.10469, 2022
2022
Sustav trenutno ne može provesti ovu radnju. Pokušajte ponovo kasnije.
Članci 1–3