Prati
Qiwen Cui
Qiwen Cui
Potvrđena adresa e-pošte na uw.edu - Početna stranica
Naslov
Citirano
Citirano
Godina
When are offline two-player zero-sum Markov games solvable?
Q Cui, SS Du
Advances in Neural Information Processing Systems 35, 25779-25791, 2022
562022
Randomized Exploration for Reinforcement Learning with General Value Function Approximation
H Ishfaq, Q Cui, V Nguyen, A Ayoub, Z Yang, Z Wang, D Precup, LF Yang
Thirty-eighth International Conference on Machine Learning, 2021
452021
Breaking the curse of multiagents in a large state space: Rl in markov games with independent linear function approximation
Q Cui, K Zhang, S Du
The Thirty Sixth Annual Conference on Learning Theory, 2651-2652, 2023
282023
Clinical decision support model for tooth extraction therapy derived from electronic dental records
Q Cui, Q Chen, P Liu, D Liu, Z Wen
The Journal of Prosthetic Dentistry 126 (1), 83-90, 2021
282021
Provably efficient offline multi-agent reinforcement learning via strategy-wise bonus
Q Cui, SS Du
Advances in Neural Information Processing Systems 35, 11739-11751, 2022
262022
Minimax sample complexity for turn-based stochastic game
Q Cui, LF Yang
Thirty-seventh Conference on Uncertainty in Artificial Intelligence, 2020
262020
Near-optimal randomized exploration for tabular markov decision processes
Z Xiong, R Shen, Q Cui, M Fazel, SS Du
Advances in Neural Information Processing Systems 35, 6358-6371, 2022
24*2022
Learning in congestion games with bandit feedback
Q Cui, Z Xiong, M Fazel, SS Du
Advances in Neural Information Processing Systems 35, 11009-11022, 2022
152022
On gap-dependent bounds for offline reinforcement learning
X Wang, Q Cui, SS Du
Advances in Neural Information Processing Systems 35, 14865-14877, 2022
152022
Is Plug-in Solver Sample-Efficient for Feature-based Reinforcement Learning?
Q Cui, LF Yang
Thirty-fourth Conference on Neural Information Processing Systems, 2020
132020
An efficient Fisher matrix approximation method for large-scale neural network optimization
M Yang, D Xu, Q Cui, Z Wen, P Xu
IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (5), 5391-5403, 2022
12*2022
Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning
Z Zhou, C Zhu, R Zhou, Q Cui, A Gupta, SS Du
arXiv preprint arXiv:2310.19308, 2023
42023
Learning Optimal Tax Design in Nonatomic Congestion Games
Q Cui, M Fazel, SS Du
arXiv preprint arXiv:2402.07437, 2024
22024
Refined sample complexity for markov games with independent linear function approximation
Y Dai, Q Cui, SS Du
arXiv preprint arXiv:2402.07082, 2024
12024
A Black-box Approach for Non-stationary Multi-agent Reinforcement Learning
H Jiang, Q Cui, Z Xiong, M Fazel, SS Du
arXiv preprint arXiv:2306.07465, 2023
12023
Offline congestion games: How feedback type affects data coverage requirement
H Jiang, Q Cui, Z Xiong, M Fazel, SS Du
arXiv preprint arXiv:2210.13396, 2022
12022
BabelBench: An Omni Benchmark for Code-Driven Analysis of Multimodal and Multistructured Data
X Wang, Q Cui, Y Tao, Y Wang, Z Chai, X Han, B Liu, J Yuan, J Su, ...
arXiv preprint arXiv:2410.00773, 2024
2024
Multi-Agent Reinforcement Learning from Human Feedback: Data Coverage and Algorithmic Techniques
N Zhang, X Wang, Q Cui, R Zhou, SM Kakade, SS Du
arXiv preprint arXiv:2409.00717, 2024
2024
-Puzzle: A Cost-Efficient Testbed for Benchmarking Reinforcement Learning Algorithms in Generative Language Model
Y Zhang, L Chen, B Liu, Y Yang, Q Cui, Y Tao, H Yang
arXiv preprint arXiv:2403.07191, 2024
2024
Sustav trenutno ne može provesti ovu radnju. Pokušajte ponovo kasnije.
Članci 1–19