Follow
Yifang Chen
Title
Cited by
Cited by
Year
A new algorithm for non-stationary contextual bandits: Efficient, optimal and parameter-free
Y Chen, CW Lee, H Luo, CY Wei
Conference on Learning Theory, 696-726, 2019
157*2019
Fair contextual multi-armed bandits: Theory and experiments
Y Chen, A Cuellar, H Luo, J Modi, H Nemlekar, S Nikolaidis
Conference on Uncertainty in Artificial Intelligence, 181-190, 2020
712020
Reward-free rl is no harder than reward-aware rl in linear markov decision processes
AJ Wagenmaker, Y Chen, M Simchowitz, S Du, K Jamieson
International Conference on Machine Learning, 22430-22456, 2022
672022
Multi-armed bandits with fairness constraints for distributing resources to human teammates
H Claure, Y Chen, J Modi, M Jung, S Nikolaidis
Proceedings of the 2020 ACM/IEEE International Conference on Human-Robot …, 2020
622020
First-order regret in reinforcement learning with linear function approximation: A robust estimation approach
AJ Wagenmaker, Y Chen, M Simchowitz, S Du, K Jamieson
International Conference on Machine Learning, 22384-22429, 2022
382022
Improved corruption robust algorithms for episodic reinforcement learning
Y Chen, S Du, K Jamieson
International Conference on Machine Learning, 1561-1570, 2021
342021
Online and bandit algorithms for nonstationary stochastic saddle-point optimization
A Roy, Y Chen, K Balasubramanian, P Mohapatra
arXiv preprint arXiv:1912.01698, 2019
252019
Active multi-task representation learning
Y Chen, K Jamieson, S Du
International Conference on Machine Learning, 3271-3298, 2022
142022
Improved active multi-task representation learning via lasso
Y Wang, Y Chen, K Jamieson, SS Du
International Conference on Machine Learning, 35548-35578, 2023
112023
LabelBench: A Comprehensive Framework for Benchmarking Adaptive Label-Efficient Learning
J Zhang, Y Chen, G Canal, AM Das, G Bhatt, S Mussmann, Y Zhu, ...
Journal of Data-centric Machine Learning Research, 2024
8*2024
Corruption robust active learning
Y Chen, SS Du, KG Jamieson
Advances in Neural Information Processing Systems 34, 29643-29654, 2021
82021
Decoding-time language model alignment with multiple objectives
R Shi, Y Chen, Y Hu, AL Liu, H Hajishirzi, NA Smith, SS Du
arXiv preprint arXiv:2406.18853, 2024
72024
An Experimental Design Framework for Label-Efficient Supervised Finetuning of Large Language Models
G Bhatt, Y Chen, AM Das, J Zhang, ST Truong, S Mussmann, Y Zhu, ...
arXiv preprint arXiv:2401.06692, 2024
72024
Causal bandits: Online decision-making in endogenous settings
J Zhang, Y Chen, A Singh
arXiv preprint arXiv:2211.08649, 2022
62022
More practical and adaptive algorithms for online quantum state learning
Y Chen, X Wang
arXiv preprint arXiv:2006.01013, 2020
62020
CLIPLoss and Norm-Based Data Selection Methods for Multimodal Contrastive Learning
Y Wang, Y Chen, W Yan, A Fang, W Zhou, K Jamieson, SS Du
arXiv preprint arXiv:2405.19547, 2024
5*2024
Active representation learning for general task space with applications in robotics
Y Chen, Y Huang, SS Du, KG Jamieson, G Shi
Advances in Neural Information Processing Systems 36, 2024
22024
Cost-Effective Proxy Reward Model Construction with On-Policy and Active Learning
Y Chen, S Wang, Z Yang, H Sharma, N Karampatziakis, D Yu, ...
arXiv preprint arXiv:2407.02119, 2024
2024
Improved Adaptive Algorithm for Scalable Active Learning with Weak Labeler
Y Chen, K Sankararaman, A Lazaric, M Pirotta, D Karamshuk, Q Wang, ...
arXiv preprint arXiv:2211.02233, 2022
2022
A Deep Bayesian Bandits Approach for Anticancer Therapy: Exploration via Functional Prior
M Lu, Y Chen, SI Lee
arXiv preprint arXiv:2205.02944, 2022
2022
The system can't perform the operation now. Try again later.
Articles 1–20