Kianté Brantley

Citirano

	Sve	Od 2019.
Citati	618	598
H-indeks	9	9
i10-indeks	9	9

200

100

150

2016201720182019202020212022202320244 7 9 24 74 105 100 193 102

Javni pristup

Prikaži sve

1 članak

0 članaka

dostupno

nije dostupno

Na temelju uvjeta financiranja

Suautori

Wen SunAssistant Professor, Cornell UniversityPotvrđena adresa e-pošte na cornell.edu
Hal Daumé IIIAssociate Professor of Computer Science, University of MarylandPotvrđena adresa e-pošte na umiacs.umd.edu
Prithviraj AmmanabroluAssistant Professor, University of California, San DiegoPotvrđena adresa e-pošte na ucsd.edu
Miroslav DudikMicrosoft ResearchPotvrđena adresa e-pošte na microsoft.com
Kyunghyun ChoNew York University, GenentechPotvrđena adresa e-pošte na nyu.edu
Sean J WelleckAssistant Professor, Carnegie Mellon UniversityPotvrđena adresa e-pošte na andrew.cmu.edu
Robert SchapireMicrosoft ResearchPotvrđena adresa e-pošte na microsoft.com
Thorsten JoachimsProfessor of Computer Science, Cornell UniversityPotvrđena adresa e-pošte na cs.cornell.edu
Geoff GordonProfessor of Machine Learning, Carnegie Mellon UniversityPotvrđena adresa e-pošte na cs.cmu.edu
Yoav ArtziCornell University; ASAPPPotvrđena adresa e-pošte na cs.cornell.edu
Shi FengUniversity of ChicagoPotvrđena adresa e-pošte na uchicago.edu
Khanh NguyenCHAI BerkeleyPotvrđena adresa e-pošte na berkeley.edu
Michel GalleySr. Principal Researcher at MicrosoftPotvrđena adresa e-pošte na acm.org
Bill DolanPartner Research Manager, Microsoft ResearchPotvrđena adresa e-pošte na microsoft.com
Yizhe ZhangResearch Scientist @ Apple MLRPotvrđena adresa e-pošte na apple.com
J. Andrew BagnellCarnegie Mellon UniversityPotvrđena adresa e-pošte na ri.cmu.edu

Prati

Kianté Brantley

Cornell University

Potvrđena adresa e-pošte na cornell.edu - Početna stranica

imitation learning machine learning natural language processing


Naslov Poredaj po navodima Poredaj po godini Poredaj po naslovu	Citirano Citirano	Godina
Is reinforcement learning (not) for natural language processing?: Benchmarks, baselines, and building blocks for natural language policy optimization R Ramamurthy, P Ammanabrolu, K Brantley, J Hessel, R Sifa, ... The Eleventh International Conference on Learning Representations, 2023	156*	2023
Non-monotonic sequential text generation S Welleck, K Brantley, HD Iii, K Cho International Conference on Machine Learning, 6716-6726, 2019	128	2019
Disagreement-regularized imitation learning K Brantley, W Sun, M Henaff International Conference on Learning Representations, 2019	106	2019
Reinforcement learning with convex constraints S Miryoosefi, K Brantley, H Daume III, M Dudik, RE Schapire Advances in neural information processing systems 32, 2019	94	2019
Constrained episodic reinforcement learning in concave-convex and knapsack settings K Brantley, M Dudik, T Lykouris, S Miryoosefi, M Simchowitz, A Slivkins, ... Advances in Neural Information Processing Systems 33, 16315-16326, 2020	49	2020
Ldaexplore: Visualizing topic models generated using latent dirichlet allocation A Ganesan, K Brantley, S Pan, J Chen arXiv preprint arXiv:1507.06593, 2015	31	2015
Active imitation learning with noisy guidance K Brantley, A Sharaf, H Daumé III arXiv preprint arXiv:2005.12801, 2020	18	2020
Successor feature sets: Generalizing successor representations across policies K Brantley, S Mehri, GJ Gordon Proceedings of the AAAI Conference on Artificial Intelligence 35 (13), 11774 …, 2021	11	2021
Learning to Generate Better Than Your LLM JD Chang, K Brantley, R Ramamurthy, D Misra, W Sun arXiv preprint arXiv:2306.11816, 2023	10	2023
The umd neural machine translation systems at wmt17 bandit learning task A Sharaf, S Feng, K Nguyen, K Brantley, H Daumé III arXiv preprint arXiv:1708.01318, 2017	4	2017
Interactive text generation F Faltings, M Galley, B Peng, K Brantley, W Cai, Y Zhang, J Gao, B Dolan arXiv preprint arXiv:2303.00908, 2023	3	2023
lilGym: Natural Language Visual Reasoning with Reinforcement Learning A Wu, K Brantley, N Kojima, Y Artzi arXiv preprint arXiv:2211.01994, 2022	2	2022
Dataset Reset Policy Optimization for RLHF JD Chang, W Shan, O Oertell, K Brantley, D Misra, JD Lee, W Sun arXiv preprint arXiv:2404.08495, 2024	1	2024
RL for Consistency Models: Faster Reward Guided Text-to-Image Generation O Oertell, JD Chang, Y Zhang, K Brantley, W Sun arXiv preprint arXiv:2404.03673, 2024	1	2024
Ranking with Long-Term Constraints K Brantley, Z Fang, S Dean, T Joachims Proceedings of the 17th ACM International Conference on Web Search and Data …, 2024	1	2024
Reviewer2: Optimizing Review Generation Through Prompt Generation Z Gao, K Brantley, T Joachims arXiv preprint arXiv:2402.10886, 2024	1	2024
Expert-in-the-Loop for Sequential Decisions and Predictions K Brantley University of Maryland, College Park, 2021	1	2021
BCAP: An Artificial Neural Network Pruning Technique to Reduce Overfitting K Brantley University of Maryland, Baltimore County, 2016	1	2016
REBEL: Reinforcement Learning via Regressing Relative Rewards Z Gao, JD Chang, W Zhan, O Oertell, G Swamy, K Brantley, T Joachims, ... arXiv preprint arXiv:2404.16767, 2024		2024
A Surprising Failure? Multimodal LLMs and the NLVR Challenge A Wu, K Brantley, Y Artzi arXiv preprint arXiv:2402.17793, 2024		2024

Sustav trenutno ne može provesti ovu radnju. Pokušajte ponovo kasnije.

Članci 1–20

Godišnji broj citata

Dvostruki navodi

Spojeni navodi

Dodavanje suautoraSuautori

Prati

Citirano

Suautori