Jailbreaking black box large language models in twenty queries P Chao, A Robey, E Dobriban, H Hassani, GJ Pappas, E Wong arXiv preprint arXiv:2310.08419, 2023 | 112 | 2023 |
Adversarial prompting for black box foundation models N Maus, P Chao, E Wong, J Gardner arXiv preprint arXiv:2302.04237 1 (2), 2023 | 54* | 2023 |
Interventional and counterfactual inference with diffusion models P Chao, P Blöbaum, SP Kasiviswanathan arXiv preprint arXiv:2302.00860, 2023 | 12 | 2023 |
Different definitions of conic sections in hyperbolic geometry P Chao, J Rosenberg Involve, a Journal of Mathematics 11 (5), 753-768, 2018 | 7 | 2018 |
AdaPT-GMM: Powerful and robust covariate-assisted multiple testing P Chao, W Fithian arXiv preprint arXiv:2106.15812, 2021 | 6 | 2021 |
Generative models for pose transfer P Chao, A Li, G Swamy arXiv preprint arXiv:1806.09070, 2018 | 3 | 2018 |
A safe harbor for ai evaluation and red teaming S Longpre, S Kapoor, K Klyman, A Ramaswami, R Bommasani, ... arXiv preprint arXiv:2403.04893, 2024 | 2 | 2024 |
JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models P Chao, E Debenedetti, A Robey, M Andriushchenko, F Croce, V Sehwag, ... arXiv preprint arXiv:2404.01318, 2024 | | 2024 |
Statistical Estimation Under Distribution Shift: Wasserstein Perturbations and Minimax Theory P Chao, E Dobriban arXiv preprint arXiv:2308.01853, 2023 | | 2023 |