Follow
Ofir Press
Title
Cited by
Cited by
Year
Bloom: A 176b-parameter open-access multilingual language model
T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ...
11302023
Using the Output Embedding to Improve Language Models
O Press, L Wolf
EACL 2017, 2017
7392017
Train Short, Test Long: Attention with Linear Biases Enables Input Length Extrapolation
O Press, NA Smith, M Lewis
ICLR 2022, 2021
3392021
Measuring and narrowing the compositionality gap in language models
O Press, M Zhang, S Min, L Schmidt, NA Smith, M Lewis
Findings of EMNLP 2023, 2022
248*2022
Language Generation with Recurrent Generative Adversarial Networks without Pre-training
O Press, A Bar, B Bogin, J Berant, L Wolf
1st Workshop on Learning to Generate Natural Language at ICML 2017, 2017
1362017
How language model hallucinations can snowball
M Zhang, O Press, W Merrill, A Liu, NA Smith
arXiv preprint arXiv:2305.13534, 2023
1232023
What Language Model to Train if You Have One Million GPU Hours?
T Le Scao, T Wang, D Hesslow, L Saulnier, S Bekman, MS Bari, ...
Findings of EMNLP 2022, 2022
782022
Improving Transformer Models by Reordering their Sublayers
O Press, NA Smith, O Levy
ACL 2020, 2019
742019
Shortformer: Better Language Modeling using Shorter Inputs
O Press, NA Smith, M Lewis
ACL 2021, 2020
682020
Transformer Language Models without Positional Encodings Still Learn Positional Information
A Haviv, O Ram, O Press, P Izsak, O Levy
Findings of EMNLP 2022, 2022
542022
SWE-bench: Can Language Models Resolve Real-World GitHub Issues?
CE Jimenez, J Yang, A Wettig, S Yao, K Pei, O Press, K Narasimhan
ICLR 2024, 2023
322023
You may not need attention
O Press, NA Smith
arXiv preprint arXiv:1810.13409, 2018
282018
Partially shuffling the training data to improve language models
O Press
arXiv preprint arXiv:1903.04167, 2019
42019
Bloom: A 176b-parameter open-access multilingual language model
BS Workshop, TL Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, ...
arXiv preprint arXiv:2211.05100, 2022
22022
Complementing Scale: Novel Guidance Methods for Improving Language Models
O Press
University of Washington, 2023
2023
The system can't perform the operation now. Try again later.
Articles 1–15