Arithmetic circuits: A chasm at depth 3 A Gupta, P Kamath, N Kayal, R Saptharishi SIAM Journal on Computing 45 (3), 1064-1079, 2016 | 169* | 2016 |
Injecting numerical reasoning skills into language models M Geva*, A Gupta*, J Berant Proceedings of the 58th Annual Meeting of the Association for Computational …, 2020 | 143 | 2020 |
Approaching the chasm at depth four A Gupta, P Kamath, N Kayal, R Saptharishi Journal of the ACM (JACM) 61 (6), 1-16, 2014 | 132 | 2014 |
Break it down: A question understanding benchmark T Wolfson, M Geva, A Gupta, M Gardner, Y Goldberg, D Deutch, J Berant Transactions of the Association for Computational Linguistics 8, 183-198, 2020 | 127 | 2020 |
Reconstruction of depth-4 multilinear circuits with top fan-in 2 A Gupta, N Kayal, S Lokam Proceedings of the forty-fourth annual ACM symposium on Theory of computing …, 2012 | 29 | 2012 |
GMAT: Global memory augmentation for transformers A Gupta, J Berant arXiv preprint arXiv:2006.03274, 2020 | 26 | 2020 |
Algebraic Geometric Techniques for Depth-4 PIT & Sylvester-Gallai Conjectures for Varieties. A Gupta Electron. Colloquium Comput. Complex. 21, 130, 2014 | 24 | 2014 |
Diagonal state spaces are as effective as structured state spaces A Gupta, A Gu, J Berant Advances in Neural Information Processing Systems 35, 22982-22994, 2022 | 22 | 2022 |
SCROLLS: Standardized comparison over long language sequences U Shaham, E Segal, M Ivgi, A Efrat, O Yoran, A Haviv, A Gupta, W Xiong, ... arXiv preprint arXiv:2201.03533, 2022 | 22 | 2022 |
Random arithmetic formulas can be reconstructed efficiently A Gupta, N Kayal, Y Qiao computational complexity 23, 207-303, 2014 | 20 | 2014 |
On the parameterization and initialization of diagonal state space models A Gu, A Gupta, K Goel, C Ré Advances in Neural Information Processing Systems 35, 35971-35983, 2022 | 19 | 2022 |
Efficient reconstruction of random multilinear formulas A Gupta, N Kayal, S Lokam 2011 IEEE 52nd Annual Symposium on Foundations of Computer Science, 778-787, 2011 | 18 | 2011 |
Long range language modeling via gated state spaces H Mehta, A Gupta, A Cutkosky, B Neyshabur arXiv preprint arXiv:2206.13947, 2022 | 15 | 2022 |
Memory-efficient Transformers via Top-k Attention A Gupta, G Dar, S Goodman, D Ciprut, J Berant Proceedings of the Second Workshop on Simple and Efficient Natural Language …, 2021 | 9 | 2021 |
Analyzing transformers in embedding space G Dar, M Geva, A Gupta, J Berant arXiv preprint arXiv:2209.02535, 2022 | 6 | 2022 |
Organic solar cells and its characteristics A Gupta J Material Sci Eng 4 (203), 2169-0022.1000203, 2015 | 6 | 2015 |
Synthesis of polyaniline without metal doping and its characterization A Gupta, M Kumar J Mater Sci Surf Eng 6, 802-804, 2018 | 5 | 2018 |
Value-aware Approximate Attention A Gupta, J Berant Proceedings of the 2021 Conference on Empirical Methods in Natural Language …, 2021 | 3 | 2021 |
Diagonal State Space Augmented Transformers for Speech Recognition G Saon, A Gupta, X Cui ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 1 | 2023 |
Simplifying and Understanding State Space Models with Diagonal Linear RNNs A Gupta, H Mehta, J Berant arXiv preprint arXiv:2212.00768, 2022 | 1 | 2022 |