Beyond the imitation game: Quantifying and extrapolating the capabilities of language models A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ... arXiv preprint arXiv:2206.04615, 2022 | 754 | 2022 |
COM2SENSE: A commonsense reasoning benchmark with complementary sentences S Singh, N Wen, Y Hou, P Alipoormolabashi, TL Wu, X Ma, N Peng arXiv preprint arXiv:2106.00969, 2021 | 42 | 2021 |
Eventplus: A temporal event understanding pipeline MD Ma, J Sun, M Yang, KH Huang, N Wen, S Singh, R Han, N Peng arXiv preprint arXiv:2101.04922, 2021 | 29 | 2021 |
Melinda: A multimodal dataset for biomedical experiment method classification TL Wu, S Singh, S Paul, G Burns, N Peng Proceedings of the AAAI Conference on Artificial Intelligence 35 (16), 14076 …, 2021 | 16 | 2021 |
Viphy: Probing" visible" physical commonsense knowledge S Singh, E Qasemi, M Chen arXiv preprint arXiv:2209.07000, 2022 | 4 | 2022 |