Listening to Affected Communities to Define Extreme Speech: Dataset and Experiments A Maronikolakis, A Wisiorek, L Nann, H Jabbar, S Udupa, H Schütze arXiv preprint arXiv:2203.11764, 2022 | 8 | 2022 |
Flow-Adapter Architecture for Unsupervised Machine Translation Y Liu, H Jabbar, H Schütze arXiv preprint arXiv:2204.12225, 2022 | 5 | 2022 |
An Information-Theoretic Approach and Dataset for Probing Gender Stereotypes in Multilingual Masked Language Models V Steinborn, P Dufter, H Jabbar, H Schütze Findings of the Association for Computational Linguistics: NAACL 2022, 921-932, 2022 | 4 | 2022 |
MorphPiece: Moving away from Statistical Language Representation H Jabbar arXiv preprint arXiv:2307.07262, 2023 | 1 | 2023 |
WordScape: a Pipeline to extract multilingual, visually rich Documents with Layout Annotations from Web Crawl Data M Weber, C Siebenschuh, RM Butler, A Alexandrov, VR Thanner, ... NeurIPS 2023 Datasets and Benchmarks, 2023 | | 2023 |
MorphPiece: A Linguistic Tokenizer for Large Language Models H Jabbar | | |
End-to-End Learning Artificial Intelligence E Labintcev, H Jabbar, A Sieler, C Holland | | |