Daniel Simig
Cohere
Verified email at cohere.com
Title
Cited by
Year
Opt: Open pre-trained transformer language models
S Zhang, S Roller, N Goyal, M Artetxe, M Chen, S Chen, C Dewan, ...
arXiv preprint arXiv:2205.01068, 2022
1654 2022
Few-shot learning with multilingual language models
XV Lin, T Mihaylov, M Artetxe, T Wang, S Chen, D Simig, M Ott, N Goyal, ...
arXiv preprint arXiv:2112.10668, 2021
133* 2021
Opt-iml: Scaling language model instruction meta learning through the lens of generalization
S Iyer, XV Lin, R Pasunuru, T Mihaylov, D Simig, P Yu, K Shuster, T Wang, ...
arXiv preprint arXiv:2212.12017, 2022
62 2022
Semdedup: Data-efficient learning at web-scale through semantic deduplication
A Abbas, K Tirumala, D Simig, S Ganguli, AS Morcos
arXiv preprint arXiv:2303.09540, 2023
58 2023
Megabyte: Predicting million-byte sequences with multiscale transformers
L Yu, D Simig, C Flaherty, A Aghajanyan, L Zettlemoyer, M Lewis
Advances in Neural Information Processing Systems 36, 2024
41 2024
D4: Improving llm pretraining via document de-duplication and diversification
K Tirumala, D Simig, A Aghajanyan, A Morcos
Advances in Neural Information Processing Systems 36, 2024
24 2024
Understanding in-context learning via supportive pretraining data
X Han, D Simig, T Mihaylov, Y Tsvetkov, A Celikyilmaz, T Wang
arXiv preprint arXiv:2306.15091, 2023
20 2023
Open vocabulary extreme classification using generative models
D Simig, F Petroni, P Yanki, K Popat, C Du, S Riedel, M Yazdani
arXiv preprint arXiv:2205.05812, 2022
12 2022
Text characterization toolkit (TCT)
D Simig, T Wang, V Dankers, P Henderson, K Batsuren, D Hupkes, ...
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the …, 2022
5* 2022
Evaluating end-to-end entity linking on domain-specific knowledge bases: Learning about ancient technologies from museum collections
S Cadavid-Sanchez, K Kacem, RAM Frade, J Boehm, T Chaney, ...
arXiv preprint arXiv:2305.14588, 2023
2023
Turning Flows into Trees: Graph Analytics for Aerodynamic Flows
D Simig, P Kelly
2016
Natural Language to Neural Programs
D Simig
Articles 1–12