Nitish Shirish Keskar

Citirano

	Sve	Od 2019.
Citati	11931	11242
H-indeks	28	27
i10-indeks	41	41

3100

1550

775

2325

20172018201920202021202220232024136 504 1004 1556 1693 2174 3003 1801

Javni pristup

Prikaži sve

5 članaka

0 članaka

dostupno

nije dostupno

Na temelju uvjeta financiranja

Suautori

Richard Socheryou.comPotvrđena adresa e-pošte na stanford.edu
Caiming XiongSalesforce ResearchPotvrđena adresa e-pošte na salesforce.com
Bryan McCannYou.comPotvrđena adresa e-pošte na you.com
Jorge NocedalProfessor, Industrial Engineering, Northwestern UniversityPotvrđena adresa e-pošte na NORTHWESTERN.EDU
Dheevatsa MudigereDistinguished Engineer, NVIDIAPotvrđena adresa e-pošte na nvidia.com
Mikhail SmelyanskiyFacebookPotvrđena adresa e-pošte na intel.com
Lav R. VarshneyUniversity of Illinois Urbana-ChampaignPotvrđena adresa e-pošte na illinois.edu
Stephen MerityPotvrđena adresa e-pošte na smerity.com
Nikhil NaikMITPotvrđena adresa e-pošte na mit.edu
Akhilesh Deepak GotmareSalesforce ResearchPotvrđena adresa e-pošte na salesforce.com
Ali MadaniProfluent BioPotvrđena adresa e-pošte na berkeley.edu
Nazneen RajaniHugging FacePotvrđena adresa e-pošte na huggingface.co
Huan WangSalesforce ResearchPotvrđena adresa e-pošte na yale.edu
Semih YavuzSalesforce ResearchPotvrđena adresa e-pošte na salesforce.com
Albert S. BerahasAssistant Professor, University of MichiganPotvrđena adresa e-pošte na umich.edu
Karim AhmedDartmouth College, Samsung Research AmericaPotvrđena adresa e-pošte na dartmouth.edu
Tong NiuSalesforce ResearchPotvrđena adresa e-pošte na salesforce.com
Raphael R EguchiStanford UniversityPotvrđena adresa e-pošte na alumni.stanford.edu
Jasdeep SinghStanford UniversityPotvrđena adresa e-pošte na stanford.edu
Wojciech KryścińskiCoherePotvrđena adresa e-pošte na cohere.com

Prati

Nitish Shirish Keskar

OpenAI

Potvrđena adresa e-pošte na openai.com - Početna stranica

Deep Learning Mathematical Optimization Natural Language Processing


Naslov Poredaj po navodima Poredaj po godini Poredaj po naslovu	Citirano Citirano	Godina
On large-batch training for deep learning: Generalization gap and sharp minima NS Keskar, D Mudigere, J Nocedal, M Smelyanskiy, PTP Tang arXiv preprint arXiv:1609.04836, 2016	3338	2016
Gpt-4 technical report J Achiam, S Adler, S Agarwal, L Ahmad, I Akkaya, FL Aleman, D Almeida, ... arXiv preprint arXiv:2303.08774, 2023	1338*	2023
Regularizing and optimizing LSTM language models S Merity, NS Keskar, R Socher arXiv preprint arXiv:1708.02182, 2017	1266	2017
Ctrl: A conditional transformer language model for controllable generation NS Keskar, B McCann, LR Varshney, C Xiong, R Socher arXiv preprint arXiv:1909.05858, 2019	1092	2019
Beyond the imitation game: Quantifying and extrapolating the capabilities of language models A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ... arXiv preprint arXiv:2206.04615, 2022	729	2022
The natural language decathlon: Multitask learning as question answering B McCann, NS Keskar, C Xiong, R Socher arXiv preprint arXiv:1806.08730, 2018	650	2018
Improving generalization performance by switching from adam to sgd NS Keskar, R Socher arXiv preprint arXiv:1712.07628, 2017	623	2017
Neural text summarization: A critical evaluation W Kryściński, NS Keskar, B McCann, C Xiong, R Socher arXiv preprint arXiv:1908.08960, 2019	377	2019
Gedi: Generative discriminator guided sequence generation B Krause, AD Gotmare, B McCann, NS Keskar, S Joty, R Socher, ... arXiv preprint arXiv:2009.06367, 2020	298	2020
A closer look at deep learning heuristics: Learning rate restarts, warmup and distillation A Gotmare, NS Keskar, C Xiong, R Socher arXiv preprint arXiv:1810.13243, 2018	276	2018
Progen: Language modeling for protein generation A Madani, B McCann, N Naik, NS Keskar, N Anand, RR Eguchi, ... arXiv preprint arXiv:2004.03497, 2020	232	2020
An analysis of neural language modeling at multiple scales S Merity, NS Keskar, R Socher arXiv preprint arXiv:1803.08240, 2018	188	2018
Deep learning-enabled breast cancer hormonal receptor status determination from base-level H&E stains N Naik, A Madani, A Esteva, NS Keskar, MF Press, D Ruderman, DB Agus, ... Nature communications 11 (1), 5727, 2020	175	2020
Weighted transformer network for machine translation K Ahmed, NS Keskar, R Socher arXiv preprint arXiv:1711.02132, 2017	155	2017
Balancing communication and computation in distributed optimization AS Berahas, R Bollapragada, NS Keskar, E Wei IEEE Transactions on Automatic Control 64 (8), 3141-3155, 2018	114	2018
Sequence-to-sequence prediction using a neural network model NS Keskar, K Ahmed, R Socher US Patent 11,928,600, 2024	107	2024
Multitask learning as question answering NS Keskar, B McCann, C Xiong, R Socher US Patent 11,501,076, 2022	86	2022
Multitask learning as question answering B McCann, NS Keskar, C Xiong, R Socher US Patent 10,776,581, 2020	83	2020
Hybrid training of deep networks NS Keskar, R Socher US Patent 11,276,002, 2022	78	2022
Xlda: Cross-lingual data augmentation for natural language inference and question answering J Singh, B McCann, NS Keskar, C Xiong, R Socher arXiv preprint arXiv:1905.11471, 2019	77	2019

Sustav trenutno ne može provesti ovu radnju. Pokušajte ponovo kasnije.

Članci 1–20

Godišnji broj citata

Dvostruki navodi

Spojeni navodi

Dodavanje suautoraSuautori

Prati

Citirano

Suautori