Nitish Shirish Keskar
OpenAI
Verified email at openai.com - Homepage
Title
Cited by
Year
On large-batch training for deep learning: Generalization gap and sharp minima
NS Keskar, D Mudigere, J Nocedal, M Smelyanskiy, PTP Tang
arXiv preprint arXiv:1609.04836, 2016
Cited by: 2323, Year: 2016
Regularizing and optimizing LSTM language models
S Merity, NS Keskar, R Socher
arXiv preprint arXiv:1708.02182, 2017
Cited by: 1062, Year: 2017
CTRL: A conditional transformer language model for controllable generation
NS Keskar, B McCann, LR Varshney, C Xiong, R Socher
arXiv preprint arXiv:1909.05858, 2019
Cited by: 620, Year: 2019
The natural language decathlon: Multitask learning as question answering
B McCann, NS Keskar, C Xiong, R Socher
arXiv preprint arXiv:1806.08730, 2018
Cited by: 490, Year: 2018
Improving generalization performance by switching from Adam to SGD
NS Keskar, R Socher
arXiv preprint arXiv:1712.07628, 2017
Cited by: 430, Year: 2017
Neural text summarization: A critical evaluation
W Kryściński, NS Keskar, B McCann, C Xiong, R Socher
arXiv preprint arXiv:1908.08960, 2019
Cited by: 231, Year: 2019
An analysis of neural language modeling at multiple scales
S Merity, NS Keskar, R Socher
arXiv preprint arXiv:1803.08240, 2018
Cited by: 175, Year: 2018
A closer look at deep learning heuristics: Learning rate restarts, warmup and distillation
A Gotmare, NS Keskar, C Xiong, R Socher
arXiv preprint arXiv:1810.13243, 2018
Cited by: 159, Year: 2018
GeDi: Generative discriminator guided sequence generation
B Krause, AD Gotmare, B McCann, NS Keskar, S Joty, R Socher, ...
arXiv preprint arXiv:2009.06367, 2020
Cited by: 125, Year: 2020
ProGen: Language modeling for protein generation
A Madani, B McCann, N Naik, NS Keskar, N Anand, RR Eguchi, ...
arXiv preprint arXiv:2004.03497, 2020
Cited by: 124, Year: 2020
Weighted transformer network for machine translation
K Ahmed, NS Keskar, R Socher
arXiv preprint arXiv:1711.02132, 2017
Cited by: 113, Year: 2017
Deep learning-enabled breast cancer hormonal receptor status determination from base-level H&E stains
N Naik, A Madani, A Esteva, NS Keskar, MF Press, D Ruderman, DB Agus, ...
Nature communications 11 (1), 1-8, 2020
Cited by: 91, Year: 2020
Balancing communication and computation in distributed optimization
AS Berahas, R Bollapragada, NS Keskar, E Wei
IEEE Transactions on Automatic Control 64 (8), 3141-3155, 2018
Cited by: 77, Year: 2018
Coarse-grain fine-grain coattention network for multi-evidence question answering
V Zhong, C Xiong, NS Keskar, R Socher
arXiv preprint arXiv:1901.00603, 2019
Cited by: 57, Year: 2019
XLDA: Cross-lingual data augmentation for natural language inference and question answering
J Singh, B McCann, NS Keskar, C Xiong, R Socher
arXiv preprint arXiv:1905.11471, 2019
Cited by: 54, Year: 2019
Sequence-to-sequence prediction using a neural network model
NS Keskar, K Ahmed, R Socher
US Patent App. 15/884,125, 2019
Cited by: 52, Year: 2019
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
A Srivastava, A Rastogi, A Rao, AAM Shoeb, A Abid, A Fisch, AR Brown, ...
arXiv preprint arXiv:2206.04615, 2022
Cited by: 44, Year: 2022
Multitask learning as question answering
B McCann, NS Keskar, C Xiong, R Socher
US Patent 10,776,581, 2020
Cited by: 41, Year: 2020
Multitask Learning As Question Answering
NS Keskar, B McCann, C Xiong, R Socher
US Patent App. 15/974,075, 2019
Cited by: 41, Year: 2019
adaQN: An adaptive quasi-Newton algorithm for training RNNs
NS Keskar, AS Berahas
Joint European conference on machine learning and knowledge discovery in …, 2016
Cited by: 40, Year: 2016