Sharan Chetlur
Title
Cited by
Year
cudnn: Efficient primitives for deep learning
S Chetlur, C Woolley, P Vandermersch, J Cohen, J Tran, B Catanzaro, ...
arXiv preprint arXiv:1410.0759, 2014
Cited by: 1864; Year: 2014
cuDNN: Efficient primitives for deep learning. CoRR abs/1410.0759 (2014)
S Chetlur, C Woolley, P Vandermersch, J Cohen, J Tran, B Catanzaro, ...
arXiv preprint arXiv:1410.0759, 2014
Cited by: 47; Year: 2014
Shelhamer, E. cudnn: Efficient primitives for deep learning. arXiv 2014
S Chetlur, C Woolley, P Vandermersch, J Cohen, J Tran, B Catanzaro
arXiv preprint arXiv:1410.0759, 2019
Cited by: 38; Year: 2019
cudnn: Efficient primitives for deep learning
S Chetlur, C Woolley, P Vandermersch, J Cohen, J Tran, B Catanzaro, ...
arXiv preprint arXiv:1410.0759, 2014
Cited by: 26; Year: 2014
cudnn: Efficient primitives for deep learning, CoRR abs/1410.0759
S Chetlur, C Woolley, P Vandermersch, J Cohen, J Tran, B Catanzaro, ...
arXiv preprint arXiv:1410.0759, 2014
Cited by: 11; Year: 2014
Wafer-Scale Fast Fourier Transforms
M Orenes-Vera, I Sharapov, R Schreiber, M Jacquelin, P Vandermersch, ...
arXiv preprint arXiv:2209.15040, 2022
2022
System and method for re-factorizing a square matrix into lower and upper triangular matrices on a parallel processor
PV Maxim Naumov, Sharanyan Chetlur, Lung Sheng Chien, Robert Strzodka
US Patent US 9170836 B2, 2014
2014