Tensor comprehensions: Framework-agnostic high-performance machine learning abstractions N Vasilache, O Zinenko, T Theodoridis, P Goyal, Z DeVito, WS Moses, ... arXiv preprint arXiv:1802.04730, 2018 | 499 | 2018 |
MLIR: Scaling compiler infrastructure for domain specific computation C Lattner, M Amini, U Bondhugula, A Cohen, A Davis, J Pienaar, R Riddle, ... 2021 IEEE/ACM International Symposium on Code Generation and Optimization …, 2021 | 471 | 2021 |
Learning visual features from large weakly supervised data A Joulin, L Van Der Maaten, A Jabri, N Vasilache Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The …, 2016 | 442 | 2016 |
Fast convolutional nets with fbfft: A GPU performance evaluation N Vasilache, J Johnson, M Mathieu, S Chintala, S Piantino, Y LeCun arXiv preprint arXiv:1412.7580, 2014 | 413 | 2014 |
Semi-automatic composition of loop transformations for deep parallelism and memory hierarchies S Girbal, N Vasilache, C Bastoul, A Cohen, D Parello, M Sigler, O Temam International Journal of Parallel Programming 34, 261-317, 2006 | 324 | 2006 |
MLIR: A compiler infrastructure for the end of Moore's law C Lattner, M Amini, U Bondhugula, A Cohen, A Davis, J Pienaar, R Riddle, ... arXiv preprint arXiv:2002.11054, 2020 | 309 | 2020 |
Loop transformations: convexity, pruning and optimization LN Pouchet, U Bondhugula, C Bastoul, A Cohen, J Ramanujam, ... ACM SIGPLAN Notices 46 (1), 549-562, 2011 | 186 | 2011 |
Iterative optimization in the polyhedral model: Part I, one-dimensional time LN Pouchet, C Bastoul, A Cohen, N Vasilache International Symposium on Code Generation and Optimization (CGO'07), 144-156, 2007 | 184 | 2007 |
Systems, methods and apparatus for distributed decision processing J Ezick, R Lethin, N Vasilache US Patent 8,688,619, 2014 | 157 | 2014 |
Runnemede: An architecture for ubiquitous high-performance computing NP Carter, A Agrawal, S Borkar, R Cledat, H David, D Dunning, J Fryman, ... 2013 IEEE 19th International Symposium on High Performance Computer …, 2013 | 139 | 2013 |
Facilitating the search for compositions of program transformations A Cohen, M Sigler, S Girbal, O Temam, D Parello, N Vasilache Proceedings of the 19th annual international conference on Supercomputing …, 2005 | 133 | 2005 |
Polyhedral code generation in the real world N Vasilache, C Bastoul, A Cohen Compiler Construction: 15th International Conference, CC 2006, Held as Part …, 2006 | 128 | 2006 |
GRAPHITE: Polyhedral analyses and optimizations for GCC S Pop, A Cohen, C Bastoul, S Girbal, GA Silber, N Vasilache proceedings of the 2006 GCC developers summit 6, 90-91, 2006 | 126 | 2006 |
A mapping path for multi-GPGPU accelerated computers from a portable high level programming abstraction A Leung, N Vasilache, B Meister, M Baskaran, D Wohlford, C Bastoul, ... Proceedings of the 3rd Workshop on General-Purpose Computation on Graphics …, 2010 | 119 | 2010 |
Efficient and scalable computations with sparse tensors M Baskaran, B Meister, N Vasilache, R Lethin 2012 IEEE Conference on High Performance Extreme Computing, 1-6, 2012 | 97 | 2012 |
System, methods and apparatus for program optimization for multi-threaded processor architectures C Bastoul, RA Lethin, AK Leung, BJ Meister, P Szilagyi, NT Vasilache, ... US Patent 8,930,926, 2015 | 85 | 2015 |
R-Stream Compiler. B Meister, N Vasilache, D Wohlford, MM Baskaran, A Leung, R Lethin Encyclopedia of Parallel Computing, 1756-1765, 2011 | 82 | 2011 |
Violated dependence analysis N Vasilache, C Bastoul, A Cohen, S Girbal Proceedings of the 20th annual international conference on Supercomputing …, 2006 | 72 | 2006 |
The next 700 accelerated layers: From mathematical expressions of network computation graphs to accelerated GPU kernels, automatically N Vasilache, O Zinenko, T Theodoridis, P Goyal, Z Devito, WS Moses, ... ACM Transactions on Architecture and Code Optimization (TACO) 16 (4), 1-26, 2019 | 68 | 2019 |
Joint scheduling and layout optimization to enable multi-level vectorization N Vasilache, B Meister, M Baskaran, R Lethin IMPACT, Paris, France, 2012 | 63 | 2012 |