BLIS: A Framework for Rapidly Instantiating BLAS Functionality FG Van Zee, RA Van De Geijn ACM Transactions on Mathematical Software (TOMS) 41 (3), 14, 2015 | 293 | 2015 |
Programming matrix algorithms-by-blocks for thread-level parallelism G Quintana-Ortí, ES Quintana-Ortí, RAVD Geijn, FGV Zee, E Chan ACM Transactions on Mathematical Software (TOMS) 36 (3), 1-26, 2009 | 185 | 2009 |
Supermatrix: a multithreaded runtime scheduling system for algorithms-by-blocks E Chan, FG Van Zee, P Bientinesi, ES Quintana-Orti, G Quintana-Orti, ... Proceedings of the 13th ACM SIGPLAN Symposium on Principles and practice of …, 2008 | 134 | 2008 |
Anatomy of high-performance many-threaded matrix multiplication TM Smith, R Van De Geijn, M Smelyanskiy, JR Hammond, FG Van Zee 2014 IEEE 28th International Parallel and Distributed Processing Symposium …, 2014 | 133 | 2014 |
Scalable parallelization of FLAME code via the workqueuing model FG Van Zee, P Bientinesi, TM Low, RA Van De Geijn A Systematic Approach to Matrix Computations 10, 169, 2008 | 122* | 2008 |
The libflame library for dense matrix computations FG Van Zee, E Chan, RA Van de Geijn, ES Quintana-Orti, G Quintana-Orti Computing in science & engineering 11 (6), 56-63, 2009 | 121 | 2009 |
The BLIS framework: Experiments in portability FG Van Zee, TM Smith, B Marker, TM Low, RAVD Geijn, FD Igual, ... ACM Transactions on Mathematical Software (TOMS) 42 (2), 1-19, 2016 | 97 | 2016 |
Scheduling of QR factorization algorithms on SMP and multi-core architectures G Quintana-Orti, ES Quintana-Orti, E Chan, RA Van de Geijn, FG Van Zee 16th Euromicro Conference on Parallel, Distributed and Network-Based …, 2008 | 74 | 2008 |
Satisfying your dependencies with SuperMatrix E Chan, FG Van Zee, ES Quintana-Orti, G Quintana-Orti, R Van De Geijn 2007 IEEE International Conference on Cluster Computing, 91-99, 2007 | 56 | 2007 |
The FLAME approach: From dense linear algebra algorithms to high-performance multi-accelerator implementations FD Igual, E Chan, ES Quintana-Ortí, G Quintana-Ortí, RA Van De Geijn, ... Journal of Parallel and Distributed Computing 72 (9), 1134-1143, 2012 | 52 | 2012 |
Accumulating Householder transformations, revisited T Joffrain, TM Low, ES Quintana-Ortí, R Geijn, FGV Zee ACM Transactions on Mathematical Software (TOMS) 32 (2), 169-179, 2006 | 46 | 2006 |
Implementing high-performance complex matrix multiplication via the 3m and 4m methods FG Van Zee, TM Smith ACM Transactions on Mathematical Software (TOMS) 44 (1), 1-36, 2017 | 29 | 2017 |
Toward scalable matrix multiply on multithreaded architectures B Marker, FGV Zee, K Goto, G Quintana-Orti, RA Geijn European Conference on Parallel Processing, 748-757, 2007 | 29 | 2007 |
BLIS: A framework for generating BLAS-like libraries FG Van Zee, RA van de Geijn The University of Texas at Austin, Department of Computer Sciences …, 2012 | 25 | 2012 |
Making programming synonymous with programming for linear algebra libraries M Castillo, E Chan, FD Igual, R Mayo, ES QUINTANAORTI, ... The University of Texas at Austin, Department of Computer Sciences, Tech …, 2008 | 20 | 2008 |
Design of scalable dense linear algebra libraries for multithreaded architectures: the LU factorization G Quintana-Ortí, ES Quintana-Ortí, E Chan, RA Van De Geijn, FG Van Zee 2008 IEEE International Symposium on Parallel and Distributed Processing, 1-8, 2008 | 18 | 2008 |
Restructuring the tridiagonal and bidiagonal QR algorithms for performance FG Van Zee, RA Van de Geijn, G Quintana-Orti ACM Transactions on Mathematical Software (TOMS) 40 (3), 1-34, 2014 | 16* | 2014 |
Restructuring the QR-Algorithm for High-Performance Applications of Givens Rotations FG Van Zee, RA Van De Geijn, G Quintana-Orti | 15 | 2011 |
Families of algorithms for reducing a matrix to condensed form FG Van Zee, RA Van De Geijn, G Quintana-Ortí, GJ Elizondo ACM Transactions on Mathematical Software (TOMS) 39 (1), 1-32, 2012 | 14 | 2012 |
libflame: The Complete Reference (2009) FG Van Zee preparation. http://www. cs. utexas. edu/users/flame, 0 | 13 | |