Optimization of sparse matrix-vector multiplication on emerging multicore platforms S Williams, L Oliker, R Vuduc, J Shalf, K Yelick, J Demmel SC'07: Proceedings of the 2007 ACM/IEEE Conference on Supercomputing, 1-12, 2007 | 952 | 2007 |
Stencil computation optimization and auto-tuning on state-of-the-art multicore architectures K Datta, M Murphy, V Volkov, S Williams, J Carter, L Oliker, D Patterson, ... SC'08: Proceedings of the 2008 ACM/IEEE conference on Supercomputing, 1-12, 2008 | 805 | 2008 |
The potential of the cell processor for scientific computing S Williams, J Shalf, L Oliker, S Kamil, P Husbands, K Yelick Proceedings of the 3rd Conference on Computing Frontiers, 9-20, 2006 | 482 | 2006 |
Optimization and performance modeling of stencil computations on modern microprocessors K Datta, S Kamil, S Williams, L Oliker, J Shalf, K Yelick SIAM review 51 (1), 129-159, 2009 | 304 | 2009 |
An auto-tuning framework for parallel multicore stencil computations S Kamil, C Chan, L Oliker, J Shalf, S Williams 2010 IEEE international symposium on parallel & distributed processing …, 2010 | 285 | 2010 |
A whole-genome shotgun approach for assembling and anchoring the hexaploid bread wheat genome JA Chapman, M Mascher, A Buluç, K Barry, E Georganas, A Session, ... Genome biology 16 (1), 1-17, 2015 | 279 | 2015 |
Implicit and explicit optimizations for stencil computations S Kamil, K Datta, S Williams, L Oliker, J Shalf, K Yelick Proceedings of the 2006 workshop on Memory system performance and …, 2006 | 196 | 2006 |
PLUM: Parallel load balancing for adaptive unstructured meshes L Oliker, R Biswas Journal of Parallel and Distributed Computing 52 (2), 150-177, 1998 | 184 | 1998 |
Job superscheduler architecture and performance in computational grid environments H Shan, L Oliker, R Biswas SC'03: Proceedings of the 2003 ACM/IEEE Conference on Supercomputing, 44-44, 2003 | 146 | 2003 |
Reduced-bandwidth multithreaded algorithms for sparse matrix-vector multiplication A Buluc, S Williams, L Oliker, J Demmel 2011 IEEE International Parallel & Distributed Processing Symposium, 721-733, 2011 | 144 | 2011 |
Lattice Boltzmann simulation optimization on leading multicore platforms S Williams, J Carter, L Oliker, J Shalf, K Yelick 2008 IEEE International Symposium on Parallel and Distributed Processing, 1-14, 2008 | 138 | 2008 |
Scientific computing kernels on the cell processor S Williams, J Shalf, L Oliker, S Kamil, P Husbands, K Yelick International Journal of Parallel Programming 35 (3), 263-298, 2007 | 135 | 2007 |
Impact of modern memory subsystems on cache optimizations for stencil computations S Kamil, P Husbands, L Oliker, J Shalf, K Yelick Proceedings of the 2005 workshop on Memory system performance, 36-43, 2005 | 133 | 2005 |
Scientific computations on modern parallel vector systems L Oliker, A Canning, J Carter, J Shalf, S Ethier SC'04: Proceedings of the 2004 ACM/IEEE Conference on Supercomputing, 10-10, 2004 | 110 | 2004 |
Roofline model toolkit: A practical tool for architectural and program analysis YJ Lo, S Williams, BV Straalen, TJ Ligocki, MJ Cordery, NJ Wright, ... International Workshop on Performance Modeling, Benchmarking and Simulation …, 2014 | 108 | 2014 |
Analyzing ultra-scale application communication requirements for a reconfigurable hybrid interconnect J Shalf, S Kamil, L Oliker, D Skinner SC'05: Proceedings of the 2005 ACM/IEEE Conference on Supercomputing, 17-17, 2005 | 98 | 2005 |
Parallel de bruijn graph construction and traversal for de novo genome assembly E Georganas, A Buluç, J Chapman, L Oliker, D Rokhsar, K Yelick SC'14: Proceedings of the International Conference for High Performance …, 2014 | 93 | 2014 |
Effects of ordering strategies and programming paradigms on sparse matrix computations L Oliker, X Li, P Husbands, R Biswas Siam Review 44 (3), 373-393, 2002 | 91 | 2002 |
Optimizing and tuning the fast multipole method for state-of-the-art multicore architectures A Chandramowlishwaran, S Williams, L Oliker, I Lashuk, G Biros, R Vuduc 2010 IEEE International Symposium on Parallel & Distributed Processing …, 2010 | 90 | 2010 |
Analysis of photonic networks for a chip multiprocessor using scientific applications G Hendry, S Kamil, A Biberman, J Chan, BG Lee, M Mohiyuddin, A Jain, ... 2009 3rd ACM/IEEE International Symposium on Networks-on-Chip, 104-113, 2009 | 88 | 2009 |