Guojing Cong
Fast shared-memory algorithms for computing the minimum spanning forest of sparse graphs
DA Bader, G Cong
Journal of Parallel and Distributed Computing 66 (11), 1366-1378, 2006
On the convergence properties of a K-step averaging stochastic gradient descent algorithm for nonconvex optimization
F Zhou, G Cong
arXiv preprint arXiv:1708.01012, 2017
Solving large, irregular graph problems using adaptive work-stealing
G Cong, S Kodali, S Krishnamoorthy, D Lea, V Saraswat, T Wen
2008 37th International Conference on Parallel Processing, 536-545, 2008
On the architectural requirements for efficient execution of graph algorithms
DA Bader, G Cong, J Feo
2005 International Conference on Parallel Processing (ICPP'05), 547-556, 2005
Fast PGAS implementation of distributed graph algorithms
G Cong, G Almasi, V Saraswat
SC'10: Proceedings of the 2010 ACM/IEEE International Conference for High …, 2010
Automated detection of application performance bottlenecks
IH Chung, G Cong, DJ Klepacki, S Sbaraglia, SR Seelam, HF Wen
US Patent 8,225,291, 2012
Iterative, non-uniform profiling method for automatically refining performance bottleneck regions in scientific code
G Cong, PK Malkin
US Patent 8,214,806, 2012
Optimizing large-scale graph analysis on multithreaded, multicore platforms
G Cong, K Makarychev
2012 IEEE 26th International Parallel and Distributed Processing Symposium …, 2012
An experimental study of parallel biconnected components algorithms on symmetric multiprocessors (SMPs)
G Cong, DA Bader
19th IEEE International Parallel and Distributed Processing Symposium, 9 pp., 2005
Programmable framework for automatic tuning of software applications
IH Chung, G Cong, DJ Klepacki, S Sbaraglia, SR Seelam, HF Wen
US Patent 8,327,325, 2012
Profiling application performance according to data structure
IH Chung, G Cong, K Ekanadham, D Klepacki, S Sbaraglia, HF Wen
US Patent 8,490,061, 2013
Application data prefetching on the IBM Blue Gene/Q supercomputer
IH Chung, C Kim, HF Wen, G Cong
SC'12: Proceedings of the International Conference on High Performance …, 2012
Lock-free parallel algorithms: An experimental study
G Cong, D Bader
International Conference on High-Performance Computing, 516-527, 2004
Accelerating data loading in deep neural network training
CC Yang, G Cong
2019 IEEE 26th International Conference on High Performance Computing, Data …, 2019
A framework for automated performance bottleneck detection
IH Chung, G Cong, D Klepacki, S Sbaraglia, S Seelam, HF Wen
2008 IEEE International Symposium on Parallel and Distributed Processing, 1-7, 2008
Techniques for designing efficient parallel graph algorithms for SMPs and multicore processors
G Cong, DA Bader
International Symposium on Parallel and Distributed Processing and …, 2007
An empirical analysis of parallel random permutation algorithms on SMPs
G Cong, DA Bader
Georgia Institute of Technology, 2006
A productivity centered tools framework for application performance tuning
H Wen, S Sbaraglia, S Seelam, I Chung, G Cong, D Klepacki
Fourth International Conference on the Quantitative Evaluation of Systems …, 2007
Fast PGAS connected components algorithms
G Cong, G Almasi, V Saraswat
Proceedings of the Third Conference on Partitioned Global Address Space …, 2009
Designing irregular parallel algorithms with mutual exclusion and lock-free protocols
G Cong, DA Bader
Journal of Parallel and Distributed Computing 66 (6), 854-866, 2006