Xu Liu
Evaluating modern GPU interconnect: PCIe, NVLink, NV-SLI, NVSwitch and GPUDirect
A Li, SL Song, J Chen, J Li, X Liu, NR Tallent, KJ Barker
IEEE Transactions on Parallel and Distributed Systems 31 (1), 94-110, 2019
OMPT: An OpenMP tools application programming interface for performance analysis
AE Eichenberger, J Mellor-Crummey, M Schulz, M Wong, N Copty, ...
International Workshop on OpenMP, 171-185, 2013
A tool to analyze the performance of multithreaded programs on NUMA architectures
X Liu, J Mellor-Crummey
ACM SIGPLAN Notices 49 (8), 259-272, 2014
Locality-aware CTA clustering for modern GPUs
A Li, SL Song, W Liu, X Liu, A Kumar, H Corporaal
ACM SIGARCH Computer Architecture News 45 (1), 297-311, 2017
FLEP: Enabling flexible and efficient preemption on GPUs
B Wu, X Liu, X Zhou, C Jiang
ACM SIGPLAN Notices 52 (4), 483-496, 2017
memif: Towards programming heterogeneous memory asynchronously
FX Lin, X Liu
ACM SIGPLAN Notices 51 (4), 369-383, 2016
A data-centric profiler for parallel programs
X Liu, J Mellor-Crummey
SC'13: Proceedings of the International Conference on High Performance …, 2013
Tartan: evaluating modern GPU interconnect via a multi-GPU benchmark suite
A Li, SL Song, J Chen, X Liu, N Tallent, K Barker
2018 IEEE International Symposium on Workload Characterization (IISWC), 191-202, 2018
CVR: Efficient vectorization of SpMV on x86 processors
B Xie, J Zhan, X Liu, W Gao, Z Jia, X He, L Zhang
Proceedings of the 2018 International Symposium on Code Generation and …, 2018
Pinpointing data locality problems using data-centric analysis
X Liu, J Mellor-Crummey
International Symposium on Code Generation and Optimization (CGO 2011), 171-180, 2011
ScaAnalyzer: A tool to identify memory scalability bottlenecks in parallel programs
X Liu, B Wu
SC'15: Proceedings of the International Conference for High Performance …, 2015
CUDAAdvisor: LLVM-based runtime profiling for modern GPUs
D Shen, SL Song, A Li, X Liu
Proceedings of the 2018 International Symposium on Code Generation and …, 2018
Pinpointing data locality bottlenecks with low overhead
X Liu, J Mellor-Crummey
2013 IEEE International Symposium on Performance Analysis of Systems and …, 2013
Towards efficient SpMV on Sunway manycore architectures
C Liu, B Xie, X Liu, W Xue, H Yang, X Liu
Proceedings of the 2018 International Conference on Supercomputing, 363-373, 2018
OMPT and OMPD: OpenMP tools application programming interfaces for performance analysis and debugging
A Eichenberger, J Mellor-Crummey, M Schulz, N Copty, J DelSignore, ...
International Workshop on OpenMP (IWOMP 2013), 2013
Call paths for Pin tools
M Chabbi, X Liu, J Mellor-Crummey
Proceedings of Annual IEEE/ACM International Symposium on Code Generation …, 2014
Characterizing emerging heterogeneous memory
D Shen, X Liu, FX Lin
ACM SIGPLAN Notices 51 (11), 13-23, 2016
A new approach for performance analysis of OpenMP programs
X Liu, J Mellor-Crummey, M Fagan
Proceedings of the 27th international ACM conference on International …, 2013
RedSpy: Exploring value locality in software
S Wen, M Chabbi, X Liu
Proceedings of the Twenty-Second International Conference on Architectural …, 2017
Watching for software inefficiencies with Witch
S Wen, X Liu, J Byrne, M Chabbi
Proceedings of the Twenty-Third International Conference on Architectural …, 2018