Prati
Charlene J. Yang
Charlene J. Yang
NVIDIA; National Energy Research Scientific Computing Center (NERSC); Pawsey Supercomputing Centre
Potvrđena adresa e-pošte na nvidia.com
Naslov
Citirano
Citirano
Godina
Hierarchical Roofline Analysis for GPUs: Accelerating Performance Optimization for the NERSC‐9 Perlmutter System
C Yang, T Kurth, S Williams
Concurrency and Computation: Practice and Experience 32 (20), e5547, 2020
572020
An Empirical Roofline Methodology for Quantitatively Assessing Performance Portability
C Yang, R Gayatri, T Kurth, P Basu, Z Ronaghi, A Adetokunbo, B Friesen, ...
2018 IEEE/ACM International Workshop on Performance, Portability and …, 2018
442018
A Case Study for Performance Portability using OpenMP 4.5
R Gayatri, C Yang, T Kurth, J Deslippe
Fifth Workshop on Accelerator Programming Using Directives (WACCPD) 2018, 2018
392018
Accelerating Large-Scale Excited-State GW Calculations on Leadership HPC Systems
M Del Ben, C Yang, Z Li, H Felipe, S Louie, J Deslippe
SC20: International Conference for High Performance Computing, Networking …, 2020
252020
Timemory: modular performance analysis for HPC
JR Madsen, MG Awan, H Brunie, J Deslippe, R Gayatri, L Oliker, Y Wang, ...
High Performance Computing: 35th International Conference, ISC High …, 2020
252020
A Novel Multi-Level Integrated Roofline Model Approach for Performance Characterization
T Koskela, Z Matveev, C Yang, A Adedoyin, R Belenov, P Thierry, Z Zhao, ...
International Conference on High Performance Computing, 226-245, 2018
222018
Hierarchical roofline analysis: How to collect data using performance tools on intel cpus and nvidia gpus
C Yang
arXiv preprint arXiv:2009.02449, 2020
212020
Hierarchical Roofline Performance Analysis for Deep Learning Applications
C Yang, Y Wang, S Farrell, T Kurth, S Williams
arXiv preprint arXiv:2009.05257, 2020
152020
Time-Based Roofline for Deep Learning Performance Analysis
Y Wang, C Yang, S Farrell, Y Zhang, T Kurth, S Williams
arXiv preprint arXiv:2009.04598, 2020
122020
8 steps to 3.7 tflop/s on nvidia v100 gpu: Roofline analysis and other tricks
C Yang
arXiv preprint arXiv:2008.11326, 2020
112020
Accelerate Science on Perlmutter with NERSC
C Yang, J Deslippe
Bulletin of the American Physical Society 65 (Peer-Reviewed Talk), 2020
112020
Outcomes of openMP hackathon: openMP application experiences with the offloading model (part II)
B Chapman, B Pham, C Yang, C Daley, C Bertoni, D Kulkarni, ...
OpenMP: Enabling Massive Node-Level Parallelism: 17th International Workshop …, 2021
102021
A Factor Graph Approach to Exploiting Cyclic Prefix for Equalization in OFDM Systems
CJ Yang, Q Guo, DD Huang, S Nordholm
IEEE transactions on communications 61 (12), 4972-4983, 2013
102013
Toward Automated Application Profiling on Cray Systems
C Yang, B Friesen, T Kurth, B Cook, S Williams
Cray User Group (CUG), 2018
82018
Rahulkumar Gayatri, Thorsten Kurth, Protonu Basu, Zahra Ronaghi, Adedoyin Adetokunbo, Brian Friesen, Brandon Cook, Douglas Doerfler, Leonid Oliker, et al. 2018. An empirical …
C Yang
ACM International Workshop on Performance, Portability and Productivity in …, 2018
72018
Outcomes of OpenMP Hackathon: OpenMP Application Experiences with the Offloading Mode
S Pophale, D Oryspayev, B Chapman, B Pham, C Yang, C Daley, ...
Brookhaven National Lab.(BNL), Upton, NY (United States), 2021
62021
A Metric for Evaluating Supercomputer Performance in the Era of Extreme Heterogeneity
B Austin, C Daley, D Doerfler, J Deslippe, B Cook, B Friesen, T Kurth, ...
62018
An extended roofline performance model with pci-e and network ceilings
AS Dufek, JR Deslippe, PT Lin, CJ Yang, BG Cook, J Madsen
2021 International Workshop on Performance Modeling, Benchmarking and …, 2021
52021
Exploiting Cyclic Prefix for Turbo-OFDM Receiver Design
L Xu, CJ Yang, D Huang, A Cantoni
IEEE Access 5, 15762-15775, 2017
52017
A Novel Relay Selection Scheme in Multi-Antenna Cooperative Systems
CJ Yang, Z Zhang, W Meng
2010 IEEE International Conference on Software Engineering and Service …, 2010
52010
Sustav trenutno ne može provesti ovu radnju. Pokušajte ponovo kasnije.
Članci 1–20