Follow
Jens Domke
Jens Domke
RIKEN Center for Computational Science (R-CCS) / Tokyo Institute of Technology
Verified email at riken.jp - Homepage
Title
Cited by
Cited by
Year
Deadlock-free oblivious routing for arbitrary topologies
J Domke, T Hoefler, WE Nagel
2011 IEEE International Parallel & Distributed Processing Symposium, 616-627, 2011
832011
Fail-in-place network design: interaction between topology, routing algorithm and failures
J Domke, T Hoefler, S Matsuoka
SC'14: Proceedings of the International Conference for High Performance …, 2014
452014
Matrix engines for high performance computing: A paragon of performance or grasping at straws?
J Domke, E Vatai, A Drozd, P ChenT, Y Oyama, L Zhang, S Salaria, ...
2021 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2021
362021
High-performance routing with multipathing and path diversity in ethernet and HPC networks
M Besta, J Domke, M Schneider, M Konieczny, S Di Girolamo, ...
IEEE Transactions on Parallel and Distributed Systems 32 (4), 943-959, 2020
362020
Mitigating inter-job interference using adaptive flow-aware routing
SA Smith, CE Cromey, DK Lowenthal, J Domke, N Jain, JJ Thiagarajan, ...
SC18: International Conference for High Performance Computing, Networking …, 2018
362018
Why globally re-shuffle? Revisiting data shuffling in large scale deep learning
TT Nguyen, F Trahay, J Domke, A Drozd, E Vatai, J Liao, M Wahib, ...
2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2022
302022
Scheduling-aware routing for supercomputers
J Domke, T Hoefler
SC'16: Proceedings of the International Conference for High Performance …, 2016
282016
MLPerf™ HPC: A holistic benchmark suite for scientific machine learning on HPC systems
S Farrell, M Emani, J Balma, L Drescher, A Drozd, A Fink, G Fox, D Kanter, ...
2021 IEEE/ACM Workshop on Machine Learning in High Performance Computing …, 2021
272021
Routing on the dependency graph: A new approach to deadlock-free high-performance routing
J Domke, T Hoefler, S Matsuoka
proceedings of the 25th ACM international symposium on high-performance …, 2016
262016
HyperX topology: First at-scale implementation and comparison to the fat-tree
J Domke, S Matsuoka, IR Ivanov, Y Tsushima, T Yuki, A Nomura, S Miura, ...
Proceedings of the International Conference for High Performance Computing …, 2019
242019
Preliminary performance analysis of multi-rail fat-tree networks
N Wolfe, M Mubarak, N Jain, J Domke, A Bhatele, CD Carothers, RB Ross
2017 17th IEEE/ACM International Symposium on Cluster, Cloud and Grid …, 2017
232017
Double-precision fpus in high-performance computing: an embarrassment of riches?
J Domke, K Matsumura, M Wahib, H Zhang, K Yashima, T Tsuchikawa, ...
2019 IEEE International Parallel and Distributed Processing Symposium (IPDPS …, 2019
222019
Scaling distributed deep learning workloads beyond the memory capacity with KARMA
M Wahib, H Zhang, TT Nguyen, A Drozd, J Domke, L Zhang, R Takano, ...
SC20: International Conference for High Performance Computing, Networking …, 2020
212020
Hardware-centric analysis of network performance for MPI applications
KA Brown, J Domke, S Matsuoka
2015 IEEE 21st International Conference on Parallel and Distributed Systems …, 2015
152015
Myths and legends in high-performance computing
S Matsuoka, J Domke, M Wahib, A Drozd, T Hoefler
The International Journal of High Performance Computing Applications 37 (3-4 …, 2023
122023
High-performance gpu-to-cpu transpilation and optimization via high-level parallel constructs
WS Moses, IR Ivanov, J Domke, T Endo, J Doerfert, O Zinenko
Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and …, 2023
122023
Toward reliable validation of hpc network simulation models
M Mubarak, N Jain, J Domke, N Wolfe, C Ross, K Li, A Bhatele, ...
2017 Winter Simulation Conference (WSC), 659-674, 2017
122017
Tracing data movements within MPI collectives
KA Brown, J Domke, S Matsuoka
Proceedings of the 21st European MPI Users' Group Meeting, 117-118, 2014
112014
Runtime tracing of the community earth system model: feasibility study and benefits
J Domke, D Wang
Procedia Computer Science 9, 1950-1958, 2012
112012
Optimizing asynchronous multi-level checkpoint/restart configurations with machine learning
T Dey, K Sato, B Nicolae, J Guo, J Domke, W Yu, F Cappello, K Mohror
2020 IEEE International Parallel and Distributed Processing Symposium …, 2020
102020
The system can't perform the operation now. Try again later.
Articles 1–20