Follow
Min Si
Title
Cited by
Cited by
Year
MT-MPI: Multithreaded MPI for many-core environments
M Si, AJ Peña, P Balaji, M Takagi, Y Ishikawa
Proceedings of the 28th ACM international conference on Supercomputing, 125-134, 2014
632014
Casper: An asynchronous progress model for MPI RMA on many-core architectures
M Si, AJ Pena, J Hammond, P Balaji, M Takagi, Y Ishikawa
2015 IEEE International Parallel and Distributed Processing Symposium, 665-676, 2015
552015
The glorious Glasgow Haskell compilation system user’s guide
GHC Team
Version 6 (1), 2005
45*2005
Why is MPI so slow? analyzing the fundamental limits in implementing MPI-3.1
K Raffenetti, A Amer, L Oden, C Archer, W Bland, H Fujita, Y Guo, ...
Proceedings of the international conference for high performance computing …, 2017
332017
Direct MPI library for Intel Xeon Phi co-processors
M Si, Y Ishikawa, M Tatagi
2013 IEEE International Symposium on Parallel & Distributed Processing …, 2013
292013
Scalable deep learning via I/O analysis and optimization
S Pumma, M Si, WC Feng, P Balaji
ACM Transactions on Parallel Computing (TOPC) 6 (2), 1-34, 2019
242019
Parallel I/O optimizations for scalable deep learning
S Pumma, M Si, W Feng, P Balaji
2017 IEEE 23rd International Conference on Parallel and Distributed Systems …, 2017
222017
Design of direct communication facility for many-core based accelerators
M Si, Y Ishikawa
2012 IEEE 26th International Parallel and Distributed Processing Symposium …, 2012
212012
Towards scalable deep learning via I/O analysis and optimization
S Pumma, M Si, W Feng, P Balaji
2017 IEEE 19th International Conference on High Performance Computing and …, 2017
202017
Process-in-process: techniques for practical address-space sharing
A Hori, M Si, B Gerofi, M Takagi, J Dayal, P Balaji, Y Ishikawa
Proceedings of the 27th International Symposium on High-Performance Parallel …, 2018
152018
Process-based asynchronous progress model for MPI point-to-point communication
M Si, P Balaji
2017 IEEE 19th International Conference on High Performance Computing and …, 2017
132017
Scaling NWChem with efficient and portable asynchronous communication in MPI RMA
M Si, AJ Pena, J Hammond, P Balaji, Y Ishikawa
2015 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid …, 2015
102015
Software combining to mitigate multithreaded MPI contention
A Amer, C Archer, M Blocksome, C Cao, M Chuvelev, H Fujita, ...
Proceedings of the ACM International Conference on Supercomputing, 367-379, 2019
82019
Dynamic adaptable asynchronous progress model for MPI RMA multiphase applications
M Si, AJ Pena, J Hammond, P Balaji, M Takagi, Y Ishikawa
IEEE Transactions on Parallel and Distributed Systems 29 (9), 1975-1989, 2018
72018
An MPI Library Implementing Direct Communication for Many-Core based Accelerators
M Si, Y Ishikawa
2012 SC Companion: High Performance Computing, Networking, Storage and …, 2012
62012
Cab-mpi: Exploring interprocess work-stealing towards balanced mpi communication
K Ouyang, M Si, A Hori, Z Chen, P Balaji
SC20: International Conference for High Performance Computing, Networking …, 2020
32020
Daps: A Dynamic Asynchronous Progress Stealing Model for MPI Communication
K Ouyang, M Si, A Hori, Z Chen, P Balaji
2021 IEEE International Conference on Cluster Computing (CLUSTER), 516-527, 2021
22021
Auto-precision scaling for distributed deep learning
R Han, J Demmel, Y You
International Conference on High Performance Computing, 79-97, 2021
12021
Proceedings of the 12th International Workshop on Programming Models and Applications for Multicores and Manycores
Q Chen, Z Huang, M Si
ACM, 2021
12021
Dynamic scaling for low-precision learning
R Han, M Si, J Demmel, Y You
Proceedings of the 26th ACM SIGPLAN Symposium on Principles and Practice of …, 2021
12021
The system can't perform the operation now. Try again later.
Articles 1–20