Prati
Tan Nguyen
Naslov
Citirano
Citirano
Godina
AMReX: a framework for block-structured adaptive mesh refinement
W Zhang, A Almgren, V Beckner, J Bell, J Blaschke, C Chan, M Day, ...
Journal of Open Source Software 4 (37), 1370-1370, 2019
3282019
Accelerating Viola-Jones face detection to FPGA-level using GPUs
D Hefenbrock, J Oberg, NTN Thanh, R Kastner, SB Baden
2010 18th IEEE Annual International Symposium on Field-Programmable Custom …, 2010
1572010
Boxlib with tiling: An adaptive mesh refinement software framework
W Zhang, A Almgren, M Day, T Nguyen, J Shalf, D Unat
SIAM Journal on Scientific Computing 38 (5), S156-S172, 2016
662016
Bamboo--Translating MPI applications to a latency-tolerant, data-driven form
T Nguyen, P Cicotti, E Bylaska, D Quinlan, SB Baden
SC'12: Proceedings of the International Conference on High Performance …, 2012
412012
Tida: High-level programming abstractions for data locality management
D Unat, T Nguyen, W Zhang, MN Farooqi, B Bastem, G Michelogiannakis, ...
International Conference on High Performance Computing, 116-135, 2016
402016
The performance and energy efficiency potential of fpgas in scientific computing
T Nguyen, S Williams, M Siracusa, C MacLean, D Doerfler, NJ Wright
2020 IEEE/ACM Performance Modeling, Benchmarking and Simulation of High …, 2020
262020
A software-based dynamic-warp scheduling approach for load-balancing the Viola–Jones face detection algorithm on GPUs
T Nguyen, D Hefenbrock, J Oberg, R Kastner, S Baden
Journal of Parallel and Distributed Computing 73 (5), 677-685, 2013
232013
FPGA‐based HPC accelerators: An evaluation on performance and energy efficiency
T Nguyen, C MacLean, M Siracusa, D Doerfler, NJ Wright, S Williams
Concurrency and Computation: Practice and Experience, e6570, 2021
222021
Phase asynchronous AMR execution for productive and performant astrophysical flows
MN Farooqi, T Nguyen, W Zhang, AS Almgren, J Shalf, D Unat
SC18: International Conference for High Performance Computing, Networking …, 2018
102018
Architectural Requirements for Deep Learning Workloads in HPC Environments
KZ Ibrahim, T Nguyen, HA Nam, W Bhimji, S Farrell, L Oliker, M Rowan, ...
2021 International Workshop on Performance Modeling, Benchmarking and …, 2021
92021
Perilla: Metadata-based optimizations of an asynchronous runtime for adaptive mesh refinement
T Nguyen, D Unat, W Zhang, A Almgren, N Farooqi, J Shalf
SC'16: Proceedings of the International Conference for High Performance …, 2016
82016
Automatic translation of MPI source into a latency-tolerant, data-driven form
T Nguyen, P Cicotti, E Bylaska, D Quinlan, S Baden
Journal of Parallel and Distributed Computing 106, 1-13, 2017
62017
Nonintrusive AMR asynchrony for communication optimization
MN Farooqi, D Unat, T Nguyen, W Zhang, A Almgren, J Shalf
European Conference on Parallel Processing, 682-694, 2017
52017
Lu factorization: Towards hiding communication overheads with a lookahead-free algorithm
T Nguyen, SB Baden
2015 IEEE International Conference on Cluster Computing, 394-397, 2015
52015
Preliminary scaling results on multiple hybrid nodes of Knights Corner and Sandy Bridge processors
T Nguyen, SB Baden
Third International Workshop on Domain-Specific Languages and High-Level …, 2013
52013
Experiences Porting the SU3_Bench Microbenchmark to the Intel Arria 10 and Xilinx Alveo U280 FPGAs
D Doerfler, F Fatollahi-Fard, C MacLean, T Nguyen, S Williams, N Wright, ...
International Workshop on OpenCL, 1-9, 2021
32021
Asynchronous AMR on Multi-GPUs
MN Farooqi, T Nguyen, W Zhang, AS Almgren, J Shalf, D Unat
International Conference on High Performance Computing, 113-123, 2019
22019
AMReX
W Zhang, A Myers, A Almgren, V Beckner, M Zingale, M Katz, K Gott, ...
Lawrence Berkeley National Lab.(LBNL), Berkeley, CA (United States …, 2017
22017
Hardware Evaluation Analytical Modeling and Node Simulation: Benefits of Tighter GPU Integration
B Austin, R Bair, K Barker, A Cabrera, A Chien, N Ding, J Firoz, K Ibrahim, ...
Lawrence Berkeley National Lab.(LBNL), Berkeley, CA (United States), 2021
12021
Facilitating CoDesign with Automatic Code Similarity Learning
T Nguyen, E Strohmaier, J Shalf
2021 IEEE/ACM 7th Workshop on the LLVM Compiler Infrastructure in HPC (LLVM …, 2021
2021
Sustav trenutno ne može provesti ovu radnju. Pokušajte ponovo kasnije.
Članci 1–20