From loop fusion to kernel fusion: A domain-specific approach to locality optimization B Qiao, O Reiche, F Hannig, J Teich 2019 IEEE/ACM International Symposium on Code Generation and Optimization …, 2019 | 42 | 2019 |
Automatic kernel fusion for image processing DSLs B Qiao, O Reiche, F Hannig, J Teich Proceedings of the 21st International Workshop on Software and Compilers for …, 2018 | 26 | 2018 |
The best of both worlds: Combining CUDA graph with an image processing DSL B Qiao, MA Özkan, J Teich, F Hannig 2020 57th ACM/IEEE Design Automation Conference (DAC), 1-6, 2020 | 11 | 2020 |
HipaccVX: wedding of OpenVX and DSL-based code generation MA Özkan, B Ok, B Qiao, J Teich, F Hannig Journal of Real-Time Image Processing, 1-13, 2020 | 8 | 2020 |
Unveiling kernel concurrency in multiresolution filters on GPUs with an image processing DSL B Qiao, O Reiche, J Teich, F Hannig Proceedings of the 13th Annual Workshop on General Purpose Processing using …, 2020 | 4 | 2020 |
Efficient parallel reduction on GPUs with Hipacc B Qiao, O Reiche, MA Özkan, J Teich, F Hannig Proceedings of the 23rd International Workshop on Software and Compilers for …, 2020 | 3 | 2020 |
Synthesizing High-Performance Image Processing Applications with Hipacc MA Özkan, O Reiche, B Qiao, R Membarth, J Teich, F Hannig University Booth at Design, Automation and Test in Europe (DATE), 2019 | 2 | 2019 |
An Efficient Approach for Image Border Handling on GPUs via Iteration Space Partitioning B Qiao, J Teich, F Hannig IEEE International Parallel and Distributed Processing Symposium Workshops …, 2021 | 1 | 2021 |
Hardware Accelerated Real-time Feature Tracking for Drift Correction B Qiao Eindhoven University of Technology, 2017 | 1 | 2017 |
System-Level Optimization and Code Generation for Graphics Processors using a Domain-Specific Language B Qiao Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU), 2021 | | 2021 |