Supporting compressed-sparse activations and weights on SIMD-like accelerator for sparse convolutional neural networks CY Lin, BC Lai 2018 23rd Asia and South Pacific Design Automation Conference (ASP-DAC), 105-110, 2018 | 19 | 2018 |
Atom: Low-bit quantization for efficient and accurate llm serving Y Zhao, CY Lin, K Zhu, Z Ye, L Chen, S Zheng, L Ceze, A Krishnamurthy, ... arXiv preprint arXiv:2310.19102, 2023 | 13 | 2023 |
Enhancing utilization of SIMD-like accelerator for sparse convolutional neural networks BC Lai, JW Pan, CY Lin IEEE Transactions on Very Large Scale Integration (VLSI) Systems 27 (5 …, 2019 | 13 | 2019 |
Apparatus and Method of Using Dual Indexing in Input Neurons and Corresponding Weights of Sparse Neural Network CY Lin, BC Lai US Patent App. 15/594,667, 2018 | 11 | 2018 |
Accelerating spmm kernel with cache-first edge sampling for graph neural networks CY Lin, L Luo, L Ceze arXiv preprint arXiv:2104.10716, 2021 | 3 | 2021 |
Design of Application Specific Throughput Processor for Matrix Operations PJ Wu, CY Lin, BCC Lai 2015 18th International Conference on Network-Based Information Systems, 324-331, 2015 | 2 | 2015 |
SPIN: An Empirical Evaluation on Sharing Parameters of Isotropic Networks CY Lin, A Prabhu, T Merth, S Mehta, A Ranjan, M Horton, M Rastegari European Conference on Computer Vision, 553-568, 2022 | 1 | 2022 |
Encode Once and Decode in Parallel: Efficient Transformer Decoding Bo-Ru Lu, Nikita Haduong, Chien-Yu Lin, Hao Cheng, Noah A. Smith, Mari Ostendorf arXiv preprint arXiv:2403.13112, 2024 | | 2024 |
FastSR-NeRF: Improving NeRF Efficiency on Consumer Devices with A Simple Super-Resolution Pipeline CY Lin, Q Fu, T Merth, K Yang, A Ranjan Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024 | | 2024 |
Supplementary Material for SPIN: An Empirical Evaluation on Sharing Parameters of Isotropic Networks CY Lin, A Prabhu, T Merth, S Mehta, A Ranjan, M Horton, M Rastegari | | |