Yakun Sophia Shao

Cited by

	All	Since 2019
Citations	3434	2927
h-index	26	24
i10-index	34	33

940

470

235

705

2014201520162017201820192020202120222023202433 89 99 120 137 190 358 577 682 931 186

Public access

View all

23 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Steve KecklerVice President of Architecture Research, NVIDIAVerified email at cs.utexas.edu
Joel EmerProfessor of the Practice, MIT - Sr. Distinguished Research Scientist, NvidiaVerified email at csail.mit.edu
David BrooksHaley Family Professor of Computer Science, Harvard UniversityVerified email at eecs.harvard.edu
Rangharajan VenkatesanSenior Research ScientistVerified email at nvidia.com
Gu-Yeon WeiRobert and Suzanne Case Professor of EE and CS, Harvard UniversityVerified email at eecs.harvard.edu
William DallyChief Scientist, NVIDIA; Adjunct Professor of CS and EE, Stanford University;Verified email at stanford.edu
Jason ClemonsNVIDIA - Senior Research ScientistVerified email at nvidia.com
Brian ZimmerSenior Research Scientist, NVIDIAVerified email at nvidia.com
Nathaniel PinckneyNVIDIAVerified email at umich.edu
Brandon ReagenAssistant Professor, New York UniversityVerified email at nyu.edu
Alicia KlinefelterResearch Scientist at NVidiaVerified email at nvidia.com
Nan JiangNVIDIAVerified email at stanford.edu
Angshuman ParasharSr. Research Scientist, NVIDIAVerified email at nvidia.com
Sam (Likun) XiGoogle Inc.Verified email at google.com
Vijayalakshmi SrinivasanIBM T.J. Watson Research CenterVerified email at us.ibm.com
Bob AdolfSoftware Research Scientist, IntelVerified email at intel.com
Rafael Garibotti, PhD.Principal FPGA Engineer - Vector TradingVerified email at vectrading.com
Judson PorterHarvard UniversityVerified email at eecs.harvard.edu

Yakun Sophia Shao

Assistant Professor, UC Berkeley

Verified email at berkeley.edu - Homepage

Computer Architecture VLSI


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Aladdin: A Pre-RTL, Power-Performance Accelerator Simulator Enabling Large Design Space Exploration of Customized Architectures YS Shao, B Reagen, GY Wei, D Brooks International Symposium on Computer Architecture (ISCA), 2014	382	2014
Timeloop: A Systematic Approach to DNN Accelerator Evaluation A Parashar, P Raina, YS Shao, YH Chen, VA Ying, A Mukkara, ... International Symposium on Performance Analysis of Systems and Software (ISPASS), 2019	368	2019
Simba: Scaling deep-learning inference with multi-chip-module-based architecture YS Shao, J Clemons, R Venkatesan, B Zimmer, M Fojtik, N Jiang, B Keller, ... Proceedings of the 52nd Annual IEEE/ACM International Symposium on …, 2019	361	2019
MachSuite: Benchmarks for Accelerator Design and Customized Architectures B Reagen, R Adolf, YS Shao, GY Wei, D Brooks IEEE International Symposium on Workload Characterization (IISWC), 2014	307	2014
Chipyard: Integrated design, simulation, and implementation framework for custom socs A Amid, D Biancolin, A Gonzalez, D Grubb, S Karandikar, H Liew, ... IEEE Micro 40 (4), 10-21, 2020	211	2020
Co-Designing Accelerators and SoC Interfaces using gem5-Aladdin YS Shao, SL Xi, V Srinivasan, GY Wei, D Brooks International Symposium on Microarchitecture (MICRO), 2016	194	2016
Energy Characterization and Instruction-Level Energy Model of Intel’s Xeon Phi Processor YS Shao, D Brooks International Symposium on Low Power Electronics and Design (ISLPED), 2013	176	2013
Gemmini: Enabling systematic deep-learning architecture evaluation via full-stack integration H Genc, S Kim, A Amid, A Haj-Ali, V Iyer, P Prakash, J Zhao, D Grubb, ... Design Automation Conference (DAC), 2021	162	2021
Magnet: A modular accelerator generator for neural networks R Venkatesan, YS Shao, M Wang, J Clemons, S Dai, M Fojtik, B Keller, ... 2019 IEEE/ACM International Conference on Computer-Aided Design (ICCAD), 1-8, 2019	113	2019
Neurovectorizer: End-to-end vectorization with deep reinforcement learning A Haj-Ali, NK Ahmed, T Willke, YS Shao, K Asanovic, I Stoica Proceedings of the 18th ACM/IEEE International Symposium on Code Generation …, 2020	101	2020
ISA-Independent Workload Characterization and its Implications for Specialized Architectures YS Shao, D Brooks International Symposium on Performance Analysis of Systems and Software …, 2013	94	2013
A 0.32–128 TOPS, scalable multi-chip-module-based deep neural network inference accelerator with ground-referenced signaling in 16 nm B Zimmer, R Venkatesan, YS Shao, J Clemons, M Fojtik, N Jiang, B Keller, ... IEEE Journal of Solid-State Circuits 55 (4), 920-932, 2020	90	2020
Gemmini: An agile systolic array generator enabling systematic evaluations of deep-learning architectures H Genc, A Haj-Ali, V Iyer, A Amid, H Mao, J Wright, C Schmidt, J Zhao, ... arXiv preprint arXiv:1911.09925 3, 25, 2019	87	2019
CoSA: Scheduling by Constrained Optimization for Spatial Accelerators Q Huang, M Kang, G Dinh, T Norell, A Kalaiah, J Demmel, J Wawrzynek, ... International Symposium on Computer Architecture (ISCA), 2021	83	2021
A modular digital VLSI flow for high-productivity SoC design B Khailany, E Khmer, R Venkatesan, J Clemons, JS Emer, M Fojtik, ... Proceedings of the 55th Annual Design Automation Conference, 1-6, 2018	73	2018
Buffets: An Efficient and Composable Storage Idiom for Explicit Decoupled Data Orchestration M Pellauer, YS Shao, J Clemons, N Crago, K Hegde, R Ventakesan, ... International Conference on Architectural Support for Programming Languages …, 2019	60	2019
The Aladdin Approach to Accelerator Design and Modeling YS Shao, B Reagen, GY Wei, D Brooks IEEE Micro, 2015	58	2015
A 0.11 pj/op, 0.32-128 tops, scalable multi-chip-module-based deep neural network accelerator with ground-reference signaling in 16nm B Zimmer, R Venkatesan, YS Shao, J Clemons, M Fojtik, N Jiang, B Keller, ... 2019 Symposium on VLSI Circuits, C300-C301, 2019	52	2019
SNAP: A 1.67—21.55 TOPS/W sparse neural acceleration processor for unstructured sparse deep neural network inference in 16nm CMOS JF Zhang, CE Lee, C Liu, YS Shao, SW Keckler, Z Zhang 2019 Symposium on VLSI Circuits, C306-C307, 2019	50	2019
SNAP: An Efficient Sparse Neural Acceleration Processor for Unstructured Sparse Deep Neural Network Inference JF Zhang, CE Lee, C Liu, YS Shao, SW Keckler, Z Zhang IEEE Journal of Solid-State Circuits, 2020	49	2020

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors