SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition DS Park, W Chan, Y Zhang, CC Chiu, B Zoph, ED Cubuk, QV Le INTERSPEECH, 2019 | 2781 | 2019 |
Listen, Attend and Spell: A Neural Network for Large Vocabulary Conversational Speech Recognition W Chan, N Jaitly, QV Le, O Vinyals ICASSP, 2016 | 2713* | 2016 |
Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding C Saharia, W Chan, S Saxena, L Li, J Whang, E Denton, ... NeurIPS, 2022 | 840 | 2022 |
Very Deep Convolutional Networks for End-to-End Speech Recognition Y Zhang, W Chan, N Jaitly ICASSP, 2017 | 503 | 2017 |
Image Super-Resolution via Iterative Refinement C Saharia, J Ho, W Chan, T Salimans, D Fleet, M Norouzi IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022 | 350 | 2022 |
WaveGrad: Estimating Gradients for Waveform Generation N Chen, Y Zhang, H Zen, R Weiss, M Norouzi, W Chan ICLR, 2021 | 322 | 2021 |
Advances in Joint CTC-Attention based End-to-End Speech Recognition with a Deep CNN Encoder and RNN-LM T Hori, S Watanabe, Y Zhang, W Chan INTERSPEECH, 2017 | 315 | 2017 |
Palette: Image-to-Image Diffusion Models C Saharia, W Chan, H Chang, C A. Lee, J Ho, D Tim Salimans, J. Fleet, ... SIGGRAPH, 2022 | 275 | 2022 |
Cascaded Diffusion Models for High Fidelity Image Generation J Ho, C Saharia, W Chan, D Fleet, M Norouzi, T Salimans Journal of Machine Learning Research 23 (47), 1-33, 2022 | 266 | 2022 |
Video Diffusion Models J Ho, T Salimans, A Gritsenko, W Chan, M Norouzi, D Fleet NeurIPS, 2022 | 217 | 2022 |
Insertion Transformer: Flexible Sequence Generation via Insertion Operations M Stern, W Chan, J Kiros, J Uszkoreit ICML, 2019 | 202 | 2019 |
Lingvo: a Modular and Scalable Framework for Sequence-to-Sequence Modeling J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ... arXiv preprint arXiv:1902.08295, 2019 | 170 | 2019 |
Imagen Video: High Definition Video Generation with Diffusion Models J Ho, W Chan, C Saharia, J Whang, R Gao, A Gritsenko, D P. Kingma, ... arXiv:2210.02303, 2022 | 146 | 2022 |
Predicting Collective Sentiment Dynamics from Time-series Social Media L Nguyen, P Wu, W Chan, W Peng, Y Zhang SIGKDD WISDOM, 2012 | 133 | 2012 |
Bytes are All You Need: End-to-End Multilingual Speech Recognition and Synthesis with Bytes B Li, Y Zhang, T Sainath, Y Wu, W Chan ICASSP, 2019 | 125 | 2019 |
SpecAugment on Large Scale Datasets D Park, Y Zhang, CC Chiu, Y Chen, B Li, W Chan, Q Le, Y Wu ICASSP, 2020 | 108 | 2020 |
Non-Autoregressive Machine Translation with Latent Alignments C Saharia, W Chan, S Saxena, Norouzi, Mohammad EMNLP, 2020 | 104 | 2020 |
Imputer: Sequence Modelling via Imputation and Dynamic Programming W Chan, C Saharia, G Hinton, M Norouzi, N Jaitly ICML, 2020 | 99 | 2020 |
Transferring Knowledge from a RNN to a DNN W Chan, NR Ke, I Lane INTERSPEECH, 2015 | 92 | 2015 |
BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition Y Zhang, DS Park, W Han, J Qin, A Gulati, J Shor, A Jansen, Y Xu, ... IEEE Journal of Selected Topics in Signal Processing, 2021 | 78 | 2021 |