Streaming end-to-end speech recognition for mobile devices Y He, TN Sainath, R Prabhavalkar, I McGraw, R Alvarez, D Zhao, ... ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 579 | 2019 |
Residual belief propagation: Informed scheduling for asynchronous message passing G Elidan, I McGraw, D Koller arXiv preprint arXiv:1206.6837, 2012 | 360 | 2012 |
Personalized speech recognition on mobile devices I McGraw, R Prabhavalkar, R Alvarez, MG Arenas, K Rao, D Rybach, ... 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 190 | 2016 |
A streaming on-device end-to-end model surpassing server-side conventional model quality and latency TN Sainath, Y He, B Li, A Narayanan, R Pang, A Bruguier, S Chang, W Li, ... ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 181 | 2020 |
Lingvo: a modular and scalable framework for sequence-to-sequence modeling J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ... arXiv preprint arXiv:1902.08295, 2019 | 170 | 2019 |
Tool for selecting ink and other objects in an electronic document AJ Simmons, IC McGraw, PL Engrav, B Barabe, OC Braun US Patent 7,454,702, 2008 | 127 | 2008 |
Two-pass end-to-end speech recognition TN Sainath, R Pang, D Rybach, Y He, R Prabhavalkar, W Li, M Visontai, ... arXiv preprint arXiv:1908.10992, 2019 | 124 | 2019 |
The WAMI toolkit for developing, deploying, and evaluating web-accessible multimodal interfaces A Gruenstein, I McGraw, I Badr Proceedings of the 10th international conference on Multimodal interfaces …, 2008 | 112 | 2008 |
A summary of the 2012 JHU CLSP workshop on zero resource speech technologies and models of early language acquisition A Jansen, E Dupoux, S Goldwater, M Johnson, S Khudanpur, K Church, ... 2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013 | 109 | 2013 |
On the compression of recurrent neural networks with an application to LVCSR acoustic modeling for embedded speech recognition R Prabhavalkar, O Alsharif, A Bruguier, I McGraw 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 108 | 2016 |
Collecting Voices from the Cloud. I McGraw, C Lee, IL Hetherington, S Seneff, JR Glass LREC, 1576-1583, 2010 | 97 | 2010 |
Streaming small-footprint keyword spotting using sequence-to-sequence models Y He, R Prabhavalkar, K Rao, W Li, A Bakhtin, I McGraw 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017 | 86 | 2017 |
Speech-enabled card games for incidental vocabulary acquisition in a foreign language I McGraw, B Yoshimoto, S Seneff Speech Communication 51 (10), 1006-1023, 2009 | 70 | 2009 |
Optimizing speech recognition for the edge Y Shangguan, J Li, Q Liang, R Alvarez, I McGraw arXiv preprint arXiv:1909.12408, 2019 | 59 | 2019 |
Learning lexicons from speech using a pronunciation mixture model I McGraw, I Badr, JR Glass IEEE Transactions on Audio, Speech, and Language Processing 21 (2), 357-366, 2012 | 58 | 2012 |
A self-transcribing speech corpus: collecting continuous speech with an online educational game A Gruenstein, I McGraw, A Sutherland International Workshop on Speech and Language Technology in Education, 2009 | 53 | 2009 |
A conversational movie search system based on conditional random fields J Liu, S Cyphers, P Pasupat, I McGraw, J Glass Thirteenth Annual Conference of the International Speech Communication …, 2012 | 49 | 2012 |
A self-labeling speech corpus: Collecting spoken words with an online educational game I McGraw, A Gruenstein, A Sutherland Tenth Annual Conference of the International Speech Communication Association, 2009 | 37 | 2009 |
An efficient streaming non-recurrent on-device end-to-end model with improvements to rare-word modeling TN Sainath, YR He, A Narayanan, R Botros, R Pang, DJ Rybach, ... | 31 | 2021 |