Prati
Bo Li
Naslov
Citirano
Citirano
Godina
State-of-the-art speech recognition with sequence-to-sequence models
CC Chiu, TN Sainath, Y Wu, R Prabhavalkar, P Nguyen, Z Chen, ...
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
11252018
Streaming End-to-end Speech Recognition for Mobile Devices
Y He, TN Sainath, R Prabhavalkar, I McGraw, R Alvarez, D Zhao, ...
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
5192019
Deep Learning for Audio Signal Processing
H Purwins, B Li, T Virtanen, J Schlüter, SY Chang, T Sainath
IEEE Journal of Selected Topics in Signal Processing 13 (2), 206-219, 2019
4372019
A Comparison of Sequence-to-Sequence Models for Speech Recognition
R Prabhavalkar, K Rao, TN Sainath, B Li, L Johnson, N Jaitly
Proc. Interspeech 2017, 939-943, 2017
3062017
Multichannel Signal Processing With Deep Neural Networks for Automatic Speech Recognition
TN Sainath, RJ Weiss, KW Wilson, B Li, A Narayanan, E Variani, ...
IEEE/ACM Transactions on Audio, Speech, and Language Processing 25 (5), 965-979, 2017
2132017
Exploring speech enhancement with generative adversarial networks for robust speech recognition
C Donahue, B Li, R Prabhavalkar
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
2102018
Comparison of discriminative input and output transformations for speaker adaptation in the hybrid NN/HMM systems
B Li, KC Sim
INTERSPEECH, 526-529, 2010
1962010
Multilingual speech recognition with a single end-to-end model
S Toshniwal, TN Sainath, RJ Weiss, B Li, P Moreno, E Weinstein, K Rao
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
1922018
Acoustic Modeling for Google Home
B Li, T Sainath, A Narayanan, J Caroselli, M Bacchiani, A Misra, I Shafran, ...
INTERSPEECH-2017, 2017
1672017
Improved Noisy Student Training for Automatic Speech Recognition
DS Park, Y Zhang, Y Jia, W Han, CC Chiu, B Li, Y Wu, QV Le
arXiv preprint arXiv:2005.09629, 2020
1582020
A streaming on-device end-to-end model surpassing server-side conventional model quality and latency
TN Sainath, Y He, B Li, A Narayanan, R Pang, A Bruguier, S Chang, W Li, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
1542020
Lingvo: a modular and scalable framework for sequence-to-sequence modeling
J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ...
arXiv preprint arXiv:1902.08295, 2019
1532019
Neural Network Adaptive Beamforming for Robust Multichannel Speech Recognition.
B Li, TN Sainath, RJ Weiss, KW Wilson, M Bacchiani
INTERSPEECH, 1976-1980, 2016
1222016
Bytes are All You Need: End-to-End Multilingual Speech Recognition and Synthesis with Bytes
B Li, Y Zhang, T Sainath, Y Wu, W Chan
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
1092019
Multi-dialect speech recognition with a single sequence-to-sequence model
B Li, TN Sainath, KC Sim, M Bacchiani, E Weinstein, P Nguyen, Z Chen, ...
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
1062018
Specaugment on large scale datasets
DS Park, Y Zhang, CC Chiu, Y Chen, B Li, W Chan, QV Le, Y Wu
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
972020
Towards Fast and Accurate Streaming End-To-End ASR
B Li, S Chang, TN Sainath, R Pang, Y He, T Strohman, Y Wu
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
912020
The NUS sung and spoken lyrics corpus: A quantitative comparison of singing and speech
Z Duan, H Fang, B Li, KC Sim, Y Wang
2013 Asia-Pacific Signal and Information Processing Association Annual …, 2013
752013
Multi-Language Multi-Speaker Acoustic Modeling for LSTM-RNN based Statistical Parametric Speech Synthesis
B Li, H Zen
INTERSPEECH, 2016
732016
Modeling Time-Frequency Patterns with LSTM vs. Convolutional Architectures for LVCSR Tasks
TN Sainath, B Li
INTERSPEECH, 2016
642016
Sustav trenutno ne može provesti ovu radnju. Pokušajte ponovo kasnije.
Članci 1–20