Prati
Shuo-yiin Chang
Shuo-yiin Chang
Senior Staff Research Scientist, Google
Potvrđena adresa e-pošte na google.com
Naslov
Citirano
Citirano
Godina
Deep learning for audio signal processing
H Purwins, B Li, T Virtanen, J Schlüter, SY Chang, T Sainath
IEEE Journal of Selected Topics in Signal Processing 13 (2), 206-219, 2019
6992019
Streaming end-to-end speech recognition for mobile devices
Y He, TN Sainath, R Prabhavalkar, I McGraw, R Alvarez, D Zhao, ...
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
6732019
A streaming on-device end-to-end model surpassing server-side conventional model quality and latency
TN Sainath, Y He, B Li, A Narayanan, R Pang, A Bruguier, S Chang, W Li, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
2152020
Gemini: a family of highly capable multimodal models
G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ...
arXiv preprint arXiv:2312.11805, 2023
1412023
Towards fast and accurate streaming end-to-end ASR
B Li, S Chang, TN Sainath, R Pang, Y He, T Strohman, Y Wu
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
1242020
A better and faster end-to-end model for streaming asr
B Li, A Gulati, J Yu, TN Sainath, CC Chiu, A Narayanan, SY Chang, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
1072021
Robust CNN-based speech recognition with Gabor filter kernels
SY Chang, N Morgan
Fifteenth annual conference of the international speech communication …, 2014
972014
Fastemit: Low-latency streaming asr with sequence-level emission regularization
J Yu, CC Chiu, B Li, S Chang, TN Sainath, Y He, A Narayanan, W Han, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
872021
Temporal modeling using dilated convolution and gating for voice-activity-detection
SY Chang, B Li, G Simko, TN Sainath, A Tripathi, A van den Oord, ...
2018 IEEE international conference on acoustics, speech and signal …, 2018
782018
Personal VAD: Speaker-conditioned voice activity detection
S Ding, Q Wang, S Chang, L Wan, IL Moreno
arXiv preprint arXiv:1908.04284, 2019
752019
Improved End-of-Query Detection for Streaming Speech Recognition.
M Shannon, G Simko, SY Chang, C Parada
Interspeech, 1909-1913, 2017
472017
Joint endpointing and decoding with end-to-end models
SY Chang, R Prabhavalkar, Y He, TN Sainath, G Simko
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
402019
An efficient streaming non-recurrent on-device end-to-end model with improvements to rare-word modeling
TN Sainath, YR He, A Narayanan, R Botros, R Pang, DJ Rybach, ...
382021
Endpoint Detection Using Grid Long Short-Term Memory Networks for Streaming Speech Recognition.
SY Chang, B Li, TN Sainath, G Simko, C Parada
Interspeech, 3812-3816, 2017
352017
Endpoint Detection Using Grid Long Short-Term Memory Networks for Streaming Speech Recognition.
SY Chang, B Li, TN Sainath, G Simko, C Parada
Interspeech, 3812-3816, 2017
352017
Improving the latency and quality of cascaded encoders
TN Sainath, Y He, A Narayanan, R Botros, W Wang, D Qiu, CC Chiu, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
242022
Spectro-temporal features for noise-robust speech recognition using power-law nonlinearity and power-bias subtraction
SY Chang, BT Meyer, N Morgan
2013 IEEE international conference on acoustics, speech and signal …, 2013
212013
The blame game in meeting room ASR: An analysis of feature versus model errors in noisy and mismatched conditions
SHK Parthasarathi, SY Chang, J Cohen, N Morgan, S Wegmann
2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013
202013
E2e segmenter: Joint segmenting and decoding for long-form asr
WR Huang, S Chang, D Rybach, R Prabhavalkar, TN Sainath, C Allauzen, ...
arXiv preprint arXiv:2204.10749, 2022
182022
Streaming end-to-end multilingual speech recognition with joint language identification
C Zhang, B Li, T Sainath, T Strohman, S Mavandadi, S Chang, P Haghani
arXiv preprint arXiv:2209.06058, 2022
172022
Sustav trenutno ne može provesti ovu radnju. Pokušajte ponovo kasnije.
Članci 1–20