Prati
James Qin
James Qin
Potvrđena adresa e-pošte na google.com
Naslov
Citirano
Citirano
Godina
Conformer: Convolution-augmented transformer for speech recognition
A Gulati, J Qin, CC Chiu, N Parmar, Y Zhang, J Yu, W Han, S Wang, ...
arXiv preprint arXiv:2005.08100, 2020
23942020
Lamda: Language models for dialog applications
R Thoppilan, D De Freitas, J Hall, N Shazeer, A Kulshreshtha, HT Cheng, ...
arXiv preprint arXiv:2201.08239, 2022
9442022
Pushing the limits of semi-supervised learning for automatic speech recognition
Y Zhang, J Qin, DS Park, W Han, CC Chiu, R Pang, QV Le, Y Wu
arXiv preprint arXiv:2010.10504, 2020
3062020
W2v-bert: Combining contrastive learning and masked language modeling for self-supervised speech pre-training
YA Chung, Y Zhang, W Han, CC Chiu, J Qin, R Pang, Y Wu
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
2632021
Contextnet: Improving convolutional neural networks for automatic speech recognition with global context
W Han, Z Zhang, Y Zhang, J Yu, CC Chiu, J Qin, A Gulati, R Pang, Y Wu
arXiv preprint arXiv:2005.03191, 2020
2552020
Vector-quantized image modeling with improved vqgan
J Yu, X Li, JY Koh, H Zhang, R Pang, J Qin, A Ku, Y Xu, J Baldridge, Y Wu
arXiv preprint arXiv:2110.04627, 2021
2032021
Lingvo: a modular and scalable framework for sequence-to-sequence modeling
J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ...
arXiv preprint arXiv:1902.08295, 2019
1942019
Gemini: a family of highly capable multimodal models
G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ...
arXiv preprint arXiv:2312.11805, 2023
1812023
Bigssl: Exploring the frontier of large-scale semi-supervised learning for automatic speech recognition
Y Zhang, DS Park, W Han, J Qin, A Gulati, J Shor, A Jansen, Y Xu, ...
IEEE Journal of Selected Topics in Signal Processing 16 (6), 1519-1532, 2022
1282022
A better and faster end-to-end model for streaming asr
B Li, A Gulati, J Yu, TN Sainath, CC Chiu, A Narayanan, SY Chang, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
1082021
Google usm: Scaling automatic speech recognition beyond 100 languages
Y Zhang, W Han, J Qin, Y Wang, A Bapna, Z Chen, N Chen, B Li, ...
arXiv preprint arXiv:2303.01037, 2023
992023
Self-supervised learning with random-projection quantizer for speech recognition
CC Chiu, J Qin, Y Zhang, J Yu, Y Wu
International Conference on Machine Learning, 3915-3924, 2022
892022
Renelito Delos Santos
R Thoppilan, D De Freitas, J Hall, N Shazeer, A Kulshreshtha, HT Cheng, ...
882022
Scaling end-to-end models for large-scale multilingual asr
B Li, R Pang, TN Sainath, A Gulati, Y Zhang, J Qin, P Haghani, WR Huang, ...
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
602021
AudioPaLM: A Large Language Model That Can Speak and Listen
PK Rubenstein, C Asawaroengchai, DD Nguyen, A Bapna, Z Borsos, ...
arXiv preprint arXiv:2306.12925, 2023
562023
Conformer: Convolution-augmented transformer for speech recognition. arXiv 2020
A Gulati, J Qin, CC Chiu, N Parmar, Y Zhang, J Yu, W Han, S Wang, ...
arXiv preprint arXiv:2005.08100, 2020
452020
An efficient streaming non-recurrent on-device end-to-end model with improvements to rare-word modeling
TN Sainath, YR He, A Narayanan, R Botros, R Pang, DJ Rybach, ...
382021
Improving the latency and quality of cascaded encoders
TN Sainath, Y He, A Narayanan, R Botros, W Wang, D Qiu, CC Chiu, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
242022
Parallel rescoring with transformer for streaming on-device speech recognition
W Li, J Qin, CC Chiu, R Pang, Y He
arXiv preprint arXiv:2008.13093, 2020
162020
Efficient Adapters for Giant Speech Models
N Chen, I Shafran, Y Zhang, CC Chiu, H Soltau, J Qin, Y Wu
arXiv preprint arXiv:2306.08131, 2023
22023
Sustav trenutno ne može provesti ovu radnju. Pokušajte ponovo kasnije.
Članci 1–20