Shaojin Ding
Abd-net: Attentive but diverse person re-identification
T Chen, S Ding, J Xie, Y Yuan, W Chen, Y Yang, Z Ren, Z Wang
Proceedings of the IEEE/CVF international conference on computer vision …, 2019
Personal VAD: Speaker-conditioned voice activity detection
S Ding, Q Wang, S Chang, L Wan, IL Moreno
arXiv preprint arXiv:1908.04284, 2019
Golden speaker builder–An interactive tool for pronunciation training
S Ding, C Liberatore, S Sonsaat, I Lučić, A Silpachai, G Zhao, ...
Speech Communication 115, 51-66, 2019
Foreign Accent Conversion by Synthesizing Speech from Phonetic Posteriorgrams
G Zhao, S Ding, R Gutierrez-Osuna
Interspeech, 2843-2847, 2019
Autospeech: Neural architecture search for speaker recognition
S Ding, T Chen, X Gong, W Zha, Z Wang
arXiv preprint arXiv:2005.03215, 2020
Group Latent Embedding for Vector Quantized Variational Autoencoder in Non-Parallel Voice Conversion
S Ding, R Gutierrez-Osuna
Interspeech, 724-728, 2019
4-bit conformer with native quantization aware training for speech recognition
S Ding, P Meadowlark, Y He, L Lew, S Agrawal, O Rybakov
arXiv preprint arXiv:2203.15952, 2022
Audio lottery: Speech recognition made ultra-lightweight, noise-robust, and transferable
S Ding, T Chen, Z Wang
International Conference on Learning Representations, 2022
Converting foreign accent speech without a reference
G Zhao, S Ding, R Gutierrez-Osuna
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 2367-2381, 2021
Accentron: Foreign accent conversion to arbitrary non-native speakers using zero-shot learning
S Ding, G Zhao, R Gutierrez-Osuna
Computer Speech & Language 72, 101302, 2022
Personal VAD 2.0: Optimizing personal voice activity detection for on-device speech recognition
S Ding, R Rikhye, Q Liang, Y He, Q Wang, A Narayanan, T O'Malley, ...
arXiv preprint arXiv:2204.03793, 2022
Improving the Speaker Identity of Non-Parallel Many-to-Many Voice Conversion with Adversarial Speaker Recognition
S Ding, G Zhao, R Gutierrez-Osuna
Interspeech, 776-780, 2020
A unified cascaded encoder ASR model for dynamic model sizes
S Ding, W Wang, D Zhao, TN Sainath, Y He, R David, R Botros, X Wang, ...
arXiv preprint arXiv:2204.06164, 2022
Learning structured sparse representations for voice conversion
S Ding, G Zhao, C Liberatore, R Gutierrez-Osuna
IEEE/ACM Transactions on Audio, Speech, and Language Processing 28, 343-354, 2019
Sharing low rank conformer weights for tiny always-on ambient speech recognition models
SM Hernandez, D Zhao, S Ding, A Bruguier, R Prabhavalkar, TN Sainath, ...
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
2-bit conformer quantization for automatic speech recognition
O Rybakov, P Meadowlark, S Ding, D Qiu, J Li, D Rim, Y He
arXiv preprint arXiv:2305.16619, 2023
Textual echo cancellation
S Ding, Y Jia, K Hu, Q Wang
2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021
Learning Structured Dictionaries for Exemplar-based Voice Conversion
S Ding, C Liberatore, R Gutierrez-Osuna
Interspeech, 481-485, 2018
Golden Speaker Builder: an interactive online tool for L2 learners to build pronunciation models
S Ding, C Liberatore, G Zhao, S Sonsaat, E Chukharev-Hudilainen, ...
Pronunciation in Second Language Learning & Teaching (PSLLT) 9th Annual …, 2017
Multi-output RNN-T joint networks for multi-task learning of ASR and auxiliary tasks
W Wang, D Zhao, S Ding, H Zhang, SY Chang, D Rybach, TN Sainath, ...
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023