Deep Speech 2: End-to-end speech recognition in English and Mandarin D Amodei, S Ananthanarayanan, R Anubhai, J Bai, E Battenberg, C Case, ... International Conference on Machine Learning, 173-182, 2016 | 3347 | 2016 |
Deep Speaker: an end-to-end neural speaker embedding system C Li, X Ma, B Jiang, X Li, X Zhang, X Liu, Y Cao, A Kannan, Z Zhu arXiv preprint arXiv:1705.02304, 2017 | 523 | 2017 |
Constructing long short-term memory based deep recurrent neural networks for large vocabulary speech recognition X Li, X Wu 2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015 | 462 | 2015 |
Learning alignment for multimodal emotion recognition from speech H Xu, H Zhang, K Han, Y Wang, Y Peng, X Li arXiv preprint arXiv:1909.05645, 2019 | 132 | 2019 |
GigaSpeech: An evolving, multi-domain ASR corpus with 10,000 hours of transcribed audio G Chen, S Chai, G Wang, J Du, WQ Zhang, C Weng, D Su, D Povey, ... arXiv preprint arXiv:2106.06909, 2021 | 118 | 2021 |
Improving transformer-based speech recognition using unsupervised pre-training D Jiang, X Lei, W Li, N Luo, Y Hu, W Zou, X Li arXiv preprint arXiv:1910.09932, 2019 | 85 | 2019 |
Gram-CTC: Automatic unit selection and target decomposition for sequence labelling H Liu, Z Zhu, X Li, S Satheesh International Conference on Machine Learning, 2188-2197, 2017 | 62 | 2017 |
Speech SimCLR: Combining contrastive and reconstruction objective for self-supervised speech representation learning D Jiang, W Li, M Cao, W Zou, X Li arXiv preprint arXiv:2010.13991, 2020 | 59 | 2020 |
Implicit discourse relation recognition using neural tensor network with interactive attention and sparse learning F Guo, R He, D Jin, J Dang, L Wang, X Li Proceedings of the 27th International Conference on Computational …, 2018 | 51 | 2018 |
Towards end-to-end code-switching speech recognition N Luo, D Jiang, S Zhao, C Gong, W Zou, X Li arXiv preprint arXiv:1810.13091, 2018 | 47 | 2018 |
A comparative study on selecting acoustic modeling units in deep neural networks based large vocabulary Chinese speech recognition X Li, Y Yang, Z Pang, X Wu Neurocomputing 170, 251-256, 2015 | 44 | 2015 |
On loss functions and recurrency training for GAN-based speech enhancement systems Z Zhang, C Deng, Y Shen, DS Williamson, Y Sha, Y Zhang, H Song, X Li arXiv preprint arXiv:2007.14974, 2020 | 35 | 2020 |
A further study of unsupervised pretraining for transformer based speech recognition D Jiang, W Li, R Zhang, M Cao, N Luo, Y Han, W Zou, K Han, X Li ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 34 | 2021 |
Speech Emotion Recognition by Combining Amplitude and Phase Information Using Convolutional Neural Network. L Guo, L Wang, J Dang, L Zhang, H Guan, X Li INTERSPEECH, 1611-1615, 2018 | 32 | 2018 |
Comparable study of modeling units for end-to-end Mandarin speech recognition W Zou, D Jiang, S Zhao, G Yang, X Li 2018 11th International Symposium on Chinese Spoken Language Processing …, 2018 | 31 | 2018 |
Transformer based unsupervised pre-training for acoustic representation learning R Zhang, H Wu, W Li, D Jiang, W Zou, X Li ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 28 | 2021 |
DiDiSpeech: A large scale Mandarin speech corpus T Guo, C Wen, D Jiang, N Luo, R Zhang, S Zhao, W Li, C Gong, W Zou, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 26 | 2021 |
Modeling speaker variability using long short-term memory networks for speech recognition X Li, X Wu Sixteenth Annual Conference of the International Speech Communication …, 2015 | 25 | 2015 |
Exploring the impact of instruction data scaling on large language models: An empirical study on real-world use cases Y Ji, Y Deng, Y Gong, Y Peng, Q Niu, L Zhang, B Ma, X Li arXiv preprint arXiv:2303.14742, 2023 | 21 | 2023 |
Replay attack detection using magnitude and phase information with attention-based adaptive filters M Liu, L Wang, J Dang, S Nakagawa, H Guan, X Li ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 21 | 2019 |