Xiangang Li
Xiangang Li
Baidu, DiDi, Beike
No verified email
Cited by
Cited by
Deep speech 2: End-to-end speech recognition in english and mandarin
D Amodei, S Ananthanarayanan, R Anubhai, J Bai, E Battenberg, C Case, ...
International conference on machine learning, 173-182, 2016
Deep speaker: an end-to-end neural speaker embedding system
C Li, X Ma, B Jiang, X Li, X Zhang, X Liu, Y Cao, A Kannan, Z Zhu
arXiv preprint arXiv:1705.02304, 2017
Constructing long short-term memory based deep recurrent neural networks for large vocabulary speech recognition
X Li, X Wu
2015 ieee international conference on acoustics, speech and signal …, 2015
Learning alignment for multimodal emotion recognition from speech
H Xu, H Zhang, K Han, Y Wang, Y Peng, X Li
arXiv preprint arXiv:1909.05645, 2019
Gigaspeech: An evolving, multi-domain asr corpus with 10,000 hours of transcribed audio
G Chen, S Chai, G Wang, J Du, WQ Zhang, C Weng, D Su, D Povey, ...
arXiv preprint arXiv:2106.06909, 2021
Improving transformer-based speech recognition using unsupervised pre-training
D Jiang, X Lei, W Li, N Luo, Y Hu, W Zou, X Li
arXiv preprint arXiv:1910.09932, 2019
Gram-CTC: Automatic unit selection and target decomposition for sequence labelling
H Liu, Z Zhu, X Li, S Satheesh
International Conference on Machine Learning, 2188-2197, 2017
Speech simclr: Combining contrastive and reconstruction objective for self-supervised speech representation learning
D Jiang, W Li, M Cao, W Zou, X Li
arXiv preprint arXiv:2010.13991, 2020
Implicit discourse relation recognition using neural tensor network with interactive attention and sparse learning
F Guo, R He, D Jin, J Dang, L Wang, X Li
Proceedings of the 27th International Conference on Computational …, 2018
Towards end-to-end code-switching speech recognition
N Luo, D Jiang, S Zhao, C Gong, W Zou, X Li
arXiv preprint arXiv:1810.13091, 2018
A comparative study on selecting acoustic modeling units in deep neural networks based large vocabulary Chinese speech recognition
X Li, Y Yang, Z Pang, X Wu
Neurocomputing 170, 251-256, 2015
On loss functions and recurrency training for GAN-based speech enhancement systems
Z Zhang, C Deng, Y Shen, DS Williamson, Y Sha, Y Zhang, H Song, X Li
arXiv preprint arXiv:2007.14974, 2020
A further study of unsupervised pretraining for transformer based speech recognition
D Jiang, W Li, R Zhang, M Cao, N Luo, Y Han, W Zou, K Han, X Li
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Speech Emotion Recognition by Combining Amplitude and Phase Information Using Convolutional Neural Network.
L Guo, L Wang, J Dang, L Zhang, H Guan, X Li
INTERSPEECH, 1611-1615, 2018
Comparable study of modeling units for end-to-end mandarin speech recognition
W Zou, D Jiang, S Zhao, G Yang, X Li
2018 11th International Symposium on Chinese Spoken Language Processing …, 2018
Transformer based unsupervised pre-training for acoustic representation learning
R Zhang, H Wu, W Li, D Jiang, W Zou, X Li
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Didispeech: A large scale mandarin speech corpus
T Guo, C Wen, D Jiang, N Luo, R Zhang, S Zhao, W Li, C Gong, W Zou, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Modeling speaker variability using long short-term memory networks for speech recognition
X Li, X Wu
Sixteenth Annual Conference of the International Speech Communication …, 2015
Exploring the impact of instruction data scaling on large language models: An empirical study on real-world use cases
Y Ji, Y Deng, Y Gong, Y Peng, Q Niu, L Zhang, B Ma, X Li
arXiv preprint arXiv:2303.14742, 2023
Replay attack detection using magnitude and phase information with attention-based adaptive filters
M Liu, L Wang, J Dang, S Nakagawa, H Guan, X Li
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
The system can't perform the operation now. Try again later.
Articles 1–20