Xiangang Li

Cited by

	All	Since 2019
Citations	6067	4920
h-index	22	21
i10-index	36	30

Citations per year
Year	Citations
2016	157
2017	319
2018	616
2019	776
2020	896
2021	1013
2022	974
2023	969
2024	289

Public access

9 articles

10 articles

available

not available

Based on funding mandates

Xiangang Li

Baidu, DiDi, Beike

speech recognition, natural language processing

Title	Cited by	Year
Deep speech 2: End-to-end speech recognition in English and Mandarin D Amodei, S Ananthanarayanan, R Anubhai, J Bai, E Battenberg, C Case, ... International conference on machine learning, 173-182, 2016	3590	2016
Deep speaker: an end-to-end neural speaker embedding system C Li, X Ma, B Jiang, X Li, X Zhang, X Liu, Y Cao, A Kannan, Z Zhu arXiv preprint arXiv:1705.02304, 2017	546	2017
Constructing long short-term memory based deep recurrent neural networks for large vocabulary speech recognition X Li, X Wu 2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015	493	2015
GigaSpeech: An evolving, multi-domain ASR corpus with 10,000 hours of transcribed audio G Chen, S Chai, G Wang, J Du, WQ Zhang, C Weng, D Su, D Povey, ... arXiv preprint arXiv:2106.06909, 2021	159	2021
Learning alignment for multimodal emotion recognition from speech H Xu, H Zhang, K Han, Y Wang, Y Peng, X Li arXiv preprint arXiv:1909.05645, 2019	155	2019
Improving transformer-based speech recognition using unsupervised pre-training D Jiang, X Lei, W Li, N Luo, Y Hu, W Zou, X Li arXiv preprint arXiv:1910.09932, 2019	99	2019
Speech SimCLR: Combining contrastive and reconstruction objective for self-supervised speech representation learning D Jiang, W Li, M Cao, W Zou, X Li arXiv preprint arXiv:2010.13991, 2020	70	2020
Gram-CTC: Automatic unit selection and target decomposition for sequence labelling H Liu, Z Zhu, X Li, S Satheesh International Conference on Machine Learning, 2188-2197, 2017	64	2017
OpenChat: Advancing open-source language models with mixed-quality data G Wang, S Cheng, X Zhan, X Li, S Song, Y Liu arXiv preprint arXiv:2309.11235, 2023	59	2023
Towards end-to-end code-switching speech recognition N Luo, D Jiang, S Zhao, C Gong, W Zou, X Li arXiv preprint arXiv:1810.13091, 2018	59	2018
Exploring the impact of instruction data scaling on large language models: An empirical study on real-world use cases Y Ji, Y Deng, Y Gong, Y Peng, Q Niu, L Zhang, B Ma, X Li arXiv preprint arXiv:2303.14742, 2023	57	2023
Implicit discourse relation recognition using neural tensor network with interactive attention and sparse learning F Guo, R He, D Jin, J Dang, L Wang, X Li Proceedings of the 27th International Conference on Computational …, 2018	52	2018
A comparative study on selecting acoustic modeling units in deep neural networks based large vocabulary Chinese speech recognition X Li, Y Yang, Z Pang, X Wu Neurocomputing 170, 251-256, 2015	44	2015
A further study of unsupervised pretraining for transformer based speech recognition D Jiang, W Li, R Zhang, M Cao, N Luo, Y Han, W Zou, K Han, X Li ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	40	2021
On loss functions and recurrency training for GAN-based speech enhancement systems Z Zhang, C Deng, Y Shen, DS Williamson, Y Sha, Y Zhang, H Song, X Li arXiv preprint arXiv:2007.14974, 2020	39	2020
Speech Emotion Recognition by Combining Amplitude and Phase Information Using Convolutional Neural Network. L Guo, L Wang, J Dang, L Zhang, H Guan, X Li INTERSPEECH, 1611-1615, 2018	35	2018
Comparable study of modeling units for end-to-end Mandarin speech recognition W Zou, D Jiang, S Zhao, G Yang, X Li 2018 11th International Symposium on Chinese Spoken Language Processing …, 2018	34	2018
Transformer based unsupervised pre-training for acoustic representation learning R Zhang, H Wu, W Li, D Jiang, W Zou, X Li ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	33	2021
DiDiSpeech: A large scale Mandarin speech corpus T Guo, C Wen, D Jiang, N Luo, R Zhang, S Zhao, W Li, C Gong, W Zou, ... ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	29	2021
Modeling speaker variability using long short-term memory networks for speech recognition. X Li, X Wu Interspeech, 1086-1090, 2015	28	2015
