Dan Su

Cited by

	All	Since 2019
Citations	3034	2979
h-index	31	31
i10-index	66	65

880

440

220

660

20182019202020212022202320249 125 245 508 657 879 550

Public access

View all

26 articles

4 articles

available

not available

Based on funding mandates

Co-authors

Dong Yu (俞栋)Distinguished Scientist @ Tencent AI Lab, ACM/IEEE/ISCA FellowVerified email at global.tencent.com
Meng YUTencent AI LabVerified email at tencent.com
Jun WangPeking UniversityVerified email at tencent.com
Lianwu CHENKuaishou TechnologyVerified email at kuaishou.com
Shiyin KangXVerse Inc.Verified email at xverse.cn
Zhiyong WU (吴志勇)Associate Professor, Tsinghua UniversityVerified email at sz.tsinghua.edu.cn
Lei XieNorthwestern Polytechnical UniversityVerified email at nwpu.edu.cn
Xunying LiuChinese University of Hong KongVerified email at se.cuhk.edu.hk
Guangsen WangTencent AI LabVerified email at tencent.com
Shan YangTencent AI LabVerified email at nwpu-aslp.org
Yong XuPrincipal Researcher, Tencent America, Bellevue, USAVerified email at tencent.com
Shi-Xiong (Austin) ZhangSr. Director | AI Foundations@Capital One | ex-Microsoft, ex-Tencent, Cambridge PhDVerified email at capitalone.com
Rongzhi GuTencent AI LabVerified email at pku.edu.cn
Yuexian ZouPeking University Shenzhen Graduate SchoolVerified email at pku.edu.cn
Jia CuiTencentVerified email at tencent.com
Xihong WuPeking UniversityVerified email at cis.pku.edu.cn
Chao Weng
Songxiang LiuPhD. from CUHK
Max W. Y. LamIndependent Researcher

Dan Su

Tencent AI Lab

Verified email at tencent.com

speech recognition speech synthesis speaker recognition


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Gigaspeech: An evolving, multi-domain asr corpus with 10,000 hours of transcribed audio G Chen, S Chai, G Wang, J Du, WQ Zhang, C Weng, D Su, D Povey, ... arXiv preprint arXiv:2106.06909, 2021	189	2021
Replay and synthetic speech detection with res2net architecture X Li, N Li, C Weng, X Liu, D Su, D Yu, H Meng ICASSP 2021-2021 IEEE international conference on acoustics, speech and …, 2021	150	2021
Fastdiff: A fast conditional diffusion model for high-quality speech synthesis R Huang, MWY Lam, J Wang, D Su, D Yu, Y Ren, Z Zhao arXiv preprint arXiv:2204.09934, 2022	134	2022
DurIAN: Duration Informed Attention Network for Speech Synthesis. C Yu, H Lu, N Hu, M Yu, C Weng, K Xu, P Liu, D Tuo, S Kang, G Lei, D Su, ... Interspeech, 2027-2031, 2020	104	2020
Deep extractor network for target speaker recovery from single channel speech mixtures J Wang, J Chen, D Su, L Chen, M Yu, Y Qian, D Yu arXiv preprint arXiv:1807.08974, 2018	103	2018
Durian: Duration informed attention network for multimodal synthesis C Yu, H Lu, N Hu, M Yu, C Weng, K Xu, P Liu, D Tuo, S Kang, G Lei, D Su, ... arXiv preprint arXiv:1909.01700, 2019	99	2019
Component fusion: Learning replaceable language model component for end-to-end speech recognition system C Shan, C Weng, G Wang, D Su, M Luo, D Yu, L Xie ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	99	2019
Neural Spatial Filter: Target Speaker Speech Separation Assisted with Directional Information. R Gu, L Chen, SX Zhang, J Zheng, Y Xu, M Yu, D Su, Y Zou, D Yu Interspeech, 4290-4294, 2019	95	2019
Investigating end-to-end speech recognition for mandarin-english code-switching C Shan, C Weng, G Wang, D Su, M Luo, D Yu, L Xie ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	85	2019
BDDM: Bilateral denoising diffusion models for fast and high-quality speech synthesis MWY Lam, J Wang, D Su, D Yu arXiv preprint arXiv:2203.13508, 2022	84	2022
End-to-end multi-channel speech separation R Gu, J Wu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu arXiv preprint arXiv:1905.06286, 2019	80	2019
Deep Discriminative Embeddings for Duration Robust Speaker Verification. N Li, D Tuo, D Su, Z Li, D Yu, A Tencent Interspeech, 2262-2266, 2018	79	2018
Enhancing end-to-end multi-channel speech separation via spatial feature learning R Gu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	62	2020
Improving Attention Based Sequence-to-Sequence Models for End-to-End English Conversational Speech Recognition. C Weng, J Cui, G Wang, J Wang, C Yu, D Su, D Yu Interspeech, 761-765, 2018	61	2018
Speech-XLNet: Unsupervised acoustic model pretraining for self-attention networks X Song, G Wang, Z Wu, Y Huang, D Su, D Yu, H Meng arXiv preprint arXiv:1910.10387, 2019	56	2019
Mm-llms: Recent advances in multimodal large language models D Zhang, Y Yu, C Li, J Dong, D Su, C Chu, D Yu arXiv preprint arXiv:2401.13601, 2024	54	2024
Diffgan-tts: High-fidelity and efficient text-to-speech with denoising diffusion gans S Liu, D Su, D Yu arXiv preprint arXiv:2201.11972, 2022	50	2022
Simple attention module based speaker verification with iterative noisy label detection X Qin, N Li, C Weng, D Su, M Li ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022	49	2022
Diffsvc: A diffusion probabilistic model for singing voice conversion S Liu, Y Cao, D Su, H Meng 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021	46	2021
Sandglasset: A light multi-granularity self-attentive network for time-domain speech separation MWY Lam, J Wang, D Su, D Yu ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021	45	2021

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors