Follow
Dan Su
Dan Su
Tencent AI Lab
Verified email at tencent.com
Title
Cited by
Cited by
Year
Durian: Duration informed attention network for multimodal synthesis
C Yu, H Lu, N Hu, M Yu, C Weng, K Xu, P Liu, D Tuo, S Kang, G Lei, D Su, ...
arXiv preprint arXiv:1909.01700, 2019
1912019
Gigaspeech: An evolving, multi-domain asr corpus with 10,000 hours of transcribed audio
G Chen, S Chai, G Wang, J Du, WQ Zhang, C Weng, D Su, D Povey, ...
arXiv preprint arXiv:2106.06909, 2021
1492021
Replay and synthetic speech detection with res2net architecture
X Li, N Li, C Weng, X Liu, D Su, D Yu, H Meng
ICASSP 2021-2021 IEEE international conference on acoustics, speech and …, 2021
1212021
Fastdiff: A fast conditional diffusion model for high-quality speech synthesis
R Huang, MWY Lam, J Wang, D Su, D Yu, Y Ren, Z Zhao
arXiv preprint arXiv:2204.09934, 2022
1062022
Deep extractor network for target speaker recovery from single channel speech mixtures
J Wang, J Chen, D Su, L Chen, M Yu, Y Qian, D Yu
arXiv preprint arXiv:1807.08974, 2018
1032018
Component fusion: Learning replaceable language model component for end-to-end speech recognition system
C Shan, C Weng, G Wang, D Su, M Luo, D Yu, L Xie
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
972019
Neural Spatial Filter: Target Speaker Speech Separation Assisted with Directional Information.
R Gu, L Chen, SX Zhang, J Zheng, Y Xu, M Yu, D Su, Y Zou, D Yu
Interspeech, 4290-4294, 2019
872019
End-to-end multi-channel speech separation
R Gu, J Wu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu
arXiv preprint arXiv:1905.06286, 2019
782019
Investigating end-to-end speech recognition for mandarin-english code-switching
C Shan, C Weng, G Wang, D Su, M Luo, D Yu, L Xie
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
782019
Deep Discriminative Embeddings for Duration Robust Speaker Verification.
N Li, D Tuo, D Su, Z Li, D Yu, A Tencent
Interspeech, 2262-2266, 2018
762018
BDDM: Bilateral denoising diffusion models for fast and high-quality speech synthesis
MWY Lam, J Wang, D Su, D Yu
arXiv preprint arXiv:2203.13508, 2022
702022
Enhancing end-to-end multi-channel speech separation via spatial feature learning
R Gu, SX Zhang, L Chen, Y Xu, M Yu, D Su, Y Zou, D Yu
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
612020
Improving Attention Based Sequence-to-Sequence Models for End-to-End English Conversational Speech Recognition.
C Weng, J Cui, G Wang, J Wang, C Yu, D Su, D Yu
Interspeech, 761-765, 2018
612018
Speech-XLNet: Unsupervised acoustic model pretraining for self-attention networks
X Song, G Wang, Z Wu, Y Huang, D Su, D Yu, H Meng
arXiv preprint arXiv:1910.10387, 2019
552019
Sandglasset: A light multi-granularity self-attentive network for time-domain speech separation
MWY Lam, J Wang, D Su, D Yu
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
462021
Joint training of complex ratio mask based beamformer and acoustic model for noise robust asr
Y Xu, C Weng, L Hui, J Liu, M Yu, D Su, D Yu
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
412019
Investigating robustness of adversarial samples detection for automatic speaker verification
X Li, N Li, J Zhong, X Wu, X Liu, D Su, D Yu, H Meng
arXiv preprint arXiv:2006.06186, 2020
402020
GMM-HMM acoustic model training by a two level procedure with Gaussian components determined by automatic model selection
D Su, X Wu, L Xu
2010 IEEE International Conference on Acoustics, Speech and Signal …, 2010
382010
Simple attention module based speaker verification with iterative noisy label detection
X Qin, N Li, C Weng, D Su, M Li
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
372022
Speechmoe: Scaling to large acoustic models with dynamic routing mixture of experts
Z You, S Feng, D Su, D Yu
arXiv preprint arXiv:2105.03036, 2021
362021
The system can't perform the operation now. Try again later.
Articles 1–20