Shan Yang

Cited by

	All	Since 2019
Citations	1061	991
h-index	16	16
i10-index	26	23

300

150

225

2016201720182019202020212022202320247 20 31 49 57 180 293 272 132

Public access

View all

14 articles

7 articles

available

not available

Based on funding mandates

Co-authors

Lei XieNorthwestern Polytechnical UniversityVerified email at nwpu.edu.cn
Dan SuTencent AI LabVerified email at tencent.com
Dong Yu (俞栋)Distinguished Scientist @ Tencent AI Lab, ACM/IEEE/ISCA FellowVerified email at global.tencent.com
Jian CongByteDanceVerified email at mail.nwpu.edu.cn
Haizhou LiThe Chinese University of Hong Kong, Shenzhen (CUHK-Shenzhen), China; NUS, SingaporeVerified email at u.nus.edu
Yuxuan WangByteDanceVerified email at cse.ohio-state.edu
Zhizheng WuChinese University of Hong Kong, Shenzhen, Mel LabVerified email at cuhk.edu.cn
Heng LuAlibaba Tongyi Lab

Shan Yang

Tencent AI Lab

Verified email at nwpu-aslp.org

Speech Synthesis Voice Conversion


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Multi-band melgan: Faster waveform generation for high-quality text-to-speech G Yang, S Yang, K Liu, P Fang, W Chen, L Xie 2021 IEEE Spoken Language Technology Workshop (SLT), 492-498, 2021	236	2021
Controllable emotion transfer for end-to-end speech synthesis T Li, S Yang, L Xue, L Xie 2021 12th International Symposium on Chinese Spoken Language Processing …, 2021	80	2021
A deep bidirectional LSTM approach for video-realistic talking head B Fan, L Xie, S Yang, L Wang, FK Soong Multimedia Tools and Applications 75, 5287-5309, 2016	67	2016
Statistical parametric speech synthesis using generative adversarial networks under a multi-task learning framework S Yang, L Xie, X Chen, X Lou, X Zhu, D Huang, H Li 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2017	63	2017
Msemotts: Multi-scale emotion transfer, prediction, and control for emotional speech synthesis Y Lei, S Yang, X Wang, L Xie IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 853-864, 2022	60	2022
Fine-grained emotion strength transfer, control and prediction for emotional speech synthesis Y Lei, S Yang, L Xie 2021 IEEE Spoken Language Technology Workshop (SLT), 423-430, 2021	60	2021
Controlling emotion strength with relative attribute for end-to-end speech synthesis Z Xiaolian, Y Shan, X Geng, Yang, Lei 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019	53	2019
Pre-alignment guided attention for improving training efficiency and model stability in end-to-end speech synthesis X Zhu, Y Zhang, S Yang, L Xue, L Xie IEEE Access 7, 65955-65964, 2019	38	2019
Accent and speaker disentanglement in many-to-many voice conversion Z Wang, W Ge, X Wang, S Yang, W Gan, H Chen, H Li, L Xie, X Li 2021 12th International Symposium on Chinese Spoken Language Processing …, 2021	32	2021
Data efficient voice cloning from noisy samples with domain adversarial training J Cong, S Yang, L Xie, G Yu, G Wan arXiv preprint arXiv:2008.04265, 2020	32	2020
On the localness modeling for the self-attention based end-to-end speech synthesis S Yang, H Lu, S Kang, L Xue, J Xiao, D Su, L Xie, D Yu Neural Networks 125, 121-130, 2020	31	2020
Controllable context-aware conversational speech synthesis J Cong, S Yang, N Hu, G Li, L Xie, D Su Interspeech, 2021, 4658-4662, 2021	30	2021
Glow-wavegan: Learning speech representations from gan-based variational auto-encoder for high fidelity flow-based speech synthesis J Cong, S Yang, L Xie, D Su Interspeech, 2021, 2021	28	2021
Learning Hierarchical Representations for Expressive Speaking Style in End-to-End Speech Synthesis X An, Y Wang, S Yang, Z Ma, L Xie 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019	21	2019
Enhancing Hybrid Self-attention Structure with Relative-position-aware Bias for Speech Synthesis S Yang, H Lu, S Kang, L Xie, D Yu 2019 IEEE International Conference on Acoustics, Speech and Signal …, 2019	18	2019
Glow-WaveGAN 2: High-quality Zero-shot Text-to-speech Synthesis and Any-to-any Voice Conversion Y Lei, S Yang, J Cong, L Xie, D Su Interspeech, 2022, 2022	16	2022
Improving Mandarin End-to-End Speech Synthesis by Self-Attention and Learnable Gaussian Bias F Yang, S Yang, P Zhu, P Yan, L Xie 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019	16	2019
The USTC system for blizzard challenge 2017 YJ Hu, C Ding, LJ Liu, ZH Ling, LR Dai Proc. Blizzard Challenge Workshop, 2017	16	2017
On the training of dnn-based average voice model for speech synthesis S Yang, Z Wu, L Xie 2016 Asia-Pacific Signal and Information Processing Association Annual …, 2016	16	2016
Clinical genetics testing laboratories have a remarkably low rate of clinically significant discordance when interpreting variants in hereditary cancer syndrome genes RL Nussbaum, S Yang, SE Lincoln J Clin Oncol 35 (11), 1259-1261, 2017	14	2017

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors