Emotion controllable speech synthesis using emotion-unlabeled dataset with the assistance of cross-domain speech emotion recognition X Cai, D Dai, Z Wu, X Li, J Li, H Meng ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 72 | 2021 |
Towards Multi-Scale Style Control for Expressive Speech Synthesis X Li, C Song, J Li, Z Wu, J Jia, H Meng Proc. Interspeech 2021, 4673-4677, 2021 | 47 | 2021 |
Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synthesis Y Zhou, C Song, X Li, L Zhang, Z Wu, Y Bian, D Su, H Meng Proc. Interspeech 2022, 2573-2577, 2022 | 18 | 2022 |
Understanding the Teaching Styles by an Attention based Multi-task Cross-media Dimensional Modeling S Zhou, J Jia, Y Yin, X Li, Y Yao, Y Zhang, Z Ye, K Lei, Y Huang, J Shen Proceedings of the 27th ACM International Conference on Multimedia, 1322-1330, 2019 | 9 | 2019 |
CALM: Constrastive Cross-modal Speaking Style Modeling for Expressive Text-to-Speech Synthesis Y Meng, X Li, Z Wu, T Li, Z Sun, X Xiao, C Sun, H Zhan, H Meng Proc. Interspeech 2022, 5533-5537, 2022 | 6 | 2022 |
Towards Cross-speaker Reading Style Transfer on Audiobook Dataset X Li, C Song, X Wei, Z Wu, J Jia, H Meng Proc. Interspeech 2022, 5528-5532, 2022 | 5 | 2022 |
An End-to-End Chinese Text Normalization Model Based on Rule-Guided Flat-Lattice Transformer W Dai, C Song, X Li, Z Wu, H Pan, X Li, H Meng ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 4 | 2022 |
Emotional design for children’s electronic picture book Y Bu, J Jia, X Li, X Lu International Conference on Human-Computer Interaction, 392-403, 2019 | 4 | 2019 |
IcooBook: when the picture book for children encounters aesthetics of interaction Y Bu, J Jia, X Li, S Zhou, X Lu Proceedings of the 26th ACM international conference on Multimedia, 1260-1262, 2018 | 3 | 2018 |
AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data J Yu, H Chen, Y Bian, X Li, Y Luo, J Tian, M Liu, J Jiang, S Wang arXiv preprint arXiv:2309.13905, 2023 | 2 | 2023 |
Diverse and Expressive Speech Prosody Prediction with Denoising Diffusion Probabilistic Model X Li, S Liu, MWY Lam, Z Wu, C Weng, H Meng Proc. Interspeech 2023, 4858-4862, 2023 | 2 | 2023 |