Follow
Rui Liu (刘 瑞)
Rui Liu (刘 瑞)
Verified email at mail.imu.edu.cn - Homepage
Title
Cited by
Cited by
Year
Seen and unseen emotional style transfer for voice conversion with a new emotional speech dataset
K Zhou, B Sisman, R Liu, H Li
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
1552021
Emotional voice conversion: Theory, databases and ESD
K Zhou, B Sisman, R Liu, H Li
Speech Communication 137, 1-18, 2022
1082022
Expressive TTS training with frame and style reconstruction loss
R Liu, B Sisman, G Gao, H Li
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 1806-1818, 2021
782021
Teacher-student training for robust tacotron-based tts
R Liu, B Sisman, J Li, F Bao, G Gao, H Li
ICASSP 2020-2020 IEEE international conference on acoustics, speech and …, 2020
642020
Reinforcement learning for emotional text-to-speech synthesis with improved emotion discriminability
R Liu, B Sisman, H Li
arXiv preprint arXiv:2104.01408, 2021
342021
GraphSpeech: Syntax-Aware Graph Attention Network For Neural Speech Synthesis
R Liu, B Sisman, H Li
IEEE ICASSP 2021. IEEE International Conference on Acoustics, Speech and …, 2021
342021
Mongolian text-to-speech system based on deep neural network
R Liu, F Bao, G Gao, Y Wang
Man-Machine Speech Communication: 14th National Conference, NCMMSC 2017 …, 2018
302018
Exploiting Morphological and Phonological Features to Improve Prosodic Phrasing for Mongolian Speech Synthesis
R Liu, B Sisman, F Bao, J Yang, G Gao, H Li
IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 274-285, 2021
272021
Improving Mongolian Phrase Break Prediction by Using Syllable and Morphological Embeddings with BiLSTM Model.
R Liu, F Bao, G Gao, H Zhang, Y Wang
Interspeech, 57-61, 2018
222018
Modeling prosodic phrasing with multi-task learning in tacotron-based TTS
R Liu, B Sisman, F Bao, G Gao, H Li
IEEE Signal Processing Letters 27, 1470-1474, 2020
202020
Wavetts: Tacotron-based tts with joint time-frequency domain loss
R Liu, B Sisman, F Bao, G Gao, H Li
arXiv preprint arXiv:2002.00417, 2020
162020
Fasttalker: A neural text-to-speech architecture with shallow and group autoregression
R Liu, B Sisman, Y Lin, H Li
Neural Networks 141, 306-314, 2021
142021
Exploiting modality-invariant feature for robust multimodal emotion recognition with missing modalities
H Zuo, R Liu, J Zhao, G Gao, H Li
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
132023
Decoding knowledge transfer for neural text-to-speech training
R Liu, B Sisman, G Gao, H Li
IEEE/ACM Transactions on Audio, Speech, and Language Processing 30, 1789-1802, 2022
122022
End-to-end mongolian text-to-speech system
J Li, H Zhang, R Liu, X Zhang, F Bao
2018 11th international symposium on chinese spoken language processing …, 2018
112018
Multistage deep transfer learning for emIoT-enabled human–computer interaction
R Liu, Q Liu, H Zhu, H Cao
IEEE Internet of Things Journal 9 (16), 15128-15137, 2022
102022
A lstm approach with sub-word embeddings for mongolian phrase break prediction
R Liu, F Bao, G Gao, H Zhang, Y Wang
Proceedings of the 27th International Conference on Computational …, 2018
102018
Visualtts: Tts with accurate lip-speech synchronization for automatic voice over
J Lu, B Sisman, R Liu, M Zhang, H Li
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
92022
Decoupling speaker-independent emotions for voice conversion via source-filter networks
Z Luo, S Lin, R Liu, J Baba, Y Yoshikawa, H Ishiguro
IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 11-24, 2022
82022
Accurate emotion strength assessment for seen and unseen speech based on data-driven deep learning
R Liu, B Sisman, B Schuller, G Gao, H Li
arXiv preprint arXiv:2206.07229, 2022
82022
The system can't perform the operation now. Try again later.
Articles 1–20