Follow
Xueyuan Chen
Title
Cited by
Cited by
Year
A Character-level Span-based Model for Mandarin Prosodic Structure Prediction
X Chen, C Song, Y Zhou, Z Wu, C Chen, Z Wu, H Meng
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
92022
Unsupervised Multi-scale Expressive Speaking Style Modeling with Hierarchical Context Information for Audiobook Speech Synthesis
X Chen, S Lei, Z Wu, D Xu, W Zhao, H Meng
Proc. COLING 2022, 7193-7202, 2022
72022
StyleSpeech: Self-supervised Style Enhancing with VQ-VAE-based Pre-training for Expressive Audiobook Speech Synthesis
X Chen, X Wang, S Zhang, L He, Z Wu, X Wu, H Meng
ICASSP 2024, 2024
42024
HILvoice: Human-in-the-Loop Style Selection for Elder-Facing Speech Synthesis
X Chen, Q Huang, X Wu, Z Wu, H Meng
ISCSLP 2022, 86-90, 2022
42022
Exploiting Audio-Visual Features with Pretrained AV-HuBERT for Multi-Modal Dysarthric Speech Reconstruction
X Chen, Y Wang, X Wu, D Wang, Z Wu, X Liu, H Meng
ICASSP 2024, 2024
32024
SimpleSpeech: Towards Simple and Efficient Text-to-Speech with Scalar Latent Transformer Diffusion Models
D Yang, D Wang, H Guo, X Chen, X Wu, H Meng
Proc. Interspeech 2024, 2024
2024
CoLM-DSR: Leveraging Neural Codec Language Modeling for Multi-Modal Dysarthric Speech Reconstruction
X Chen, D Yang, D Wang, X Wu, Z Wu, H Meng
Proc. Interspeech 2024, 2024
2024
Target Speech Extraction with Pre-trained AV-HuBERT and Mask-And-Recover Strategy
W Wu, X Chen, X Wu, H Li, H Meng
Proc. IJCNN 2024, 2024
2024
AVHuMAR: Audio-Visual Target Speech Extraction with Pre-trained AV-HuBERT and Mask-And-Recover Strategy
W Wu, X Chen, X Wu, H Li, H Meng
CVPR 2024 Sight and Sound Workshop, 2024
2024
The system can't perform the operation now. Try again later.
Articles 1–9