Very deep convolutional neural networks for noise robust speech recognition Y Qian, M Bi, T Tan, K Yu IEEE/ACM Transactions on Audio, Speech, and Language Processing 24 (12 …, 2016 | 379 | 2016 |
Opencpop: A high-quality open source chinese popular song corpus for singing voice synthesis Y Wang, X Wang, P Zhu, J Wu, H Li, H Xue, Y Zhang, L Xie, M Bi arXiv preprint arXiv:2201.07429, 2022 | 61 | 2022 |
Visinger: Variational inference with adversarial learning for end-to-end singing voice synthesis Y Zhang, J Cong, H Xue, L Xie, P Zhu, M Bi ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 53 | 2022 |
Very deep convolutional neural networks for LVCSR. M Bi, Y Qian, K Yu Interspeech, 3259-3263, 2015 | 51 | 2015 |
Deep feed-forward sequential memory networks for speech synthesis M Bi, H Lu, S Zhang, M Lei, Z Yan 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018 | 16 | 2018 |
One-shot voice conversion for style transfer based on speaker adaptation Z Wang, Q Xie, T Li, H Du, L Xie, P Zhu, M Bi ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 13 | 2022 |
Learn2sing 2.0: Diffusion and mutual information-based target speaker SVS by learning from singing teacher H Xue, X Wang, Y Zhang, L Xie, P Zhu, M Bi arXiv preprint arXiv:2203.16408, 2022 | 8 | 2022 |
Expressive-vc: Highly expressive voice conversion with attention fusion of bottleneck and perturbation features Z Ning, Q Xie, P Zhu, Z Wang, L Xue, J Yao, L Xie, M Bi ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 7 | 2023 |
Dualvc: Dual-mode voice conversion using intra-model knowledge distillation and hybrid predictive coding Z Ning, Y Jiang, P Zhu, J Yao, S Wang, L Xie, M Bi arXiv preprint arXiv:2305.12425, 2023 | 2 | 2023 |
Dualvc 2: Dynamic masked convolution for unified streaming and non-streaming voice conversion Z Ning, Y Jiang, P Zhu, S Wang, J Yao, L Xie, M Bi ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 1 | 2024 |
Preconditioned Nonlinear Conjugate Gradient Method for Real-time Interior-point Hyperelasticity X Shen, R Cai, M Bi, T Lv arXiv preprint arXiv:2405.08001, 2024 | | 2024 |
EDTalk: Efficient Disentanglement for Emotional Talking Head Synthesis S Tan, B Ji, M Bi, Y Pan arXiv preprint arXiv:2404.01647, 2024 | | 2024 |
Multi-GradSpeech: Towards Diffusion-based Multi-Speaker Text-to-speech Using Consistent Diffusion Models H Xue, S Guo, P Zhu, M Bi arXiv preprint arXiv:2308.10428, 2023 | | 2023 |