Hifi-gan: Generative adversarial networks for efficient and high fidelity speech synthesis J Kong, J Kim, J Bae Advances in neural information processing systems 33, 17022-17033, 2020 | 1393 | 2020 |
Conditional variational autoencoder with adversarial learning for end-to-end text-to-speech J Kim, J Kong, J Son International Conference on Machine Learning, 5530-5540, 2021 | 552 | 2021 |
Glow-tts: A generative flow for text-to-speech via monotonic alignment search J Kim, S Kim, J Kong, S Yoon Advances in Neural Information Processing Systems 33, 8067-8077, 2020 | 406 | 2020 |
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design J Kong, J Park, B Kim, J Kim, D Kong, S Kim arXiv preprint arXiv:2307.16430, 2023 | 5 | 2023 |
Encoding Speaker-Specific Latent Speech Feature for Speech Synthesis J Kong, J Lee, J Kim, B Kim, J Park, D Kong, C Lee, S Kim arXiv preprint arXiv:2311.11745, 2023 | | 2023 |