Yong Zhao
Title
Cited by
Cited by
Year
Front-end architecture for a multi-lingual text-to-speech system
M Chu, H Peng, Y Zhao
US Patent 7,496,498, 2009
3362009
Refining of segmental boundaries in speech waveforms using contextual-dependent models
Y Zhao, M Chu, J Zhou, L Wang
US Patent 7,496,512, 2009
2882009
Providing personalized voice font for text-to-speech applications
M Chu, Y Zhao, S Zhao
US Patent 7,693,719, 2010
2752010
Speech unit selection using HMM acoustic models
M Chu, P Liu, Y Zhao, Y Li
US Patent App. 11/508,093, 2008
2192008
Unnatural prosody detection in speech synthesis
Y Zhao, FKP Soong, M Chu, L Wang
US Patent 8,583,438, 2013
2132013
Unnatural prosody detection in speech synthesis
Y Zhao, FKP Soong, M Chu, L Wang
US Patent 8,583,438, 2013
2132013
Defining atom units between phone and syllable for TTS systems
M Chu, Y Zhao
US Patent 7,418,389, 2008
1972008
End-to-end attention based text-dependent speaker verification
SX Zhang, Z Chen, Y Zhao, J Li, Y Gong
2016 IEEE Spoken Language Technology Workshop (SLT), 171-178, 2016
1722016
Optimization of an objective measure for estimating mean opinion score of synthesized speech
M Chu, H Peng, Y Zhao
US Patent 7,386,451, 2008
1632008
Speaker-invariant training via adversarial learning
Z Meng, J Li, Z Chen, Y Zhao, V Mazalov, Y Gong, BH Juang
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
862018
Microsoft Mulan-a bilingual TTS system
M Chu, H Peng, Y Zhao, Z Niu, E Chang
2003 IEEE International Conference on Acoustics, Speech, and Signal …, 2003
842003
Advances in online audio-visual meeting transcription
T Yoshioka, I Abramovski, C Aksoylar, Z Chen, M David, D Dimitriadis, ...
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
492019
Refining segmental boundaries for TTS database using fine contextual-dependent boundary models
L Wang, Y Zhao, M Chu, J Zhou, Z Cao
2004 IEEE International Conference on Acoustics, Speech, and Signal …, 2004
492004
Adversarial speaker verification
Z Meng, Y Zhao, J Li, Y Gong
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
482019
Low-rank plus diagonal adaptation for deep neural networks
Y Zhao, J Li, Y Gong
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
442016
Conditional teacher-student learning
Z Meng, J Li, Y Zhao, Y Gong
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
432019
Perpetually optimizing the cost function for unit selection in a TTS system with one single run of MOS evaluation
H Peng, Y Zhao, M Chu
Seventh International Conference on Spoken Language Processing, 2002
432002
INVESTIGATING ONLINE LOW-FOOTPRINT SPEAKER ADAPTATION USING GENERALIZED LINEAR REGRESSION AND CLICK-THROUGH DATA
Y Zhao, J Li, J Xue, Y Gong
372015
Cnn with phonetic attention for text-independent speaker verification
T Zhou, Y Zhao, J Li, Y Gong, J Wu
2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019
362019
Modeling stylized invariance and local variability of prosody in text-to-speech synthesis
M Chu, Y Zhao, E Chang
Speech communication 48 (6), 716-726, 2006
252006
The system can't perform the operation now. Try again later.
Articles 1–20