Yong Zhao

Cited by

	All	Since 2019
Citations	3645	2193
h-index	25	20
i10-index	40	20

560

280

140

420

2005200620072008200920102011201220132014201520162017201820192020202120222023202418 64 44 49 44 38 32 70 85 142 81 150 247 363 506 550 436 391 234 75

Co-authors

Yifan GongPrincipal Science Manager, Microsoft Corp.Verified email at microsoft.com
Jinyu LiPartner Applied Science Manager, MicrosoftVerified email at microsoft.com
Lijuan WangMicrosoft GenAIVerified email at microsoft.com
Shi-Xiong (Austin) ZhangSr. Director | AI Foundations@Capital One | ex-Microsoft, ex-Tencent, Cambridge PhDVerified email at capitalone.com
Kshitiz KumarPrinicipal Scientist Microsoft Corporation; Ph.D. Carnegie Mellon University; B.Tech. IIT KharagpurVerified email at microsoft.com
Zheng-Yu NiuSoftware Engineer at Baidu Inc.Verified email at baidu.com
Xiaodong He (何晓冬)AI Lab, JD.com; IEEE/CAAI FellowVerified email at ieee.org
Diamantino CaseiroGoogle Inc.Verified email at google.com
Qiang FuMicrosoftVerified email at microsoft.com
min chuMicrosoft, Alibaba, AIspeech

Yong Zhao

Microsoft Corporation

Verified email at microsoft.com - Homepage

Speech Recognition Acoustic Modeling Text to Speech Machine Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Front-end architecture for a multi-lingual text-to-speech system M Chu, H Peng, Y Zhao US Patent 7,496,498, 2009	397	2009
Providing personalized voice font for text-to-speech applications M Chu, Y Zhao, S Zhao US Patent 7,693,719, 2010	350	2010
Refining of segmental boundaries in speech waveforms using contextual-dependent models Y Zhao, M Chu, J Zhou, L Wang US Patent 7,496,512, 2009	340	2009
Unnatural prosody detection in speech synthesis Y Zhao, FKP Soong, M Chu, L Wang US Patent 8,583,438, 2013	264	2013
Unnatural prosody detection in speech synthesis Y Zhao, FKP Soong, M Chu, L Wang US Patent 8,583,438, 2013	264	2013
Speech unit selection using HMM acoustic models M Chu, P Liu, Y Zhao, Y Li US Patent App. 11/508,093, 2008	248	2008
Defining atom units between phone and syllable for TTS systems M Chu, Y Zhao US Patent 7,418,389, 2008	225	2008
End-to-end attention based text-dependent speaker verification SX Zhang, Z Chen, Y Zhao, J Li, Y Gong 2016 IEEE Spoken Language Technology Workshop (SLT), 171-178, 2016	201	2016
Optimization of an objective measure for estimating mean opinion score of synthesized speech M Chu, H Peng, Y Zhao US Patent 7,386,451, 2008	179	2008
ResNeXt and Res2Net structures for speaker verification T Zhou, Y Zhao, J Wu 2021 IEEE Spoken Language Technology Workshop (SLT), 301-307, 2021	138	2021
Speaker-invariant training via adversarial learning Z Meng, J Li, Z Chen, Y Zhao, V Mazalov, Y Gong, BH Juang 2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018	132	2018
Conditional teacher-student learning Z Meng, J Li, Y Zhao, Y Gong ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	101	2019
Microsoft Mulan-a bilingual TTS system M Chu, H Peng, Y Zhao, Z Niu, E Chang 2003 IEEE International Conference on Acoustics, Speech, and Signal …, 2003	89	2003
Advances in online audio-visual meeting transcription T Yoshioka, I Abramovski, C Aksoylar, Z Chen, M David, D Dimitriadis, ... 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019	84	2019
Adversarial speaker verification Z Meng, Y Zhao, J Li, Y Gong ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	83	2019
Microsoft speaker diarization system for the voxceleb speaker recognition challenge 2020 X Xiao, N Kanda, Z Chen, T Zhou, T Yoshioka, S Chen, Y Zhao, G Liu, ... arXiv preprint arXiv:2010.11458, 2020	68	2020
Low-rank plus diagonal adaptation for deep neural networks Y Zhao, J Li, Y Gong 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016	65	2016
Cnn with phonetic attention for text-independent speaker verification T Zhou, Y Zhao, J Li, Y Gong, J Wu 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019	61	2019
Refining segmental boundaries for TTS database using fine contextual-dependent boundary models L Wang, Y Zhao, M Chu, J Zhou, Z Cao 2004 IEEE International Conference on Acoustics, Speech, and Signal …, 2004	49	2004
Improving Deep CNN Networks with Long Temporal Context for Text-Independent Speaker Verification Y Zhao, T Zhou, Z Chen, J Wu ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020	43	2020

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors