Wei Ping
Principal Research Scientist, NVIDIA
Verified email at nvidia.com
Title
Cited by
Year
DiffWave: A versatile diffusion model for audio synthesis
Z Kong, W Ping, J Huang, K Zhao, B Catanzaro
ICLR 2021, 2021
Cited by: 1103 | Year: 2021
Deep Voice 3: Scaling text-to-speech with convolutional sequence learning
W Ping, K Peng, A Gibiansky, SO Arik, A Kannan, S Narang, J Raiman, ...
ICLR 2018, 2018
Cited by: 870* | Year: 2018
Deep Voice 2: Multi-speaker neural text-to-speech
S Arik, G Diamos, A Gibiansky, J Miller, K Peng, W Ping, J Raiman, ...
NeurIPS 2017, 2017
Cited by: 626* | Year: 2017
Neural voice cloning with a few samples
S Arik*, J Chen*, K Peng*, W Ping*, Y Zhou
NeurIPS, 2018
Cited by: 429 | Year: 2018
ClariNet: Parallel wave generation in end-to-end text-to-speech
W Ping, K Peng, J Chen
ICLR 2019, 2019
Cited by: 402 | Year: 2019
On fast sampling of diffusion probabilistic models
Z Kong, W Ping
ICML 2021 Workshop on Invertible Neural Networks, Normalizing Flows, and …, 2021
Cited by: 159 | Year: 2021
Non-autoregressive neural text-to-speech
K Peng, W Ping, Z Song, K Zhao
ICML 2020, 2019
Cited by: 156* | Year: 2019
WaveFlow: A compact flow-based model for raw audio
W Ping, K Peng, K Zhao, Z Song
ICML 2020, 2020
Cited by: 138 | Year: 2020
BigVGAN: A universal neural vocoder with large-scale training
S Lee, W Ping, B Ginsburg, B Catanzaro, S Yoon
ICLR 2023, 2022
Cited by: 130 | Year: 2022
Cancer metastasis detection with neural conditional random field
Y Li, W Ping
Medical Imaging with Deep Learning, 2018
Cited by: 123 | Year: 2018
Long-short transformer: Efficient transformers for language and vision
C Zhu, W Ping, C Xiao, M Shoeybi, T Goldstein, A Anandkumar, ...
NeurIPS 2021, 2021
Cited by: 117 | Year: 2021
Factuality enhanced language models for open-ended text generation
N Lee, W Ping, P Xu, M Patwary, M Shoeybi, B Catanzaro
NeurIPS 2022, 2022
Cited by: 107 | Year: 2022
Topic compositional neural language model
W Wang, Z Gan, W Wang, D Shen, J Huang, W Ping, S Satheesh, L Carin
AISTATS 2018, 2017
Cited by: 89 | Year: 2017
End-to-end training of neural retrievers for open-domain question answering
DS Sachan, M Patwary, M Shoeybi, N Kant, W Ping, WL Hamilton, ...
ACL 2021, 2021
Cited by: 87 | Year: 2021
Systems and methods for multi-speaker neural text-to-speech
G Diamos, A Gibiansky, J Miller, K Peng, W Ping, J Raiman, Y Zhou
US Patent 10,896,669, 2021
Cited by: 75 | Year: 2021
One TTS alignment to rule them all
R Badlani, A Łańcucki, KJ Shih, R Valle, W Ping, B Catanzaro
ICASSP 2022, 2021
Cited by: 74 | Year: 2021
Retrieval meets long context large language models
P Xu, W Ping, X Wu, L McAfee, C Zhu, Z Liu, S Subramanian, ...
ICLR 2024, 2023
Cited by: 70 | Year: 2023
VILA: On pre-training for visual language models
J Lin, H Yin, W Ping, Y Lu, P Molchanov, A Tao, H Mao, J Kautz, ...
CVPR 2024, 2023
Cited by: 58 | Year: 2023
Million-scale near-duplicate video retrieval system
Y Cai, L Yang, W Ping, F Wang, T Mei, XS Hua, S Li
ACM Multimedia 2011, 2011
Cited by: 52 | Year: 2011
Speech denoising in the waveform domain with self-attention
Z Kong, W Ping, A Dantrey, B Catanzaro
ICASSP 2022, 2022
Cited by: 51 | Year: 2022