An empirical study of Conv-TasNet B Kadıoğlu, M Horgan, X Liu, J Pons, D Darcy, V Kumar ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and …, 2020 | 49 | 2020 |
Quantitative evidence on overlooked aspects of enrollment speaker embeddings for target speaker separation X Liu, X Li, J Serrà ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 23 | 2023 |
Deep convolutional and LSTM neural networks for acoustic modelling in automatic speech recognition X Liu vol, 2018 | 15 | 2018 |
On permutation invariant training for speech source separation X Liu, J Pons ICASSP 2021 - 2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 8 | 2021 |
Systems and methods for domain adaptation in neural networks R Chen, MH Chen, J Yoo, X Liu US Patent App. 18/145,967, 2023 | 7 | 2023 |
Video tagging by correlating visual features to sound tags S Krishnamurthy, X Liu US Patent US10847186B1, 2020 | 5 | 2020 |
A modulation feature set for robust automatic speech recognition in additive noise and reverberation X Liu, R Sadeghian, SA Zahorian 2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017 | 5 | 2017 |
A unified framework for filterbank and time-frequency basis vectors in ASR frontends X Liu, SA Zahorian 2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015 | 3 | 2015 |
CLIPSonic: Text-to-Audio Synthesis with Unlabeled Videos and Pretrained Language-Vision Models HW Dong, X Liu, J Pons, G Bhattacharya, S Pascual, J Serrà, ... 2023 IEEE Workshop on Applications of Signal Processing to Audio and …, 2023 | 2 | 2023 |
GASS: Generalizing Audio Source Separation with Large-scale Data J Pons, X Liu, S Pascual, J Serrà ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and …, 2023 | 2 | 2023 |
Combined PNCC feature extractor for robust speech recognition X Liu, SA Zahorian 2014 IEEE China Summit & International Conference on Signal and Information …, 2014 | 1 | 2014 |
Deep-learning based speech enhancement X Liu, MG Horgan, RM Fejgin, P Holmberg US Patent App. 18/250,393, 2023 | | 2023 |
Deep source separation architecture B Kadioglu, M Horgan, J Pons Puig, X Liu US Patent App. 17/770,177, 2022 | | 2022 |
Phoneme recognizer customizable keyword spotting system with keyword adaptation L Kaushik, Z Ge, X Liu US Patent App. 17/567,873, 2022 | | 2022 |
Systems and methods for adapting human speaker embeddings in speech synthesis C Zhou, X Liu, MG Horgan, V Kumar US Patent App. 17/636,851, 2022 | | 2022 |