Follow
Runhui Huang
Runhui Huang
Verified email at mail2.sysu.edu.cn
Title
Cited by
Cited by
Year
FILIP: fine-grained interactive language-image pre-training
L Yao*, R Huang*, L Hou*, G Lu, M Niu, H Xu, X Liang, Z Li, X Jiang, C Xu
arXiv preprint arXiv:2111.07783, 2021
3872021
Wukong: 100 million large-scale chinese cross-modal pre-training dataset and a foundation framework
HX Jiaxi Gu, Xiaojun Meng, Guansong Lu, Lu Hou, Minzhe Niu, Xiaodan Liang ...
arXiv preprint arXiv:2202.06767, 2022
66*2022
Deep Feature Fusion with Multiple Granularity for Vehicle Re-identification.
P Huang, R Huang, J Huang, R Yangchen, Z He, X Li, J Chen
CVPR workshops, 80-88, 2019
242019
Nlip: Noise-robust language-image pre-training
R Huang, Y Long, J Han, H Xu, X Liang, C Xu, X Liang
Proceedings of the AAAI Conference on Artificial Intelligence 37 (1), 926-934, 2023
112023
Fine-Grained Visual–Text Prompt-Driven Self-Training for Open-Vocabulary Object Detection
Y Long, J Han, R Huang, H Xu, Y Zhu, C Xu, X Liang
IEEE Transactions on Neural Networks and Learning Systems, 2023
72023
Boosting visual-language models by exploiting hard samples
H Wang, M Huang, R Huang, L Hong, H Xu, T Hu, X Liang, Z Li
arXiv preprint arXiv:2305.05208, 2023
42023
Growclip: Data-aware automatic model growing for large-scale contrastive language-image pre-training
X Deng, H Shi, R Huang, C Li, H Xu, J Han, J Kwok, S Zhao, W Zhang, ...
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
22023
LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model
R Huang, K Cai, J Han, X Liang, R Pei, G Lu, S Xu, W Zhang, H Xu
arXiv preprint arXiv:2403.11929, 2024
2024
SYSTEM AND METHOD FOR CROSS-MODAL INTERACTION BASED ON PRE-TRAINED MODEL
H XU, L Hou, G LU, M Niu, Z LI, R Huang, L Yao, C XU, X Liang
US Patent App. 17/900,592, 2024
2024
UniDiff: Advancing Vision-Language Models with Generative and Discriminative Learning
X Dong, R Huang, X Wei, Z Jie, J Yu, J Yin, X Liang
arXiv preprint arXiv:2306.00813, 2023
2023
DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability
R Huang, J Han, G Lu, X Liang, Y Zeng, W Zhang, H Xu
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
2023
The system can't perform the operation now. Try again later.
Articles 1–11