Follow
Yiwu Zhong
Title
Cited by
Cited by
Year
Grounded language-image pre-training
LH Li, P Zhang, H Zhang, J Yang, C Li, Y Zhong, L Wang, L Yuan, ...
Computer Vision and Pattern Recognition (CVPR), 2022
7392022
RegionCLIP: Region-based Language-Image Pretraining
Y Zhong, J Yang, P Zhang, C Li, N Codella, LH Li, L Zhou, X Dai, L Yuan, ...
Computer Vision and Pattern Recognition (CVPR), 2022
3912022
Comprehensive Image Captioning via Scene Graph Decomposition
Y Zhong, L Wang, J Chen, D Yu, Y Li
European Conference on Computer Vision (ECCV), 2020
1252020
Learning to Generate Scene Graph from Natural Language Supervision
Y Zhong, J Shi, J Yang, C Xu, Y Li
International Conference on Computer Vision (ICCV), 2021
682021
Gpt-4v in wonderland: Large multimodal models for zero-shot smartphone gui navigation
A Yan, Z Yang, W Zhu, K Lin, L Li, J Wang, J Yang, Y Zhong, J McAuley, ...
arXiv preprint arXiv:2311.07562, 2023
322023
Learning Concise and Descriptive Attributes for Visual Recognition
A Yan*, Y Wang*, Y Zhong*, C Dong, Z He, Y Lu, W Wang, J Shang, ...
International Conference on Computer Vision (ICCV), 2023
302023
A Simple Baseline for Weakly-Supervised Scene Graph Generation
J Shi, Y Zhong, N Xu, Y Li, C Xu
International Conference on Computer Vision (ICCV), 2021
272021
Robust and interpretable medical image classifiers via concept bottleneck models
A Yan, Y Wang, Y Zhong, Z He, P Karypis, Z Wang, C Dong, A Gentili, ...
arXiv preprint arXiv:2310.03182, 2023
142023
Learning Procedure-Aware Video Representation From Instructional Videos and Their Narrations
Y Zhong, L Yu, Y Bai, S Li, X Yan, Y Li
Computer Vision and Pattern Recognition (CVPR), 2023
142023
Towards learning a generalist model for embodied navigation
D Zheng, S Huang, L Zhao, Y Zhong, L Wang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
12024
Beyond Embeddings: The Promise of Visual Table in Multi-Modal Models
Y Zhong, ZY Hu, MR Lyu, L Wang
arXiv preprint arXiv:2403.18252, 2024
2024
Towards Modern Image Manipulation Localization: A Large-Scale Dataset and Novel Methods
C Qu, Y Zhong, C Liu, G Xu, D Peng, F Guo, L Jin
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
2024
Learning Visual Knowledge From Natural Language Supervision
Y Zhong
The University of Wisconsin-Madison, 2023
2023
Supplementary Materials for Learning Procedure-aware Video Representation from Instructional Videos and Their Narrations
Y Zhong, L Yu, Y Bai, S Li, X Yan, Y Li
2023
The system can't perform the operation now. Try again later.
Articles 1–14