A challenger to GPT-4V? Early explorations of Gemini in visual expertise C Fu, R Zhang, H Lin, Z Wang, T Gao, Y Luo, Y Huang, Z Zhang, L Qiu, ... arXiv preprint arXiv:2312.12436, 2023 | 20 | 2023 |
RAAT: Relation-augmented attention transformer for relation modeling in document-level event extraction Y Liang, Z Jiang, D Yin, B Ren Proceedings of the 2022 Conference of the North American Chapter of the …, 2022 | 15 | 2022 |
OS-MSL: One stage multimodal sequential link framework for scene segmentation and classification Y Liu, L Qiao, D Yin, Z Jiang, X Jiang, D Jiang, B Ren Proceedings of the 30th ACM International Conference on Multimedia, 6269-6277, 2022 | 7 | 2022 |
MAC-SQL: Multi-agent collaboration for text-to-SQL B Wang, C Ren, J Yang, X Liang, J Bai, QW Zhang, Z Yan, Z Li arXiv preprint arXiv:2312.11242, 2023 | 5 | 2023 |
MemoChat: Tuning LLMs to use memos for consistent long-range open-domain conversation J Lu, S An, M Lin, G Pergola, Y He, D Yin, X Sun, Y Wu arXiv preprint arXiv:2308.08239, 2023 | 5 | 2023 |
OSAN: A One-Stage Alignment Network to Unify Multimodal Alignment and Unsupervised Domain Adaptation Y Liu, L Qiao, C Lu, D Yin, C Lin, H Peng, B Ren Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 5 | 2023 |
Entropy-based Optimization on Individual and Global Predictions for Semi-Supervised Learning Z Zhao, M Zhao, Y Liu, D Yin, L Zhou Proceedings of the 31st ACM International Conference on Multimedia, 8346-8355, 2023 | 2 | 2023 |
Grafting pre-trained models for multimodal headline generation L Qiao, C Wu, Y Liu, H Peng, D Yin, B Ren Proceedings of the 2022 Conference on Empirical Methods in Natural Language …, 2022 | 2 | 2022 |
Unsupervised Extractive Summarization With Heterogeneous Graph Embeddings for Chinese Documents C Lin, Y Liu, S An, D Yin ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 1 | 2023 |
Leveraging Key Information Modeling to Improve Less-Data Constrained News Headline Generation via Duality Fine-Tuning Z Jiang, L Qiao, D Yin, S Feng, B Ren Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the …, 2022 | 1 | 2022 |
Contrastive graph multimodal model for text classification in videos Y Liu, C Lu, C Lin, D Yin, B Ren arXiv preprint arXiv:2206.02343, 2022 | 1 | 2022 |
FIPO: Free-form Instruction-oriented Prompt Optimization with Preference Dataset and Modular Fine-tuning Schema J Lu, S An, M Zhang, Y He, D Yin, X Sun arXiv preprint arXiv:2402.11811, 2024 | | 2024 |
VKIE: The Application of Key Information Extraction on Video Text S An, Y Liu, H Peng, D Yin Proceedings of the 2023 Conference on Empirical Methods in Natural Language …, 2023 | | 2023 |