Efficient Low-rank Multimodal Fusion with Modality-Specific Factors Z Liu, Y Shen, VB Lakshminarasimhan, PP Liang, A Zadeh, LP Morency Proceedings of the 56th Annual Meeting of the Association for Computational …, 2018 | 859 | 2018 |
Words can shift: Dynamically adjusting word representations using nonverbal behaviors Y Wang, Y Shen, Z Liu, PP Liang, A Zadeh, LP Morency Proceedings of the AAAI conference on artificial intelligence 33 (01), 7216-7223, 2019 | 430 | 2019 |
Multiinstruct: Improving multi-modal zero-shot learning via instruction tuning Z Xu, Y Shen, L Huang arXiv preprint arXiv:2212.10773, 2022 | 89 | 2022 |
Efficient low-rank multimodal fusion with modality-specific factors. arXiv 2018 Z Liu, Y Shen, VB Lakshminarasimhan, PP Liang, A Zadeh, LP Morency arXiv preprint arXiv:1806.00064, 1806 | 25 | 1806 |
The art of socratic questioning: Recursive thinking with large language models J Qi, Z Xu, Y Shen, M Liu, D Jin, Q Wang, L Huang arXiv preprint arXiv:2305.14999, 2023 | 17 | 2023 |
Vision-flan: Scaling human-labeled tasks in visual instruction tuning Z Xu, C Feng, R Shao, T Ashby, Y Shen, D Jin, Y Cheng, Q Wang, ... arXiv preprint arXiv:2402.11690, 2024 | 16 | 2024 |
X-eval: Generalizable multi-aspect text evaluation via augmented instruction tuning with auxiliary evaluation aspects M Liu, Y Shen, Z Xu, Y Cao, E Cho, V Kumar, R Ghanadan, L Huang arXiv preprint arXiv:2311.08788, 2023 | 10 | 2023 |
Multimodal Instruction Tuning with Conditional Mixture of LoRA Y Shen, Z Xu, Q Wang, Y Cheng, W Yin, L Huang arXiv preprint arXiv:2402.15896, 2024 | 6 | 2024 |
Efficient low-rank multimodal fusion with modality-specific factors. 2018 Z Liu, Y Shen, VB Lakshminarasimhan, PP Liang, A Zadeh, LP Morency arXiv preprint arXiv:1806.00064, 1806 | 6 | 1806 |
PJapa. Efficient low-rank multimodal fusion with modality-specific factors. 2018 Z Liu, Y Shen, VB Lakshminarasimhan, PP Liang, A Zadeh, L Morency Proceedings of the 56th Annual Meeting of the Association for Computational …, 0 | 6 | |
The art of SOCRATIC QUESTIONING: zero-shot multimodal reasoning with recursive thinking and selfquestioning J Qi, Z Xu, Y Shen, M Liu, D Jin, Q Wang, L Huang CoRR, abs/2305.14999, 2023 | 5 | 2023 |
Kaleido Diffusion: Improving Conditional Diffusion Models with Autoregressive Latent Modeling J Gu, Y Shen, S Zhai, Y Zhang, N Jaitly, JM Susskind arXiv preprint arXiv:2405.21048, 2024 | 3 | 2024 |
MULTISCRIPT: Multimodal Script Learning for Supporting Open Domain Everyday Tasks J Qi, M Liu, Y Shen, Z Xu, L Huang Proceedings of the AAAI Conference on Artificial Intelligence 38 (17), 18888 …, 2024 | 2 | 2024 |
Many-to-many Image Generation with Auto-regressive Diffusion Models Y Shen, Y Zhang, S Zhai, L Huang, JM Susskind, J Gu arXiv preprint arXiv:2404.03109, 2024 | 1 | 2024 |
KnowledgeBot: Improving Assistive Robot for Task Completion and Live Interaction via Neuro-Symbolic Reasoning M Liu, Y Shen, BM Yao, S Wang, J Qi, Z Xu, L Huang Proceedings of the Alexa Prize SimBot Challenge, Virtual Event 6, 2023 | 1 | 2023 |
Learning by asking for embodied visual navigation and task completion Y Shen, I Lourentzou arXiv preprint arXiv:2302.04865, 2023 | 1 | 2023 |