HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination & Visual Illusion in Large Vision-Language Models T Guan*, F Liu*, X Wu, R Xian, Z Li, X Liu, X Wang, L Chen, F Huang, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 77* | 2023 |
AZTR: Aerial video action recognition with auto zoom and temporal reasoning X Wang*, R Xian*, T Guan, CM de Melo, SM Nogar, A Bera, D Manocha 2023 IEEE International Conference on Robotics and Automation (ICRA), 2023 | 7 | 2023 |
On the Safety Concerns of Deploying LLMs/VLMs in Robotics: Highlighting the Risks and Vulnerabilities X Wu, R Xian, T Guan, J Liang, S Chakraborty, F Liu, B Sadler, ... arXiv preprint arXiv:2402.10340, 2024 | 2 | 2024 |
PMI Sampler: Patch Similarity Guided Frame Selection for Aerial Action Recognition R Xian, X Wang, D Kothandaraman, D Manocha Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2023 | 2 | 2023 |
MITFAS: Mutual information based temporal feature alignment and sampling for aerial video action recognition R Xian, X Wang, D Manocha Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2023 | 2 | 2023 |
Prompt Learning for Action Recognition X Wang*, R Xian*, T Guan, D Manocha arXiv preprint arXiv:2305.12437, 2023 | 1 | 2023 |
AGL-NET: Aerial-Ground Cross-Modal Global Localization with Varying Scales T Guan, R Xian, X Wang, X Wu, M Elnoor, D Song, D Manocha arXiv preprint arXiv:2404.03187, 2024 | | 2024 |