Follow
Tsu-Jui Fu
Title
Cited by
Cited by
Year
GraphRel: Modeling Text as Relational Graphs for Joint Entity and Relation Extraction
TJ Fu, PH Li, WY Ma
ACL (Long), 2019
4222019
VIOLET: End-to-End Video-Language Transformers with Masked Visual-token Modeling
TJ Fu, L Li, Z Gan, K Lin, WY Wang, L Wang, Z Liu
arXiv:2111.12681, 2021
1732021
Dynamic Video Segmentation Network
YS Xu, TJ Fu*, HK Yang*, CY Lee
CVPR, 2018
1342018
Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis
W Feng, X He, TJ Fu, V Jampani, A Akula, P Narayana, S Basu, XE Wang, ...
ICLR, 2023
1332023
Diversity-Driven Exploration Strategy for Deep Reinforcement Learning
ZW Hong, TY Shann, SY Su, YH Chang, TJ Fu, CY Lee
NeurIPS, 2018
1252018
Counterfactual Vision-and-Language Navigation via Adversarial Path Sampling
TJ Fu, X Wang, M Peterson, S Grafton, M Eckstein, WY Wang
ECCV (Spotlight), 2020
90*2020
Attentive and Adversarial Learning for Video Summarization
TJ Fu, SH Tai, HT Chen
WACV (Oral), 2019
742019
Why Attention? Analyze BiLSTM Deficiency and Its Remedies in the Case of NER
PH Li, TJ Fu, WY Ma
AAAI (Oral), 2020
60*2020
LayoutGPT: Compositional Visual Planning and Generation with Large Language Models
W Feng*, W Zhu*, T Fu, V Jampani, A Akula, X He, S Basu, XE Wang, ...
NeurIPS, 2023
452023
Language-Driven Artistic Style Transfer
TJ Fu, XE Wang, WY Wang
ECCV, 2022
41*2022
SSCR: Iterative Language-Based Image Editing via Self-Supervised Counterfactual Reasoning
TJ Fu, X Wang, S Grafton, M Eckstein, WY Wang
EMNLP (Oral), 2020
392020
An Empirical Study of End-to-End Video-Language Transformers with Masked Visual Modeling
TJ Fu*, L Li*, Z Gan, K Lin, WY Wang, L Wang, Z Liu
CVPR, 2023
372023
DOC2PPT: Automatic Presentation Slides Generation from Scientific Documents
TJ Fu, WY Wang, D McDuff, Y Song
AAAI, 2022
352022
Multimodal Text Style Transfer for Outdoor Vision-and-Language Navigation
W Zhu, X Wang, TJ Fu, A Yan, P Narayana, K Sone, S Basu, WY Wang
EACL (Long), 2021
252021
Tell Me What Happened: Unifying Text-guided Video Completion via Multimodal Masked Video Generation
TJ Fu, L Yu, N Zhang, CY Fu, JC Su, WY Wang, S Bell
CVPR, 2023
212023
M3L: Language-based Video Editing via Multi-Modal Multi-Level Transformers
TJ Fu, XE Wang, ST Grafton, MP Eckstein, WY Wang
CVPR, 2022
192022
VELMA: Verbalization Embodiment of LLM Agents for Vision and Language Navigation in Street View
R Schumann, W Zhu, W Feng, TJ Fu, S Riezler, WY Wang
AAAI, 2024
162024
Guiding Instruction-based Image Editing via Multimodal Large Language Models
TJ Fu, W Hu, X Du, WY Wang, Y Yang, Z Gan
arXiv:2309.17102, 2023
162023
CPL: Counterfactual Prompt Learning for Vision and Language Models
X He, D Yang, W Feng, TJ Fu, A Akula, V Jampani, P Narayana, S Basu, ...
EMNLP (Long), 2022
162022
Speed Reading: Learning to Read ForBackward via Shuttle
TJ Fu, WY Ma
EMNLP (Long), 2018
142018
The system can't perform the operation now. Try again later.
Articles 1–20