GraphRel: Modeling Text as Relational Graphs for Joint Entity and Relation Extraction | TJ Fu, PH Li, WY Ma | ACL (Long), 2019 | 442 | 2019 |
Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis | W Feng, X He, TJ Fu, V Jampani, A Akula, P Narayana, S Basu, XE Wang, ... | ICLR, 2023 | 209 | 2023 |
VIOLET: End-to-End Video-Language Transformers with Masked Visual-token Modeling | TJ Fu, L Li, Z Gan, K Lin, WY Wang, L Wang, Z Liu | arXiv:2111.12681, 2021 | 193 | 2021 |
Dynamic Video Segmentation Network | YS Xu, TJ Fu*, HK Yang*, CY Lee | CVPR, 2018 | 146 | 2018 |
Diversity-Driven Exploration Strategy for Deep Reinforcement Learning | ZW Hong, TY Shann, SY Su, YH Chang, TJ Fu, CY Lee | NeurIPS, 2018 | 133 | 2018 |
LayoutGPT: Compositional Visual Planning and Generation with Large Language Models | W Feng*, W Zhu*, T Fu, V Jampani, A Akula, X He, S Basu, XE Wang, ... | NeurIPS, 2023 | 102 | 2023 |
Counterfactual Vision-and-Language Navigation via Adversarial Path Sampling | TJ Fu, X Wang, M Peterson, S Grafton, M Eckstein, WY Wang | ECCV (Spotlight), 2020 | 100* | 2020 |
Attentive and Adversarial Learning for Video Summarization | TJ Fu, SH Tai, HT Chen | WACV (Oral), 2019 | 82 | 2019 |
Why Attention? Analyze BiLSTM Deficiency and Its Remedies in the Case of NER | PH Li, TJ Fu, WY Ma | AAAI (Oral), 2020 | 71* | 2020 |
An Empirical Study of End-to-End Video-Language Transformers with Masked Visual Modeling | TJ Fu*, L Li*, Z Gan, K Lin, WY Wang, L Wang, Z Liu | CVPR, 2023 | 54 | 2023 |
Language-Driven Artistic Style Transfer | TJ Fu, XE Wang, WY Wang | ECCV, 2022 | 48* | 2022 |
DOC2PPT: Automatic Presentation Slides Generation from Scientific Documents | TJ Fu, WY Wang, D McDuff, Y Song | AAAI, 2022 | 47 | 2022 |
SSCR: Iterative Language-Based Image Editing via Self-Supervised Counterfactual Reasoning | TJ Fu, X Wang, S Grafton, M Eckstein, WY Wang | EMNLP (Oral), 2020 | 44 | 2020 |
Guiding Instruction-based Image Editing via Multimodal Large Language Models | TJ Fu, W Hu, X Du, WY Wang, Y Yang, Z Gan | ICLR (Spotlight), 2024 | 37 | 2024 |
VELMA: Verbalization Embodiment of LLM Agents for Vision and Language Navigation in Street View | R Schumann, W Zhu, W Feng, TJ Fu, S Riezler, WY Wang | AAAI, 2024 | 34 | 2024 |
Multimodal Text Style Transfer for Outdoor Vision-and-Language Navigation | W Zhu, X Wang, TJ Fu, A Yan, P Narayana, K Sone, S Basu, WY Wang | EACL (Long), 2021 | 32 | 2021 |
Tell Me What Happened: Unifying Text-guided Video Completion via Multimodal Masked Video Generation | TJ Fu, L Yu, N Zhang, CY Fu, JC Su, WY Wang, S Bell | CVPR, 2023 | 31 | 2023 |
M3L: Language-based Video Editing via Multi-Modal Multi-Level Transformers | TJ Fu, XE Wang, ST Grafton, MP Eckstein, WY Wang | CVPR, 2022 | 23 | 2022 |
CPL: Counterfactual Prompt Learning for Vision and Language Models | X He, D Yang, W Feng, TJ Fu, A Akula, V Jampani, P Narayana, S Basu, ... | EMNLP (Long), 2022 | 19 | 2022 |
Photoswap: Personalized Subject Swapping in Images | J Gu, Y Wang, N Zhao, TJ Fu, W Xiong, Q Liu, Z Zhang, H Zhang, J Zhang, ... | NeurIPS, 2023 | 16 | 2023 |