Wenliang Dai
Research Scientist, NVIDIA
Verified email at nvidia.com
Title
Cited by
Year
Negative object presence evaluation (nope) to measure object hallucination in vision-language models
H Lovenia, W Dai, S Cahyawijaya, Z Ji, P Fung
arXiv preprint arXiv:2310.05338, 2023
Cited by: 23; Year: 2023
Survey of social bias in vision-language models
N Lee, Y Bang, H Lovenia, S Cahyawijaya, W Dai, P Fung
arXiv preprint arXiv:2309.14381, 2023
Cited by: 9; Year: 2023
Visual Instruction Tuning with Polite Flamingo
D Chen, J Liu, W Dai, B Wang
The 38th Annual AAAI Conference on Artificial Intelligence (AAAI-24), 2023
Cited by: 21; Year: 2023
mCLIP: Multilingual CLIP via Cross-lingual Transfer
G Chen, L Hou, Y Chen, W Dai, L Shang, X Jiang, Q Liu, J Pan, W Wang
ACL 2023, 2023
Cited by: 13; Year: 2023
InstructBLIP: Towards General-purpose Vision-Language Models with Instruction Tuning
W Dai, J Li, D Li, AMH Tiong, J Zhao, W Wang, B Li, P Fung, S Hoi
37th Conference on Neural Information Processing Systems (NeurIPS 2023), 2023
Cited by: 2393*; Year: 2023
A Multitask, Multilingual, Multimodal Evaluation of ChatGPT on Reasoning, Hallucination, and Interactivity
Y Bang, S Cahyawijaya, N Lee, W Dai, D Su, B Wilie, H Lovenia, Z Ji, ...
AACL 2023 - Area Chair Award (Language Modeling and Analysis), 2023
Cited by: 1135*; Year: 2023
NusaCrowd: Open Source Initiative for Indonesian NLP Resources
S Cahyawijaya, H Lovenia, AF Aji, GI Winata, B Wilie, R Mahendra, ...
Findings of ACL 2023, 2022
Cited by: 1060; Year: 2022
Plausible May Not Be Faithful: Probing Object Hallucination in Vision-Language Pre-training
W Dai, Z Liu, Z Ji, D Su, P Fung
EACL 2023, 2022
Cited by: 47; Year: 2022
Enabling Multimodal Generation on CLIP via Vision-Language Knowledge Distillation
W Dai, L Hou, L Shang, X Jiang, Q Liu, P Fung
Findings of ACL 2022, 2022
Cited by: 84; Year: 2022
Survey of Hallucination in Natural Language Generation
Z Ji, N Lee, R Frieske, T Yu, D Su, Y Xu, E Ishii, Y Bang, W Dai, A Madotto, ...
ACM Computing Surveys, 2022
Cited by: 2089; Year: 2022
CI-AVSR: A Cantonese Audio-Visual Speech Dataset for In-car Command Recognition
W Dai, S Cahyawijaya, T Yu, EJ Barezi, P Xu, CTS Yiu, R Frieske, ...
LREC 2022, 2022
Cited by: 13*; Year: 2022
Automatic Speech Recognition Datasets in Cantonese: A Survey and New Dataset
T Yu, R Frieske, P Xu, S Cahyawijaya, CTS Yiu, H Lovenia, W Dai, ...
LREC 2022, 2022
Cited by: 6; Year: 2022
ASCEND: A Spontaneous Chinese-English Dataset for Code-switching in Multi-turn Conversation
H Lovenia, S Cahyawijaya, GI Winata, P Xu, X Yan, Z Liu, R Frieske, T Yu, ...
LREC 2022, 2021
Cited by: 26; Year: 2021
Greenformer: Factorization toolkit for efficient deep neural networks
S Cahyawijaya, GI Winata, H Lovenia, B Wilie, W Dai, E Ishii, P Fung
arXiv preprint arXiv:2109.06762, 2021
Cited by: 5; Year: 2021
Vision Guided Generative Pre-trained Language Models for Multimodal Abstractive Summarization
T Yu*, W Dai*, Z Liu, P Fung
EMNLP 2021, 2021
Cited by: 66; Year: 2021
Weakly-supervised multi-task learning for multimodal affect recognition
W Dai, S Cahyawijaya, Y Bang, P Fung
arXiv preprint arXiv:2104.11560, 2021
Cited by: 14; Year: 2021
Multimodal End-to-End Sparse Model for Emotion Recognition
W Dai, S Cahyawijaya, Z Liu, P Fung
NAACL 2021, 2021
Cited by: 64; Year: 2021
CrossNER: Evaluating Cross-Domain Named Entity Recognition
Z Liu, Y Xu, T Yu, W Dai, Z Ji, S Cahyawijaya, A Madotto, P Fung
The Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI-21), 2020
Cited by: 127; Year: 2020
BART-based Approach for Scientific Document Summarization
T Yu, D Su, W Dai, P Fung
Proceedings of the First Workshop on Scholarly Document Processing, EMNLP …, 2020
Cited by: 13*; Year: 2020
Multi-hop Question Generation with Graph Convolutional Network
D Su, Y Xu, W Dai, Z Ji, T Yu, P Fung
Findings of EMNLP 2020, 4636–4647, 2020
Cited by: 44; Year: 2020