关注
Yong Jae Lee
Yong Jae Lee
Associate Professor of Computer Sciences, UW-Madison
在 wisc.edu 的电子邮件经过验证 - 首页
标题
引用次数
年份
VGBench: Evaluating Large Language Models on Vector Graphics Understanding and Generation
B Zou, M Cai, J Zhang, YJ Lee
arXiv preprint arXiv:2407.10972, 2024
2024
Testing learning-enabled cyber-physical systems with large-language models: A formal approach
X Zheng, AK Mok, R Piskac, YJ Lee, B Krishnamachari, D Zhu, ...
Companion Proceedings of the 32nd ACM International Conference on the …, 2024
12024
LLaRA: Supercharging Robot Learning Data for Vision-Language Policy
X Li, C Mata, J Park, K Kahatapitiya, YS Jang, J Shang, K Ranasinghe, ...
arXiv preprint arXiv:2406.20095, 2024
2024
MATE: Meet At The Embedding--Connecting Images with Long Texts
YK Jang, J Kang, YJ Lee, D Kim
arXiv preprint arXiv:2407.09541, 2024
2024
Yo'LLaVA: Your Personalized Language and Vision Assistant
T Nguyen, H Liu, Y Li, M Cai, U Ojha, YJ Lee
arXiv preprint arXiv:2406.09400, 2024
2024
Matryoshka Multimodal Models
M Cai, J Yang, J Gao, YJ Lee
arXiv preprint arXiv:2405.17430, 2024
22024
Llava-prumerge: Adaptive token reduction for efficient large multimodal models
Y Shang, M Cai, B Xu, YJ Lee, Y Yan
arXiv preprint arXiv:2403.15388, 2024
72024
Llm inference unveiled: Survey and roofline model insights
Z Yuan, Y Shang, Y Zhou, Z Dong, C Xue, B Wu, Z Li, Q Gu, YJ Lee, ...
arXiv preprint arXiv:2402.16363, 2024
122024
Cohere3D: Exploiting Temporal Coherence for Unsupervised Representation Learning of Vision-based Autonomous Driving
Y Xie, H Chen, GP Meyer, YJ Lee, EM Wolff, M Tomizuka, W Zhan, Y Chai, ...
arXiv preprint arXiv:2402.15583, 2024
2024
Method and system of using a global transformer for efficient modeling of global context in point clouds
YJ Lee, H Liu
US Patent 11,908,202, 2024
2024
CounterCurate: Enhancing Physical and Semantic Visio-Linguistic Compositional Reasoning via Counterfactual Examples
J Zhang, M Cai, T Xie, YJ Lee
arXiv preprint arXiv:2402.13254, 2024
2024
Visual instruction inversion: Image editing via image prompting
T Nguyen, Y Li, U Ojha, YJ Lee
Advances in Neural Information Processing Systems 36, 2024
192024
Visual instruction tuning
H Liu, C Li, Q Wu, YJ Lee
Advances in neural information processing systems 36, 2024
23742024
Segment everything everywhere all at once
X Zou, J Yang, H Zhang, F Li, L Li, J Wang, L Wang, J Gao, YJ Lee
Advances in Neural Information Processing Systems 36, 2024
3162024
Investigating the catastrophic forgetting in multimodal large language model fine-tuning
Y Zhai, S Tong, X Li, M Cai, Q Qu, YJ Lee, Y Ma
Conference on Parsimony and Learning, 202-227, 2024
63*2024
LLaVA-NeXT: Improved reasoning
H Liu, C Li, Y Li, B Li, Y Zhang, S Shen, YJ Lee
OCR, and world knowledge 2, 2024
92024
ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts
M Cai, H Liu, SK Mustikovela, GP Meyer, Y Chai, D Park, YJ Lee
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
22024
Llava-next: Improved reasoning, ocr, and world knowledge
H Liu, C Li, Y Li, B Li, Y Zhang, S Shen, YJ Lee
1462024
Edit One for All: Interactive Batch Image Editing
T Nguyen, U Ojha, Y Li, H Liu, YJ Lee
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
2024
Computer vision on the edge: Individual cattle identification in real-time with readmycow system
M Smink, H Liu, D Döpfer, YJ Lee
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024
42024
系统目前无法执行此操作,请稍后再试。
文章 1–20