Haotian Liu
xAI
Verified email at x.ai
Title
Cited by
Year
Visual instruction tuning
H Liu, C Li, Q Wu, YJ Lee
Advances in neural information processing systems 36, 2024
Cited by: 2210; Year: 2024
Improved Baselines with Visual Instruction Tuning
H Liu, C Li, Y Li, YJ Lee
CVPR 2024, 2023
Cited by: 650; Year: 2023
GLIGEN: Open-Set Grounded Text-to-Image Generation
Y Li, H Liu, Q Wu, F Mu, J Yang, J Gao, C Li, YJ Lee
CVPR 2023, 2023
Cited by: 371*; Year: 2023
LLaVA-Med: Training a large language-and-vision assistant for biomedicine in one day
C Li, C Wong, S Zhang, N Usuyama, H Liu, J Yang, T Naumann, H Poon, ...
NeurIPS 2023, Datasets and Benchmarks Track, 2023
Cited by: 243; Year: 2023
LLaVA-NeXT: Improved reasoning, OCR, and world knowledge
H Liu, C Li, Y Li, B Li, Y Zhang, S Shen, YJ Lee
https://llava-vl.github.io/blog/2024-01-30-llava-next, 2024
Cited by: 153*; Year: 2024
Masked discrimination for self-supervised learning on point clouds
H Liu, M Cai, YJ Lee
Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel …, 2022
Cited by: 117; Year: 2022
ELEVATER: A Benchmark and Toolkit for Evaluating Language-Augmented Visual Models
C Li*, H Liu*, LH Li, P Zhang, J Aneja, J Yang, P Jin, H Hu, Z Liu, YJ Lee, ...
NeurIPS 2022, Datasets and Benchmarks Track, 2022
Cited by: 106; Year: 2022
Aligning Large Multimodal Models with Factually Augmented RLHF
Z Sun, S Shen, S Cao, H Liu, C Li, Y Shen, C Gan, LY Gui, YX Wang, ...
arXiv preprint arXiv:2309.14525, 2023
Cited by: 93; Year: 2023
YolactEdge: Real-time Instance Segmentation on the Edge
H Liu*, RAR Soto*, F Xiao, YJ Lee
2021 IEEE International Conference on Robotics and Automation (ICRA), 9579-9585, 2021
Cited by: 78; Year: 2021
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
S Liu, H Cheng, H Liu, H Zhang, F Li, T Ren, X Zou, J Yang, H Su, J Zhu, ...
arXiv preprint arXiv:2311.05437, 2023
Cited by: 55; Year: 2023
Learning Customized Visual Models with Retrieval-Augmented Knowledge
H Liu, K Son, J Yang, C Liu, J Gao, YJ Lee, C Li
CVPR 2023, 2023
Cited by: 36*; Year: 2023
Making large multimodal models understand arbitrary visual prompts
M Cai, H Liu, SK Mustikovela, GP Meyer, Y Chai, D Park, YJ Lee
CVPR 2024, 2023
Cited by: 24; Year: 2023
Identity from here, Pose from there: Self-supervised Disentanglement and Generation of Objects using Unlabeled Videos
F Xiao, H Liu, YJ Lee
Proceedings of the IEEE International Conference on Computer Vision, 7013-7022, 2019
Cited by: 20; Year: 2019
An Empirical Study of Scaling Instruct-Tuned Large Multimodal Models
Y Lu, C Li, H Liu, J Yang, J Gao, Y Shen
arXiv preprint arXiv:2309.09958, 2023
Cited by: 16; Year: 2023
Operation strategy of public building: Implications from trade-off between carbon emission and occupant satisfaction
Y Chen, H Liu, L Shi
Journal of Cleaner Production 205, 629-644, 2018
Cited by: 11; Year: 2018
LLaVA-NeXT: A Strong Zero-shot Video Understanding Model
Y Zhang, B Li, H Liu, YJ Lee, L Gui, D Fu, J Feng, Z Liu, C Li
https://llava-vl.github.io/blog/2024-04-30-llava-next-video/, 2024
Cited by: 9*; Year: 2024
Generate Anything Anywhere in Any Scene
Y Li, H Liu, Y Wen, YJ Lee
arXiv preprint arXiv:2306.17154, 2023
Cited by: 8; Year: 2023
Computer Vision on the Edge: Individual Cattle Identification in Real-Time With ReadMyCow System
M Smink, H Liu, D Döpfer, YJ Lee
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2024
Cited by: 4; Year: 2024
LMMs-Eval: Accelerating the development of large multimodal models
B Li, P Zhang, K Zhang, F Pu, X Du, Y Dong, H Liu, Y Zhang, G Zhang, ...
March, 2024
Cited by: 3; Year: 2024
Benchmarking and Analyzing Generative Data for Visual Recognition
B Li, H Liu, L Chen, YJ Lee, C Li, Z Liu
arXiv preprint arXiv:2307.13697, 2023
Cited by: 3; Year: 2023