Large language models for data annotation: A survey

Z Tan, D Li, S Wang, A Beigi, B Jiang… - arXiv preprint arXiv …, 2024 - arxiv.org
Data annotation generally refers to the labeling or generating of raw data with relevant
information, which could be used for improving the efficacy of machine learning models. The …

Exploring large language models for feature selection: A data-centric perspective

D Li, Z Tan, H Liu - arXiv preprint arXiv:2408.12025, 2024 - arxiv.org
The rapid advancement of Large Language Models (LLMs) has significantly influenced
various domains, leveraging their exceptional few-shot and zero-shot learning capabilities …

ExpLLM: Towards chain of thought for facial expression recognition

X Lan, J Xue, J Qi, D Jiang, K Lu, TS Chua - arXiv preprint arXiv …, 2024 - arxiv.org
Facial expression recognition (FER) is a critical task in multimedia with significant
implications across various domains. However, analyzing the causes of facial expressions is …

EMO-LLaMA: Enhancing facial emotion understanding with instruction tuning

B Xing, Z Yu, X Liu, K Yuan, Q Ye, W Xie, H Yue… - arXiv preprint arXiv …, 2024 - arxiv.org
Facial expression recognition (FER) is an important research topic in emotional artificial
intelligence. In recent decades, researchers have made remarkable progress. However …

DLO: Dynamic Layer Operation for Efficient Vertical Scaling of LLMs

Z Tan, D Dong, X Zhao, J Peng, Y Cheng… - arXiv preprint arXiv …, 2024 - arxiv.org
In this paper, we introduce Dynamic Layer Operations (DLO), a novel approach for vertically
scaling transformer-based Large Language Models (LLMs) by dynamically expanding …

LRQ-Fact: LLM-Generated Relevant Questions for Multimodal Fact-Checking

A Beigi, B Jiang, D Li, T Kumarage, Z Tan… - arXiv preprint arXiv …, 2024 - arxiv.org
Human fact-checkers have specialized domain knowledge that allows them to formulate
precise questions to verify information accuracy. However, this expert-driven approach is …

A Survey on Multimodal Benchmarks: In the Era of Large AI Models

L Li, G Chen, H Shi, J Xiao, L Chen - arXiv preprint arXiv:2409.18142, 2024 - arxiv.org
The rapid evolution of Multimodal Large Language Models (MLLMs) has brought substantial
advancements in artificial intelligence, significantly enhancing the capability to understand …

Towards Unified Facial Action Unit Recognition Framework by Large Language Models

G Hu, X Lan, H Jiang, J Lyu, J Xue - arXiv preprint arXiv:2409.08444, 2024 - arxiv.org
Facial Action Units (AUs) are of great significance in the realm of affective computing. In this
paper, we propose AU-LLaVA, the first unified AU recognition framework based on the …

Toward Open World Visual Understanding

W Bao - 2024 - search.proquest.com
Visual data such as images and videos are the most prominent media to record, transmit,
and exchange information in this era. Though we have witnessed waves of success in visual …