Diffusion model-based image editing: A survey

Y Huang, J Huang, Y Liu, M Yan, J Lv, J Liu… - arXiv preprint arXiv …, 2024 - arxiv.org
Denoising diffusion models have emerged as a powerful tool for various image generation
and editing tasks, facilitating the synthesis of visual content in an unconditional or input …

Efficient diffusion models: A comprehensive survey from principles to practices

Z Ma, Y Zhang, G Jia, L Zhao, Y Ma, M Ma… - arXiv preprint arXiv …, 2024 - arxiv.org
As one of the most popular and sought-after generative models in recent years, diffusion
models have sparked the interest of many researchers and steadily shown excellent …

Lego: Learning egocentric action frame generation via visual instruction tuning

B Lai, X Dai, L Chen, G Pang, JM Rehg… - European Conference on …, 2025 - Springer
Generating instructional images of human daily actions from an egocentric viewpoint serves
as a key step towards efficient skill transfer. In this paper, we introduce a novel problem …

HQ-Edit: A high-quality dataset for instruction-based image editing

M Hui, S Yang, B Zhao, Y Shi, H Wang, P Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
This study introduces HQ-Edit, a high-quality instruction-based image editing dataset with
around 200,000 edits. Unlike prior approaches relying on attribute guidance or human …

Generate Any Scene: Evaluating and Improving Text-to-Vision Generation with Scene Graph Programming

Z Gao, W Huang, J Zhang, A Kembhavi… - arXiv preprint arXiv …, 2024 - arxiv.org
DALL-E and Sora have gained attention by producing implausible images, such as
"astronauts riding a horse in space." Despite the proliferation of text-to-vision models that …

Instruction-Guided Editing Controls for Images and Multimedia: A Survey in LLM era

TT Nguyen, Z Ren, T Pham, PL Nguyen, H Yin… - arXiv preprint arXiv …, 2024 - arxiv.org
The rapid advancement of large language models (LLMs) and multimodal learning has
transformed digital content creation and manipulation. Traditional visual editing tools require …

InsightEdit: Towards Better Instruction Following for Image Editing

Y Xu, J Kong, J Wang, X Pan, B Lin, Q Liu - arXiv preprint arXiv …, 2024 - arxiv.org
In this paper, we focus on the task of instruction-based image editing. Previous works like
InstructPix2Pix, InstructDiffusion, and SmartEdit have explored end-to-end editing. However …

Knowledge-Enhanced Large Language Models and Human-AI Collaboration Frameworks for Creativity Support

T Chakrabarty - 2024 - search.proquest.com
Large language models (LLMs) constitute a paradigm shift in Natural Language Processing
and Artificial Intelligence. In this thesis, I explore the integration of creative capabilities into …

Image Restoration, Editing, and Assessment with Generative Artificial Intelligence

L Ji - 2024 - search.proquest.com
Modern high-performance computing (HPC) systems rely extensively on parallel
architectures, including multicore CPUs and GPUs, to fulfill the ever-growing computational …

HQ-Edit: A High-Quality Dataset for Instruction-Based Image Editing

TBI EDITING - openreview.net
This study introduces HQ-Edit, a high-quality instruction-based image editing dataset with
around 200,000 edits. Unlike prior approaches relying on attribute guidance or human …