Unleashing the power of edge-cloud generative ai in mobile networks: A survey of aigc services

M Xu, H Du, D Niyato, J Kang, Z Xiong… - … Surveys & Tutorials, 2024 - ieeexplore.ieee.org
Artificial Intelligence-Generated Content (AIGC) is an automated method for generating,
manipulating, and modifying valuable and diverse data using AI algorithms creatively. This …

Study on artificial intelligence: The state of the art and future prospects

C Zhang, Y Lu - Journal of Industrial Information Integration, 2021 - Elsevier
In the world, the technological and industrial revolution is accelerating by the widespread
application of new generation information and communication technologies, such as AI, IoT …

Scaling vision transformers to 22 billion parameters

M Dehghani, J Djolonga, B Mustafa… - International …, 2023 - proceedings.mlr.press
The scaling of Transformers has driven breakthrough capabilities for language models. At
present, the largest large language models (LLMs) contain upwards of 100B parameters …

Dpm-solver: A fast ode solver for diffusion probabilistic model sampling in around 10 steps

C Lu, Y Zhou, F Bao, J Chen, C Li… - Advances in Neural …, 2022 - proceedings.neurips.cc
Diffusion probabilistic models (DPMs) are emerging powerful generative models. Despite
their high-quality generation performance, DPMs still suffer from their slow sampling as they …

[PDF][PDF] The dawn of lmms: Preliminary explorations with gpt-4v (ision)

Z Yang, L Li, K Lin, J Wang, CC Lin… - arXiv preprint arXiv …, 2023 - stableaiprompts.com
Large multimodal models (LMMs) extend large language models (LLMs) with multi-sensory
skills, such as visual understanding, to achieve stronger generic intelligence. In this paper …

Diffusion art or digital forgery? investigating data replication in diffusion models

G Somepalli, V Singla, M Goldblum… - Proceedings of the …, 2023 - openaccess.thecvf.com
Cutting-edge diffusion models produce images with high quality and customizability,
enabling them to be used for commercial art and graphic design purposes. But do diffusion …

Mm-react: Prompting chatgpt for multimodal reasoning and action

Z Yang, L Li, J Wang, K Lin, E Azarnasab… - arXiv preprint arXiv …, 2023 - arxiv.org
We propose MM-REACT, a system paradigm that integrates ChatGPT with a pool of vision
experts to achieve multimodal reasoning and action. In this paper, we define and explore a …

Adaface: Quality adaptive margin for face recognition

M Kim, AK Jain, X Liu - … of the IEEE/CVF conference on …, 2022 - openaccess.thecvf.com
Recognition in low quality face datasets is challenging because facial attributes are
obscured and degraded. Advances in margin-based loss functions have resulted in …

Hyperdreambooth: Hypernetworks for fast personalization of text-to-image models

N Ruiz, Y Li, V Jampani, W Wei, T Hou… - Proceedings of the …, 2024 - openaccess.thecvf.com
Personalization has emerged as a prominent aspect within the field of generative AI
enabling the synthesis of individuals in diverse contexts and styles while retaining high …

Repaint: Inpainting using denoising diffusion probabilistic models

A Lugmayr, M Danelljan, A Romero… - Proceedings of the …, 2022 - openaccess.thecvf.com
Free-form inpainting is the task of adding new content to an image in the regions specified
by an arbitrary binary mask. Most existing approaches train for a certain distribution of …