Deep learning face attributes in the wild

M Xu, H Du, D Niyato, J Kang, Z Xiong… - … Surveys & Tutorials, 2024 - ieeexplore.ieee.org

Artificial Intelligence-Generated Content (AIGC) is an automated method for generating,
manipulating, and modifying valuable and diverse data using AI algorithms creatively. This …

被引用次数：115 相关文章所有 5 个版本

Study on artificial intelligence: The state of the art and future prospects

C Zhang, Y Lu - Journal of Industrial Information Integration, 2021 - Elsevier

In the world, the technological and industrial revolution is accelerating by the widespread
application of new generation information and communication technologies, such as AI, IoT …

被引用次数：826 相关文章所有 3 个版本

[PDF] mlr.press

Scaling vision transformers to 22 billion parameters

M Dehghani, J Djolonga, B Mustafa… - International …, 2023 - proceedings.mlr.press

The scaling of Transformers has driven breakthrough capabilities for language models. At
present, the largest large language models (LLMs) contain upwards of 100B parameters …

被引用次数：338 相关文章所有 9 个版本

[PDF] neurips.cc

Dpm-solver: A fast ode solver for diffusion probabilistic model sampling in around 10 steps

C Lu, Y Zhou, F Bao, J Chen, C Li… - Advances in Neural …, 2022 - proceedings.neurips.cc

Diffusion probabilistic models (DPMs) are emerging powerful generative models. Despite
their high-quality generation performance, DPMs still suffer from their slow sampling as they …

被引用次数：772 相关文章所有 6 个版本

[PDF] stableaiprompts.com

[PDF][PDF] The dawn of lmms: Preliminary explorations with gpt-4v (ision)

Z Yang, L Li, K Lin, J Wang, CC Lin… - arXiv preprint arXiv …, 2023 - stableaiprompts.com

Large multimodal models (LMMs) extend large language models (LLMs) with multi-sensory
skills, such as visual understanding, to achieve stronger generic intelligence. In this paper …

被引用次数：324 相关文章所有 3 个版本

[PDF] thecvf.com

Diffusion art or digital forgery? investigating data replication in diffusion models

G Somepalli, V Singla, M Goldblum… - Proceedings of the …, 2023 - openaccess.thecvf.com

Cutting-edge diffusion models produce images with high quality and customizability,
enabling them to be used for commercial art and graphic design purposes. But do diffusion …

被引用次数：191 相关文章所有 6 个版本

[PDF] arxiv.org

Mm-react: Prompting chatgpt for multimodal reasoning and action

Z Yang, L Li, J Wang, K Lin, E Azarnasab… - arXiv preprint arXiv …, 2023 - arxiv.org

We propose MM-REACT, a system paradigm that integrates ChatGPT with a pool of vision
experts to achieve multimodal reasoning and action. In this paper, we define and explore a …

被引用次数：240 相关文章所有 2 个版本

[PDF] thecvf.com

Adaface: Quality adaptive margin for face recognition

M Kim, AK Jain, X Liu - … of the IEEE/CVF conference on …, 2022 - openaccess.thecvf.com

Recognition in low quality face datasets is challenging because facial attributes are
obscured and degraded. Advances in margin-based loss functions have resulted in …

被引用次数：332 相关文章所有 7 个版本

[PDF] thecvf.com

Hyperdreambooth: Hypernetworks for fast personalization of text-to-image models

N Ruiz, Y Li, V Jampani, W Wei, T Hou… - Proceedings of the …, 2024 - openaccess.thecvf.com

Personalization has emerged as a prominent aspect within the field of generative AI
enabling the synthesis of individuals in diverse contexts and styles while retaining high …

被引用次数：80 相关文章所有 3 个版本

[PDF] thecvf.com

Repaint: Inpainting using denoising diffusion probabilistic models

A Lugmayr, M Danelljan, A Romero… - Proceedings of the …, 2022 - openaccess.thecvf.com

Free-form inpainting is the task of adding new content to an image in the regions specified
by an arbitrary binary mask. Most existing approaches train for a certain distribution of …

被引用次数：1041 相关文章所有 8 个版本

高级搜索

QQ 群