Lsun: Construction of a large-scale image dataset using deep learning with humans in the loop

X Wu, L Xiao, Y Sun, J Zhang, T Ma, L He - Future Generation Computer …, 2022 - Elsevier

Abstract Machine learning has become the state-of-the-art technique for many tasks
including computer vision, natural language processing, speech processing tasks, etc …

被引用次数：424 相关文章所有 6 个版本

[PDF] arxiv.org

Software testing with large language models: Survey, landscape, and vision

J Wang, Y Huang, C Chen, Z Liu… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

Pre-trained large language models (LLMs) have recently emerged as a breakthrough
technology in natural language processing and artificial intelligence, with the ability to …

被引用次数：98 相关文章所有 7 个版本

[PDF] thecvf.com

Scaling up gans for text-to-image synthesis

M Kang, JY Zhu, R Zhang, J Park… - Proceedings of the …, 2023 - openaccess.thecvf.com

The recent success of text-to-image synthesis has taken the world by storm and captured the
general public's imagination. From a technical standpoint, it also marked a drastic change in …

被引用次数：302 相关文章所有 6 个版本

[PDF] thecvf.com

Score jacobian chaining: Lifting pretrained 2d diffusion models for 3d generation

H Wang, X Du, J Li, RA Yeh… - Proceedings of the …, 2023 - openaccess.thecvf.com

A diffusion model learns to predict a vector field of gradients. We propose to apply chain rule
on the learned gradients, and back-propagate the score of a diffusion model through the …

被引用次数：339 相关文章所有 6 个版本

[PDF] thecvf.com

Open-vocabulary panoptic segmentation with text-to-image diffusion models

J Xu, S Liu, A Vahdat, W Byeon… - Proceedings of the …, 2023 - openaccess.thecvf.com

We present ODISE: Open-vocabulary DIffusion-based panoptic SEgmentation, which unifies
pre-trained text-image diffusion and discriminative models to perform open-vocabulary …

被引用次数：255 相关文章所有 6 个版本

[PDF] arxiv.org

Consistency models

Y Song, P Dhariwal, M Chen, I Sutskever - arXiv preprint arXiv:2303.01469, 2023 - arxiv.org

Diffusion models have significantly advanced the fields of image, audio, and video
generation, but they depend on an iterative sampling process that causes slow generation …

被引用次数：416 相关文章所有 9 个版本

[PDF] thecvf.com

Depth anything: Unleashing the power of large-scale unlabeled data

L Yang, B Kang, Z Huang, X Xu… - Proceedings of the …, 2024 - openaccess.thecvf.com

Abstract This work presents Depth Anything a highly practical solution for robust monocular
depth estimation. Without pursuing novel technical modules we aim to build a simple yet …

被引用次数：104 相关文章所有 6 个版本

[PDF] neurips.cc

Dpm-solver: A fast ode solver for diffusion probabilistic model sampling in around 10 steps

C Lu, Y Zhou, F Bao, J Chen, C Li… - Advances in Neural …, 2022 - proceedings.neurips.cc

Diffusion probabilistic models (DPMs) are emerging powerful generative models. Despite
their high-quality generation performance, DPMs still suffer from their slow sampling as they …

被引用次数：743 相关文章所有 6 个版本

[PDF] mlr.press

Out-of-distribution detection with deep nearest neighbors

Y Sun, Y Ming, X Zhu, Y Li - International Conference on …, 2022 - proceedings.mlr.press

Abstract Out-of-distribution (OOD) detection is a critical task for deploying machine learning
models in the open world. Distance-based methods have demonstrated promise, where …

被引用次数：341 相关文章所有 4 个版本

[PDF] neurips.cc

Improving diffusion models for inverse problems using manifold constraints

H Chung, B Sim, D Ryu, JC Ye - Advances in Neural …, 2022 - proceedings.neurips.cc

Recently, diffusion models have been used to solve various inverse problems in an
unsupervised manner with appropriate modifications to the sampling process. However, the …

被引用次数：224 相关文章所有 8 个版本

高级搜索

QQ 群