Rank-DETR for high quality object detection

Y Pu, W Liang, Y Hao, Y Yuan… - Advances in …, 2024 - proceedings.neurips.cc
Modern detection transformers (DETRs) use a set of object queries to predict a list of
bounding boxes, sort them by their classification confidence scores, and select the top …

Prompt-free diffusion: Taking" text" out of text-to-image diffusion models

X Xu, J Guo, Z Wang, G Huang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract Text-to-image (T2I) research has grown explosively in the past year owing to the
large-scale pre-trained diffusion models and many emerging personalization and editing …

Strategic preys make acute predators: Enhancing camouflaged object detectors by generating camouflaged objects

C He, K Li, Y Zhang, Y Zhang, Z Guo, X Li… - arXiv preprint arXiv …, 2023 - arxiv.org
Camouflaged object detection (COD) is the challenging task of identifying camouflaged
objects visually blended into surroundings. Albeit achieving remarkable success, existing …

Smooth diffusion: Crafting smooth latent spaces in diffusion models

J Guo, X Xu, Y Pu, Z Ni, C Wang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Recently diffusion models have made remarkable progress in text-to-image (T2I) generation
synthesizing images with high fidelity and diverse contents. Despite this advancement latent …

Visual-Augmented Dynamic Semantic Prototype for Generative Zero-Shot Learning

W Hou, S Chen, S Chen, Z Hong… - Proceedings of the …, 2024 - openaccess.thecvf.com
Generative Zero-shot learning (ZSL) learns a generator to synthesize visual samples for
unseen classes which is an effective way to advance ZSL. However existing generative …

A survey on generative modeling with limited data, few shots, and zero shot

M Abdollahzadeh, T Malekzadeh, CTH Teo… - arXiv preprint arXiv …, 2023 - arxiv.org
In machine learning, generative modeling aims to learn to generate new data statistically
similar to the training data distribution. In this paper, we survey learning generative models …

Faceclip: Facial image-to-video translation via a brief text description

J Guo, H Manukyan, C Yang, C Wang… - … on Circuits and …, 2023 - ieeexplore.ieee.org
The existing image-to-video translation methods generally follow a frame-by-frame
generative paradigm, while extracting the temporal information from a reference video or an …

FlashEval: Towards Fast and Accurate Evaluation of Text-to-image Diffusion Generative Models

L Zhao, T Zhao, Z Lin, X Ning, G Dai… - Proceedings of the …, 2024 - openaccess.thecvf.com
In recent years there has been significant progress in the development of text-to-image
generative models. Evaluating the quality of the generative models is one essential step in …

Applications of Generative AI (GAI) for Mobile and Wireless Networking: A Survey

TH Vu, SK Jagatheesaperumal, MD Nguyen… - arXiv preprint arXiv …, 2024 - arxiv.org
The success of Artificial Intelligence (AI) in multiple disciplines and vertical domains in
recent years has promoted the evolution of mobile networking and the future Internet toward …

GRA: Detecting Oriented Objects through Group-wise Rotating and Attention

J Wang, Y Pu, Y Han, J Guo, Y Wang, X Li… - arXiv preprint arXiv …, 2024 - arxiv.org
Oriented object detection, an emerging task in recent years, aims to identify and locate
objects across varied orientations. This requires the detector to accurately capture the …