A comprehensive survey on applications of transformers for deep learning tasks

S Islam, H Elmekki, A Elsebai, J Bentahar… - Expert Systems with …, 2024 - Elsevier
Abstract Transformers are Deep Neural Networks (DNN) that utilize a self-attention
mechanism to capture contextual relationships within sequential data. Unlike traditional …

Deep generative modelling: A comparative review of vaes, gans, normalizing flows, energy-based and autoregressive models

S Bond-Taylor, A Leach, Y Long… - IEEE transactions on …, 2021 - ieeexplore.ieee.org
Deep generative models are a class of techniques that train deep neural networks to model
the distribution of training samples. Research has fragmented into various interconnected …

Zero-shot image-to-image translation

G Parmar, K Kumar Singh, R Zhang, Y Li, J Lu… - ACM SIGGRAPH 2023 …, 2023 - dl.acm.org
Large-scale text-to-image generative models have shown their remarkable ability to
synthesize diverse, high-quality images. However, directly applying these models for real …

Spatext: Spatio-textual representation for controllable image generation

O Avrahami, T Hayes, O Gafni… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recent text-to-image diffusion models are able to generate convincing results of
unprecedented quality. However, it is nearly impossible to control the shapes of different …

Make-a-scene: Scene-based text-to-image generation with human priors

O Gafni, A Polyak, O Ashual, S Sheynin… - … on Computer Vision, 2022 - Springer
Recent text-to-image generation methods provide a simple yet exciting conversion capability
between text and image domains. While these methods have incrementally improved the …

Egsde: Unpaired image-to-image translation via energy-guided stochastic differential equations

M Zhao, F Bao, C Li, J Zhu - Advances in Neural …, 2022 - proceedings.neurips.cc
Score-based diffusion models (SBDMs) have achieved the SOTA FID results in unpaired
image-to-image translation (I2I). However, we notice that existing methods totally ignore the …

Alias-free generative adversarial networks

T Karras, M Aittala, S Laine… - Advances in neural …, 2021 - proceedings.neurips.cc
We observe that despite their hierarchical convolutional nature, the synthesis process of
typical generative adversarial networks depends on absolute pixel coordinates in an …

Unsupervised medical image translation with adversarial diffusion models

M Özbey, O Dalmaz, SUH Dar, HA Bedel… - … on Medical Imaging, 2023 - ieeexplore.ieee.org
Imputation of missing images via source-to-target modality translation can improve diversity
in medical imaging protocols. A pervasive approach for synthesizing target images involves …

Towards language-free training for text-to-image generation

Y Zhou, R Zhang, C Chen, C Li… - Proceedings of the …, 2022 - openaccess.thecvf.com
One of the major challenges in training text-to-image generation models is the need of a
large number of high-quality text-image pairs. While image samples are often easily …

Deep clustering: A comprehensive survey

Y Ren, J Pu, Z Yang, J Xu, G Li, X Pu… - … on Neural Networks …, 2024 - ieeexplore.ieee.org
Cluster analysis plays an indispensable role in machine learning and data mining. Learning
a good data representation is crucial for clustering algorithms. Recently, deep clustering …