VGGFace2: A dataset for recognising faces across pose and age

Recent advances on federated learning for cybersecurity and cybersecurity for federated learning for internet of things

B Ghimire, DB Rawat - IEEE Internet of Things Journal, 2022 - ieeexplore.ieee.org

Decentralized paradigm in the field of cybersecurity and machine learning (ML) for the
emerging Internet of Things (IoT) has gained a lot of attention from the government …

Cited by 255 Related articles All 4 versions

Multimodal image synthesis and editing: A survey and taxonomy

F Zhan, Y Yu, R Wu, J Zhang, S Lu, L Liu… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org

As information exists in various modalities in real world, effective interaction and fusion
among multimodal information plays a key role for the creation and perception of multimodal …

Cited by 181 Related articles All 11 versions

Make-A-Scene: Scene-based text-to-image generation with human priors

O Gafni, A Polyak, O Ashual, S Sheynin… - … on Computer Vision, 2022 - Springer

Recent text-to-image generation methods provide a simple yet exciting conversion capability
between text and image domains. While these methods have incrementally improved the …

Cited by 383 Related articles All 4 versions

InstantBooth: Personalized text-to-image generation without test-time finetuning

J Shi, W Xiong, Z Lin, HJ Jung - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com

Recent advances in personalized image generation have enabled pre-trained text-to-image
models to learn new concepts from specific image sets. However these methods often …

Cited by 127 Related articles All 3 versions

MERLOT Reserve: Neural script knowledge through vision and language and sound

R Zellers, J Lu, X Lu, Y Yu, Y Zhao… - Proceedings of the …, 2022 - openaccess.thecvf.com

As humans, we navigate a multimodal world, building a holistic understanding from all our
senses. We introduce MERLOT Reserve, a model that represents videos jointly over time …

Cited by 213 Related articles All 9 versions

MagFace: A universal representation for face recognition and quality assessment

Q Meng, S Zhao, Z Huang… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com

The performance of face recognition system degrades when the variability of the acquired
faces increases. Prior work alleviates this issue by either monitoring the face quality in pre …

Cited by 513 Related articles All 7 versions

Poisoning web-scale training datasets is practical

N Carlini, M Jagielski, CA Choquette-Choo… - arXiv preprint arXiv …, 2023 - arxiv.org

Deep learning models are often trained on distributed, webscale datasets crawled from the
internet. In this paper, we introduce two new dataset poisoning attacks that intentionally …

Cited by 106 Related articles All 6 versions

Classifying emotions and engagement in online learning based on a single facial expression recognition neural network

AV Savchenko, LV Savchenko… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

In this article, behaviour of students in the e-learning environment is analyzed. The novel
pipeline is proposed based on video facial processing. At first, face detection, tracking and …

Cited by 171 Related articles All 2 versions

EMOCA: Emotion driven monocular face capture and animation

R Daněček, MJ Black, T Bolkart - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com

As 3D facial avatars become more widely used for communication, it is critical that they
faithfully convey emotion. Unfortunately, the best recent methods that regress parametric 3D …

Cited by 129 Related articles All 8 versions

Learning an animatable detailed 3D face model from in-the-wild images

Y Feng, H Feng, MJ Black, T Bolkart - ACM Transactions on Graphics …, 2021 - dl.acm.org

While current monocular 3D face reconstruction methods can recover fine geometric details,
they suffer several limitations. Some methods produce faces that cannot be realistically …

Cited by 480 Related articles All 9 versions
