Recent advances on federated learning for cybersecurity and cybersecurity for federated learning for internet of things

B Ghimire, DB Rawat - IEEE Internet of Things Journal, 2022 - ieeexplore.ieee.org
Decentralized paradigm in the field of cybersecurity and machine learning (ML) for the
emerging Internet of Things (IoT) has gained a lot of attention from the government …

Multimodal image synthesis and editing: A survey and taxonomy

F Zhan, Y Yu, R Wu, J Zhang, S Lu, L Liu… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org
As information exists in various modalities in real world, effective interaction and fusion
among multimodal information plays a key role for the creation and perception of multimodal …

Make-A-Scene: Scene-based text-to-image generation with human priors

O Gafni, A Polyak, O Ashual, S Sheynin… - … on Computer Vision, 2022 - Springer
Recent text-to-image generation methods provide a simple yet exciting conversion capability
between text and image domains. While these methods have incrementally improved the …

InstantBooth: Personalized text-to-image generation without test-time finetuning

J Shi, W Xiong, Z Lin, HJ Jung - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Recent advances in personalized image generation have enabled pre-trained text-to-image
models to learn new concepts from specific image sets. However, these methods often …

MERLOT Reserve: Neural script knowledge through vision and language and sound

R Zellers, J Lu, X Lu, Y Yu, Y Zhao… - Proceedings of the …, 2022 - openaccess.thecvf.com
As humans, we navigate a multimodal world, building a holistic understanding from all our
senses. We introduce MERLOT Reserve, a model that represents videos jointly over time …

MagFace: A universal representation for face recognition and quality assessment

Q Meng, S Zhao, Z Huang… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
The performance of face recognition system degrades when the variability of the acquired
faces increases. Prior work alleviates this issue by either monitoring the face quality in pre …

Poisoning web-scale training datasets is practical

N Carlini, M Jagielski, CA Choquette-Choo… - arXiv preprint arXiv …, 2023 - arxiv.org
Deep learning models are often trained on distributed, web-scale datasets crawled from the
internet. In this paper, we introduce two new dataset poisoning attacks that intentionally …

Classifying emotions and engagement in online learning based on a single facial expression recognition neural network

AV Savchenko, LV Savchenko… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
In this article, behaviour of students in the e-learning environment is analyzed. The novel
pipeline is proposed based on video facial processing. At first, face detection, tracking and …

EMOCA: Emotion driven monocular face capture and animation

R Daněček, MJ Black, T Bolkart - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
As 3D facial avatars become more widely used for communication, it is critical that they
faithfully convey emotion. Unfortunately, the best recent methods that regress parametric 3D …

Learning an animatable detailed 3D face model from in-the-wild images

Y Feng, H Feng, MJ Black, T Bolkart - ACM Transactions on Graphics …, 2021 - dl.acm.org
While current monocular 3D face reconstruction methods can recover fine geometric details,
they suffer several limitations. Some methods produce faces that cannot be realistically …