MART: Masked affective representation learning via masked temporal distribution distillation

Z Zhang, P Zhao, E Park… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Limited training data is a long-standing problem for video emotion analysis (VEA). Existing
works leverage the power of large-scale image datasets for transferring while failing to …

ExtDM: Distribution extrapolation diffusion model for video prediction

Z Zhang, J Hu, W Cheng, D Paudel… - Proceedings of the …, 2024 - openaccess.thecvf.com
Video prediction is a challenging task due to its inherent uncertainty, especially when
forecasting over a long period. To model the temporal dynamics, advanced methods benefit from …

LAKE-RED: Camouflaged images generation by latent background knowledge retrieval-augmented diffusion

P Zhao, P Xu, P Qin, DP Fan, Z Zhang… - Proceedings of the …, 2024 - openaccess.thecvf.com
Camouflaged vision perception is an important vision task with numerous practical
applications. Due to the expensive collection and labeling costs, this community struggles …

Seeing the Unseen: A Frequency Prompt Guided Transformer for Image Restoration

S Zhou, J Pan, J Shi, D Chen, L Qu, J Yang - European Conference on …, 2025 - Springer
How to explore useful features from images as prompts to guide the deep image restoration
models is an effective way to solve image restoration. In contrast to mining spatial relations …

A Two-Stage Tone Mapping Network Based on Attention Mechanism for High Dynamic Range Images

M Zhu, H Yu, Y Chu - Authorea Preprints, 2024 - techrxiv.org
High dynamic range (HDR) imaging enhances visual realism by capturing a wide luminance
range, but displaying HDR images on devices with limited dynamic range requires effective …