Detailclip: Detail-oriented clip for fine-grained tasks

AK Monsefi, KP Sailaja, A Alilooee, SN Lim… - arXiv preprint arXiv …, 2024 - arxiv.org
In this paper, we introduce DetailCLIP: A Detail-Oriented CLIP to address the limitations of
contrastive learning-based vision-language models, particularly CLIP, in handling detail …

HiMTM: Hierarchical Multi-Scale Masked Time Series Modeling for Long-Term Forecasting

S Zhao, M Jin, Z Hou, C Yang, Z Li, Q Wen… - arXiv preprint arXiv …, 2024 - arxiv.org
Time series forecasting is crucial and challenging in the real world. The recent surge in
interest regarding time series foundation models, which cater to a diverse array of …

HiMTM: Hierarchical Multi-Scale Masked Time Series Modeling with Self-Distillation for Long-Term Forecasting

S Zhao, M Jin, Z Hou, C Yang, Z Li, Q Wen… - Proceedings of the 33rd …, 2024 - dl.acm.org
Time series forecasting is a critical and challenging task in practical application. Recent
advancements in pre-trained foundation models for time series forecasting have gained …

Mind your indices! Index hijacking attacks on collaborative unpooling autoencoder systems

K Lee, J Yun, J Jin, J Han, JG Ko - Internet of Things, 2025 - Elsevier
Autoencoder model architectures are attractive approaches for implementing intelligent
mobile/IoT sensing applications. This is attributed to their capability of offering efficient …

Frequency-Guided Masking for Enhanced Vision Self-Supervised Learning

AK Monsefi, M Zhou, NK Monsefi, SN Lim… - arXiv preprint arXiv …, 2024 - arxiv.org
We present a novel frequency-based Self-Supervised Learning (SSL) approach that
significantly enhances its efficacy for pre-training. Prior work in this direction masks out pre …

Wearable Accelerometer Foundation Models for Health via Knowledge Distillation

S Abbaspourazad, A Mishra, J Futoma… - arXiv preprint arXiv …, 2024 - arxiv.org
Modern wearable devices can conveniently and continuously record various biosignals in
the many different environments of daily living, ultimately enabling a rich view of individual …

Promoting cross-modal representations to improve multimodal foundation models for physiological signals

C Fang, C Sandino, B Mahasseni, J Minxha… - arXiv preprint arXiv …, 2024 - arxiv.org
Many healthcare applications are inherently multimodal, involving several physiological
signals. As sensors for these signals become more common, improving machine learning …

Radar Signal Recognition through Self-Supervised Learning and Domain Adaptation

Z Huang, A Pemasiri, S Denman, C Fookes… - arXiv preprint arXiv …, 2025 - arxiv.org
Automatic radar signal recognition (RSR) plays a pivotal role in electronic warfare (EW), as
accurately classifying radar signals is critical for informing decision-making processes …

Balanced Data, Imbalanced Spectra: Unveiling Class Disparities with Spectral Imbalance

C Kaushik, R Liu, CH Lin, A Khera, MY Jin… - arXiv preprint arXiv …, 2024 - arxiv.org
Classification models are expected to perform equally well for different classes, yet in
practice, there are often large gaps in their performance. This issue of class bias is widely …

Embedded Multi-Sensor Smartwatch for Computationally Intensive Biosignal Processing

PM Rapa, M Orlandi, M Zanghieri… - … Circuits and Systems …, 2024 - ieeexplore.ieee.org
Over the past decades, the rapid expansion of the wearable market has led to significant
advances in integrated devices and technologies, particularly in smartwatches. These …