Solving attention kernel regression problem via pre-conditioner

Z Song, J Yin, L Zhang - International Conference on …, 2024 - proceedings.mlr.press
Attention mechanism is the key to large language models, and attention matrix serves as an
algorithmic and computational bottleneck for such a scheme. In this paper, we define two …

Structural attention enhanced continual meta-learning for graph edge labeling based few-shot remote sensing scene classification

F Li, S Li, X Fan, X Li, H Chang - Remote Sensing, 2022 - mdpi.com
Scene classification is one of the fundamental techniques shared by many basic remote
sensing tasks with a wide range of applications. As the demands of catering with situations …

Hierarchically structured task-agnostic continual learning

H Hihn, DA Braun - Machine Learning, 2023 - Springer
One notable weakness of current machine learning algorithms is the poor ability of models
to solve new problems without forgetting previously acquired knowledge. The Continual …

[PDF][PDF] Coral: Continual representation learning for overcoming catastrophic forgetting

MS Yasar, T Iqbal - … of the 2023 International Conference on …, 2023 - southampton.ac.uk
Humans have the ability to acquire, retain and transfer knowledge over their lifespan. For
intelligent agents to achieve fluent longitudinal interaction, they need to continually retain …

Avoiding forgetting and allowing forward transfer in continual learning via sparse networks

G Sokar, DC Mocanu, M Pechenizkiy - Joint European Conference on …, 2022 - Springer
Using task-specific components within a neural network in continual learning (CL) is a
compelling strategy to address the stability-plasticity dilemma in fixed-capacity models …

Mixture-of-variational-experts for continual learning

H Hihn, DA Braun - arXiv preprint arXiv:2110.12667, 2021 - arxiv.org
One weakness of machine learning algorithms is the poor ability of models to solve new
problems without forgetting previously acquired knowledge. The Continual Learning (CL) …

[HTML][HTML] Few-shot incremental radar target recognition framework based on scattering-topology properties

LI Chenxuan, ZHU Weigang, ZHU Bakun… - Chinese Journal of …, 2024 - Elsevier
The continuous emergence of new targets in open scenarios leads to a substantial decrease
in the performance of Inverse Synthetic Aperture Radar (ISAR) recognition systems. Also …

Progressive Learning With Recurrent Neural Network for Sequence Classification

RR Karn, J Knechtel… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Progressive learning is a deep learning framework in which tasks are learned sequentially,
with the capacity to leverage past knowledge from previously acquired tasks to aid in the …

Online continual learning using enhanced random vector functional link networks

CSY Wong, G Yang, A Ambikapathi… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
We propose an online continual learning algorithm based on an enhanced Random Vector
Functional Link Network (OCL-eRVFL), that learns a sequence of tasks continually, where …

Negotiated Representations for Machine Mearning Application

N Korhan, S Bayram - arXiv preprint arXiv:2311.11410, 2023 - arxiv.org
Overfitting is a phenomenon that occurs when a machine learning model is trained for too
long and focused too much on the exact fitness of the training samples to the provided …