Meta-learning in neural networks: A survey

T Hospedales, A Antoniou, P Micaelli… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
The field of meta-learning, or learning-to-learn, has seen a dramatic rise in interest in recent
years. Contrary to conventional approaches to AI where tasks are solved from scratch using …

PolyLoss: A polynomial expansion perspective of classification loss functions

Z Leng, M Tan, C Liu, ED Cubuk, X Shi… - arXiv preprint arXiv …, 2022 - arxiv.org
Cross-entropy loss and focal loss are the most common choices when training deep neural
networks for classification problems. Generally speaking, however, a good loss function can …
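
For context, the Poly-1 variant described in the paper adds a weighted first polynomial term, ε·(1 − Pt), to standard cross-entropy, where Pt is the probability assigned to the true class. A minimal PyTorch sketch under that reading; the epsilon value, shapes, and function name are illustrative:

```python
import torch
import torch.nn.functional as F

def poly1_cross_entropy(logits, targets, epsilon=1.0):
    """Poly-1 loss: cross-entropy plus a weighted (1 - p_t) term."""
    ce = F.cross_entropy(logits, targets, reduction="none")
    # probability the model assigns to the true class
    pt = F.softmax(logits, dim=-1).gather(-1, targets.unsqueeze(-1)).squeeze(-1)
    return (ce + epsilon * (1.0 - pt)).mean()

# usage: logits of shape (batch, classes), integer class targets
logits = torch.randn(8, 10, requires_grad=True)
targets = torch.randint(0, 10, (8,))
poly1_cross_entropy(logits, targets).backward()
```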

Sharp-MAML: Sharpness-aware model-agnostic meta learning

M Abbas, Q Xiao, L Chen, PY Chen… - … on machine learning, 2022 - proceedings.mlr.press
Model-agnostic meta-learning (MAML) is currently one of the dominant approaches for
few-shot meta-learning. Despite its effectiveness, the optimization of MAML …
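
For reference, a sketch of the standard bilevel MAML update this paper builds on: adapt on each task's support set in the inner loop, then differentiate the query loss of the adapted parameters with respect to the shared initialization. The sharpness-aware modification is only indicated in a comment, and the toy task and loss are illustrative.

```python
import torch

def maml_meta_step(params, tasks, loss_fn, inner_lr=0.01):
    """One MAML outer step: adapt on each support set, score on its query set."""
    meta_loss = 0.0
    for support, query in tasks:
        # inner loop: a single gradient step on the support set
        grads = torch.autograd.grad(loss_fn(params, support), params,
                                    create_graph=True)
        adapted = [p - inner_lr * g for p, g in zip(params, grads)]
        # Sharp-MAML would additionally perturb the parameters in the
        # ascent direction (SAM-style) around these updates.
        meta_loss = meta_loss + loss_fn(adapted, query)
    return meta_loss / len(tasks)

# toy usage: a "model" that is just a weight vector, with an MSE task loss
w = torch.zeros(3, requires_grad=True)
def mse(params, batch):
    x, y = batch
    return ((x @ params[0] - y) ** 2).mean()

tasks = [((torch.randn(5, 3), torch.randn(5)), (torch.randn(5, 3), torch.randn(5)))]
maml_meta_step([w], tasks, mse).backward()  # meta-gradient w.r.t. the initialization w
```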

Compute-efficient deep learning: Algorithmic trends and opportunities

BR Bartoldson, B Kailkhura, D Blalock - Journal of Machine Learning …, 2023 - jmlr.org
Although deep learning has made great progress in recent years, the exploding economic
and environmental costs of training neural networks are becoming unsustainable. To …

Meta-learning PINN loss functions

AF Psaros, K Kawaguchi, GE Karniadakis - Journal of Computational …, 2022 - Elsevier
We propose a meta-learning technique for offline discovery of physics-informed neural
network (PINN) loss functions. We extend earlier works on meta-learning, and develop a …
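
As background, a PINN training objective combines a PDE-residual term on interior collocation points with boundary/initial-condition terms; the paper discovers, offline, the loss applied to such terms rather than fixing it by hand. A minimal sketch for an illustrative 1D problem u'(x) = cos(x); the squared-error form below is only the conventional baseline that a learned loss would replace.

```python
import torch

def pinn_loss(model, x_interior, x_boundary, u_boundary):
    """Composite PINN loss for the illustrative ODE u'(x) = cos(x)."""
    x = x_interior.clone().requires_grad_(True)
    u = model(x)
    # du/dx via automatic differentiation, kept differentiable for training
    du_dx, = torch.autograd.grad(u, x, grad_outputs=torch.ones_like(u),
                                 create_graph=True)
    residual = du_dx - torch.cos(x)            # PDE residual at collocation points
    boundary = model(x_boundary) - u_boundary  # boundary-condition mismatch
    # squared error below is the standard baseline, not the meta-learned loss
    return (residual ** 2).mean() + (boundary ** 2).mean()

net = torch.nn.Sequential(torch.nn.Linear(1, 32), torch.nn.Tanh(),
                          torch.nn.Linear(32, 1))
x_interior = torch.rand(64, 1) * 6.28          # collocation points in [0, 2*pi]
pinn_loss(net, x_interior, torch.zeros(1, 1), torch.zeros(1, 1)).backward()
```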

A survey on evolutionary construction of deep neural networks

X Zhou, AK Qin, M Gong, KC Tan - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Automated construction of deep neural networks (DNNs) has become a research hotspot
because a DNN's performance is heavily influenced by its architecture and …

Loss function learning for domain generalization by implicit gradient

B Gao, H Gouk, Y Yang… - … Conference on Machine …, 2022 - proceedings.mlr.press
Generalising robustly to distribution shift is a major challenge that is pervasive across most
real-world applications of machine learning. A recent study highlighted that many advanced …
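
To make the title concrete: a learned loss here is a small trainable module applied to predictions and labels, whose meta-parameters are tuned on held-out domains (in this paper via implicit gradients). The parameterization below is purely illustrative, not the one used in the paper, and the implicit-gradient outer update itself is not shown.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class LearnedLoss(nn.Module):
    """Illustrative parametric loss: an MLP scores (predicted probs, one-hot label)."""
    def __init__(self, num_classes, hidden=32):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(2 * num_classes, hidden), nn.ReLU(),
                                 nn.Linear(hidden, 1))

    def forward(self, logits, targets):
        probs = F.softmax(logits, dim=-1)
        onehot = F.one_hot(targets, num_classes=probs.shape[-1]).float()
        # the meta-parameters (self.net) would be tuned on held-out domains,
        # e.g. with an implicit-gradient outer update, rather than hand-designed
        return self.net(torch.cat([probs, onehot], dim=-1)).mean()

criterion = LearnedLoss(num_classes=10)
loss = criterion(torch.randn(8, 10), torch.randint(0, 10, (8,)))
```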

Meta-tuning loss functions and data augmentation for few-shot object detection

B Demirel, OB Baran, RG Cinbis - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Few-shot object detection, the problem of modelling novel object detection categories with
few training instances, is an emerging topic in the area of few-shot learning and object …

Evolutionary optimization of deep learning activation functions

G Bingham, W Macke, R Miikkulainen - Proceedings of the 2020 Genetic …, 2020 - dl.acm.org
The choice of activation function can have a large effect on the performance of a neural
network. While there have been some attempts to hand-engineer novel activation functions …
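
To make the setting concrete, evolutionary approaches of this kind search over compositions of elementary operations and score each candidate by the validation performance of a small network trained with it. The sketch below is a deliberately simplified random-search-with-elitism loop over an illustrative operator set, not the paper's actual search space; the `fitness` callable (train-and-evaluate) is assumed to be supplied by the user.

```python
import random
import torch

# illustrative primitive sets; the paper evolves richer computation graphs
UNARY = {"identity": lambda x: x, "tanh": torch.tanh,
         "relu": torch.relu, "sigmoid": torch.sigmoid}
BINARY = {"add": lambda a, b: a + b, "mul": lambda a, b: a * b,
          "max": torch.maximum}

def random_candidate():
    """A candidate activation of the form binary(unary1(x), unary2(x))."""
    return (random.choice(list(BINARY)),
            random.choice(list(UNARY)),
            random.choice(list(UNARY)))

def build(candidate):
    b, u1, u2 = candidate
    return lambda x: BINARY[b](UNARY[u1](x), UNARY[u2](x))

def evolve(fitness, population_size=8, generations=5):
    """Random search with elitism: keep the best candidate, resample the rest.

    `fitness(candidate)` is assumed to train a small network with the candidate
    activation and return its validation accuracy.
    """
    population = [random_candidate() for _ in range(population_size)]
    best = population[0]
    for _ in range(generations):
        best = max(population, key=fitness)
        population = [best] + [random_candidate() for _ in range(population_size - 1)]
    return build(best)
```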

Discovering parametric activation functions

G Bingham, R Miikkulainen - Neural Networks, 2022 - Elsevier
Recent studies have shown that the choice of activation function can significantly affect the
performance of deep learning networks. However, the benefits of novel activation functions …
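
The core idea is to give the activation function its own trainable parameters that are optimized jointly with the network weights. A minimal sketch of that idea; the Swish-like functional form below is illustrative only, not one of the functions discovered in the paper.

```python
import torch
import torch.nn as nn

class ParametricSwish(nn.Module):
    """Swish-like unit alpha * x * sigmoid(beta * x) with trainable alpha and beta."""
    def __init__(self, alpha=1.0, beta=1.0):
        super().__init__()
        self.alpha = nn.Parameter(torch.tensor(float(alpha)))
        self.beta = nn.Parameter(torch.tensor(float(beta)))

    def forward(self, x):
        return self.alpha * x * torch.sigmoid(self.beta * x)

# the activation's parameters are optimized alongside the layer weights
model = nn.Sequential(nn.Linear(16, 32), ParametricSwish(), nn.Linear(32, 10))
```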