A review of deep learning techniques for speech processing

A Mehrish, N Majumder, R Bharadwaj, R Mihalcea… - Information …, 2023 - Elsevier
The field of speech processing has undergone a transformative shift with the advent of deep
learning. The use of multiple processing layers has enabled the creation of models capable …

A survey of convolutional neural networks: analysis, applications, and prospects

Z Li, F Liu, W Yang, S Peng… - IEEE transactions on …, 2021 - ieeexplore.ieee.org
A convolutional neural network (CNN) is one of the most significant networks in the deep
learning field. Since CNN made impressive achievements in many areas, including but not …

Adaface: Quality adaptive margin for face recognition

M Kim, AK Jain, X Liu - … of the IEEE/CVF conference on …, 2022 - openaccess.thecvf.com
Recognition in low quality face datasets is challenging because facial attributes are
obscured and degraded. Advances in margin-based loss functions have resulted in …

Mitigating neural network overconfidence with logit normalization

H Wei, R Xie, H Cheng, L Feng… - … conference on machine …, 2022 - proceedings.mlr.press
Detecting out-of-distribution inputs is critical for the safe deployment of machine learning
models in the real world. However, neural networks are known to suffer from the …

Multimodal foundation models: From specialists to general-purpose assistants

C Li, Z Gan, Z Yang, J Yang, L Li… - … and Trends® in …, 2024 - nowpublishers.com
Neural compression is the application of neural networks and other machine learning
methods to data compression. Recent advances in statistical machine learning have opened …

Magface: A universal representation for face recognition and quality assessment

Q Meng, S Zhao, Z Huang… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
The performance of face recognition system degrades when the variability of the acquired
faces increases. Prior work alleviates this issue by either monitoring the face quality in pre …

Fsce: Few-shot object detection via contrastive proposal encoding

B Sun, B Li, S Cai, Y Yuan… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
Emerging interests have been brought to recognize previously unseen objects given very
few training examples, known as few-shot object detection (FSOD). Recent researches …

Elasticface: Elastic margin loss for deep face recognition

F Boutros, N Damer… - Proceedings of the …, 2022 - openaccess.thecvf.com
Learning discriminative face features plays a major role in building high-performing face
recognition models. The recent state-of-the-art face recognition solutions proposed to …

Frequency domain model augmentation for adversarial attack

Y Long, Q Zhang, B Zeng, L Gao, X Liu, J Zhang… - European conference on …, 2022 - Springer
For black-box attacks, the gap between the substitute model and the victim model is usually
large, which manifests as a weak attack performance. Motivated by the observation that the …

Deep discriminative transfer learning network for cross-machine fault diagnosis

Q Qian, Y Qin, J Luo, Y Wang, F Wu - Mechanical Systems and Signal …, 2023 - Elsevier
Many domain adaptation methods have been presented to deal with the distribution
alignment and knowledge transfer between the target domain and the source domain …