Spectrogram transformers for audio classification

Y Zhang, B Li, H Fang, Q Meng - 2022 IEEE International …, 2022 - ieeexplore.ieee.org
Audio classification is an important task in the machine learning field with a wide range of
applications. Since the last decade, deep learning based methods have been widely used …

Task-driven common subspace learning based semantic feature extraction for acoustic event recognition

Q Shi, S Deng, J Han - Expert Systems with Applications, 2023 - Elsevier
For acoustic event recognition (AER), it is important to extract the semantic feature that
considers both the content information and the temporal ordering. To this end, our previous …

Multimodal knowledge graph construction of Chinese traditional operas and sentiment and genre recognition

T Fan, H Wang, T Hodel - Journal of Cultural Heritage, 2023 - Elsevier
The advancement of digital technologies promotes the documentation of traditional operas,
leaving a large amount of data but in a state of fragmentation. Constructing a knowledge …

Combined query embroidery image retrieval based on enhanced CNN and blend transformer

X Zhuo, D Huang, Y Lin, Z Huang - Scientific Reports, 2024 - nature.com
Embroidery images carry rich historical information and are an important form of embroidery
art. In the field of combination query image retrieval, how to efficiently retrieve the …

[HTML][HTML] An IoT-enhanced automatic music composition system integrating audio-visual learning with transformer and SketchVAE

Y Zhang - Alexandria Engineering Journal, 2025 - Elsevier
With the rapid development of artificial intelligence and the Internet of Things technology, the
automatic music composition system has become a hot topic of research. This paper …

Inductive Bias Integration for Transformer Enhancement in Small-scale Segmentation Tasks

L Wang, Z Niu, B Wang, G Li, L Li - Proceedings of the 2024 5th …, 2024 - dl.acm.org
Transformers have made substantial contributions to various computer vision tasks,
leveraging their attention mechanisms. However, their performance tends to be deficient on …

Machine Learning-based Energy Consumption Model for Data Center

L Qiao, Y Yu, Q Wang, Y Zhang… - 2023 35th Chinese …, 2023 - ieeexplore.ieee.org
The accurate prediction of time series data in the industrial production process can provide
important guidance for the scheduling and decision-making of industrial systems, and is also …

[PDF][PDF] Investigating the Impact of Patching Methods on the Use of Transformer-based Image Classification Models for Audio Classification

S Zhang, L Li, Y Ohishi, D Takeuchi, D Niizumi… - s.makino.w.waseda.jp
Audio Classification Task [1] is a task that aims to classify audio signals into different
categories. For example, audio can be classified as the human voice, animal voice, music …