Shallow and deep feature fusion for digital audio tampering detection

Z Wang, Y Yang, C Zeng, S Kong, S Feng… - EURASIP Journal on …, 2022 - Springer
Digital audio tampering detection can be used to verify the authenticity of digital audio.
However, most current methods use standard electronic network frequency (ENF) databases …

Deletion and insertion tampering detection for speech authentication based on fluctuating super vector of electrical network frequency

C Zeng, S Kong, Z Wang, S Feng, N Zhao… - Speech …, 2024 - Elsevier
The current digital speech deletion and insertion tampering detection methods mainly
employ the extraction of phase and frequency features of the Electrical Network …

Spatio-temporal representation learning enhanced source cell-phone recognition from speech recordings

C Zeng, S Feng, Z Wang, X Wan, Y Chen… - Journal of Information …, 2024 - Elsevier
The existing source cell-phone recognition method lacks the long-term feature
characterization of the source device, resulting in an inaccurate representation of the source …

Photovoltaic panel defect detection based on ghost convolution with BottleneckCSP and tiny target prediction head incorporating YOLOv5

L Li, Z Wang, T Zhang - arXiv preprint arXiv:2303.00886, 2023 - arxiv.org
Photovoltaic (PV) panel surface-defect detection technology is crucial for the PV industry to
perform smart maintenance. Using computer vision technology to detect PV panel surface …

Digital audio tampering detection based on spatio-temporal representation learning of electrical network frequency

C Zeng, S Kong, Z Wang, K Li, Y Zhao, X Wan… - Multimedia Tools and …, 2024 - Springer
The majority of Digital Audio Tampering Detection (DATD) methods, which are
based on Electrical Network Frequency (ENF), predominantly concentrate on the static …

Learning behavior recognition in smart classroom with multiple students based on YOLOv5

Z Wang, J Yao, C Zeng, W Wu, H Xu, Y Yang - arXiv preprint arXiv …, 2023 - arxiv.org
Deep learning-based computer vision technology has grown stronger in recent years, and
cross-fertilization using computer vision technology has been a popular direction in recent …

DSARSR: Deep Stacked Auto-encoders Enhanced Robust Speaker Recognition

Z Wang, C Zeng, S Duan, H Ouyang, H Xu - arXiv preprint arXiv …, 2023 - arxiv.org
Speaker recognition is a biometric modality that utilizes the speaker's speech segments to
recognize the identity, determining whether the test speaker belongs to one of the enrolled …

Multi-Scale Deformable Transformers for Student Learning Behavior Detection in Smart Classroom

Z Wang, M Wang, C Zeng, L Li - arXiv preprint arXiv:2410.07834, 2024 - arxiv.org
The integration of Artificial Intelligence into the modern educational system is rapidly
evolving, particularly in monitoring student behavior in classrooms, a task traditionally …

A Low-Cost Detail-Aware Neural Network Framework and Its Application in Mask Wearing Monitoring

S Cao, S Long, F Liao - Applied Sciences, 2023 - mdpi.com
The use of deep learning techniques in real-time monitoring can save a lot of manpower in
various scenarios. For example, mask-wearing is an effective measure to prevent COVID-19 …

DKT-STDRL: Spatial and Temporal Representation Learning Enhanced Deep Knowledge Tracing for Learning Performance Prediction

L Lyu, Z Wang, H Yun, Z Yang, Y Li - arXiv preprint arXiv:2302.11569, 2023 - arxiv.org
Knowledge tracing (KT) serves as a primary part of intelligent education systems. Most
current KTs either rely on expert judgments or only exploit a single network structure, which …