查看文章

plos.org 中的 [HTML]

pyaudioanalysis: An open-source python library for audio signal analysis

作者

Theodoros Giannakopoulos

发表日期

2015/12/11

期刊

PloS one

卷号

期号

页码范围

e0144610

出版商

Public Library of Science

简介

Audio information plays a rather important role in the increasing digital content that is available today, resulting in a need for methodologies that automatically analyze such content: audio event recognition for home automations and surveillance systems, speech recognition, music information retrieval, multimodal analysis (e.g. audio-visual analysis of online videos for content-based recommendation), etc. This paper presents pyAudioAnalysis, an open-source Python library that provides a wide range of audio analysis procedures including: feature extraction, classification of audio signals, supervised and unsupervised segmentation and content visualization. pyAudioAnalysis is licensed under the Apache License and is available at GitHub (https://github.com/tyiannak/pyAudioAnalysis/). Here we present the theoretical background behind the wide range of the implemented methodologies, along with evaluation metrics for some of the methods. pyAudioAnalysis has been already used in several audio analysis research applications: smart-home functionalities through audio event detection, speech emotion recognition, depression classification based on audio-visual features, music segmentation, multimodal content-based movie recommendation and health applications (e.g. monitoring eating habits). The feedback provided from all these particular audio applications has led to practical enhancement of the library.

引用总数

被引用次数：540

2016201720182019202020212022202320249 39 64 75 86 92 78 62 27

学术搜索中的文章

pyaudioanalysis: An open-source python library for audio signal analysis

T Giannakopoulos - PloS one, 2015

被引用次数：540 相关文章所有 16 个版本