Signal processing methods for music transcription

A Klapuri, M Davy - 2007 - books.google.com
Signal Processing Methods for Music Transcription is the first book dedicated to uniting
research related to signal processing algorithms and models for various aspects of music …

The cocktail party problem

S Haykin, Z Chen - Neural Computation, 2005 - direct.mit.edu
This review presents an overview of a challenging problem in auditory perception, the
cocktail party phenomenon, the delineation of which goes back to a classic paper by Cherry …

[BOOK][B] Computational auditory scene analysis: Principles, algorithms, and applications

DL Wang, GJ Brown - 2006 - dl.acm.org

A real-time music-scene-description system: Predominant-F0 estimation for detecting melody and bass lines in real-world audio signals

M Goto - Speech Communication, 2004 - Elsevier
In this paper, we describe the concept of music scene description and address the problem
of detecting melody and bass lines in real-world audio signals containing the sounds of …

A computationally efficient multipitch analysis model

T Tolonen, M Karjalainen - IEEE Transactions on Speech and …, 2000 - ieeexplore.ieee.org
A computationally efficient model for multipitch and periodicity analysis of complex audio
signals is presented. The model essentially divides the signal into two channels, below and …

[PDF][PDF] Active audition for humanoid

K Nakadai, T Lourens, HG Okuno, H Kitano - AAAI/IAAI, 2000 - cdn.aaai.org
In this paper, we present an active audition system for humanoid robot “SIG the humanoid”.
The audition system of the highly intelligent humanoid requires localization of sound …

[PDF][PDF] Multimodal person recognition using unconstrained audio and video

T Choudhury, B Clarkson, T Jebara… - … Conference on Audio …, 1999 - lifewear.gatech.edu
We propose a person identification technique that can recognize and verify people from
unconstrained video and audio. We do not expect fully frontal face image or clean speech as …

[BOOK][B] Computational Auditory Scene Analysis: Proceedings of the Ijcai-95 Workshop

DF Rosenthal, HG Okuno - 2021 - books.google.com
The interest of AI in problems related to understanding sounds has a rich history dating back
to the ARPA Speech Understanding Project in the 1970s. While a great deal has been …

The auditory organization of speech and other sources in listeners and computational models

M Cooke, DPW Ellis - Speech Communication, 2001 - Elsevier
Speech is typically perceived against a background of other sounds. Listeners are adept at
extracting target sources from the acoustic mixture reaching the ears. The auditory scene …

The auditory “primal sketch”: A multiscale model of rhythmic grouping

NPMA Todd - Journal of New Music Research, 1994 - Taylor & Francis
In this paper a new theory of rhythmic grouping is proposed which is based on a multitime
scale decomposition of the auditory nerve response. The theory has been inspired by the …