Computational auditory scene analysis: a representational approach.

A Klapuri, M Davy - 2007 - books.google.com

Signal Processing Methods for Music Transcription is the first book dedicated to uniting
research related to signal processing algorithms and models for various aspects of music …

被引用次数：866 相关文章所有 8 个版本

[PDF] psu.edu

The cocktail party problem

S Haykin, Z Chen - Neural computation, 2005 - direct.mit.edu

This review presents an overview of a challenging problem in auditory perception, the
cocktail party phenomenon, the delineation of which goes back to a classic paper by Cherry …

被引用次数：696 相关文章所有 13 个版本

[图书][B] Computational auditory scene analysis: Principles, algorithms, and applications

DL Wang, GJ Brown - 2006 - dl.acm.org

Computational Auditory Scene Analysis | Guide books skip to main content ACM Digital Library
home ACM home Google, Inc. (search) Advanced Search Browse About Sign in Register …

被引用次数：1746 相关文章所有 4 个版本

[PDF] aist.go.jp

A real-time music-scene-description system: Predominant-F0 estimation for detecting melody and bass lines in real-world audio signals

M Goto - Speech Communication, 2004 - Elsevier

In this paper, we describe the concept of music scene description and address the problem
of detecting melody and bass lines in real-world audio signals containing the sounds of …

被引用次数：546 相关文章所有 14 个版本

[PDF] psu.edu

A computationally efficient multipitch analysis model

T Tolonen, M Karjalainen - IEEE transactions on speech and …, 2000 - ieeexplore.ieee.org

A computationally efficient model for multipitch and periodicity analysis of complex audio
signals is presented. The model essentially divides the signal into two channels, below and …

被引用次数：506 相关文章所有 12 个版本

[PDF] aaai.org

[PDF][PDF] Active audition for humanoid

K Nakadai, T Lourens, HG Okuno, H Kitano - AAAI/IAAI, 2000 - cdn.aaai.org

In this paper, we present an active audition system for humanoid robot “SIG the humanoid”.
The audition system of the highly intelligent humanoid requires localization of sound …

被引用次数：354 相关文章所有 15 个版本

[PDF] gatech.edu

[PDF][PDF] Multimodal person recognition using unconstrained audio and video

T Choudhury, B Clarkson, T Jebara… - … Conference on Audio …, 1999 - lifewear.gatech.edu

We propose a person identification technique that can recognize and verify people from
unconstrained video and audio. We do not expect fully frontal face image or clean speech as …

被引用次数：285 相关文章所有 14 个版本

[图书][B] Computational Auditory Scene Analysis: Proceedings of the Ijcai-95 Workshop

DF Rosenthal, HG Okuno, H Okuno, D Rosenthal - 2021 - books.google.com

The interest of AI in problems related to understanding sounds has a rich history dating back
to the ARPA Speech Understanding Project in the 1970s. While a great deal has been …

被引用次数：298 相关文章所有 6 个版本

[PDF] psu.edu

The auditory organization of speech and other sources in listeners and computational models

M Cooke, DPW Ellis - Speech communication, 2001 - Elsevier

Speech is typically perceived against a background of other sounds. Listeners are adept at
extracting target sources from the acoustic mixture reaching the ears. The auditory scene …

被引用次数：230 相关文章所有 14 个版本

The auditory “primal sketch”: A multiscale model of rhythmic grouping

NPMA Todd - Journal of new music Research, 1994 - Taylor & Francis

In this paper a new theory of rhythmic grouping is proposed which is based on a multitime
scale decomposition of the auditory nerve response. The theory has been inspired by the …

被引用次数：216 相关文章所有 3 个版本

高级搜索

QQ 群