Learning neural acoustic fields

A Luo, Y Du, M Tarr, J Tenenbaum… - Advances in Neural …, 2022 - proceedings.neurips.cc
Our environment is filled with rich and dynamic acoustic information. When we walk into a
cathedral, the reverberations as much as appearance inform us of the sanctuary's wide open …

Few-shot audio-visual learning of environment acoustics

S Majumder, C Chen, Z Al-Halah… - Advances in Neural …, 2022 - proceedings.neurips.cc
Room impulse response (RIR) functions capture how the surrounding physical environment
transforms the sounds heard by a listener, with implications for various applications in AR …

Av-rir: Audio-visual room impulse response estimation

A Ratnarajah, S Ghosh, S Kumar… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract Accurate estimation of Room Impulse Response (RIR) which captures an
environment's acoustic properties is important for speech processing and AR/VR …

Real acoustic fields: An audio-visual room acoustics dataset and benchmark

Z Chen, ID Gebru, C Richardt… - Proceedings of the …, 2024 - openaccess.thecvf.com
We present a new dataset called Real Acoustic Fields (RAF) that captures real acoustic
room data from multiple modalities. The dataset includes high-quality and densely captured …

Inras: Implicit neural representation for audio scenes

K Su, M Chen, E Shlizerman - Advances in Neural …, 2022 - proceedings.neurips.cc
The spatial acoustic information of a scene, ie, how sounds emitted from a particular location
in the scene are perceived in another location, is key for immersive scene modeling. Robust …

Adverb: Visually guided audio dereverberation

S Chowdhury, S Ghosh, S Dasgupta… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present AdVerb, a novel audio-visual dereverberation framework that uses visual cues
in addition to the reverberant sound to estimate clean audio. Although audio-only …

Av-nerf: Learning neural fields for real-world audio-visual scene synthesis

S Liang, C Huang, Y Tian… - Advances in Neural …, 2024 - proceedings.neurips.cc
Can machines recording an audio-visual scene produce realistic, matching audio-visual
experiences at novel positions and novel view directions? We answer it by studying a new …

Be everywhere-hear everything (bee): Audio scene reconstruction by sparse audio-visual samples

M Chen, K Su, E Shlizerman - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
Fully immersive and interactive audio-visual scenes are dynamic such that the listeners and
the sound emitters move and interact with each other. Reconstruction of an immersive sound …

Mesh2ir: Neural acoustic impulse response generator for complex 3d scenes

A Ratnarajah, Z Tang, R Aralikatti… - Proceedings of the 30th …, 2022 - dl.acm.org
We propose a mesh-based neural network (MESH2IR) to generate acoustic impulse
responses (IRs) for indoor 3D scenes represented using a mesh. The IRs are used to create …

Towards improved room impulse response estimation for speech recognition

A Ratnarajah, I Ananthabhotla… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
We propose a novel approach for blind room impulse response (RIR) estimation systems in
the context of a downstream application scenario, far-field automatic speech recognition …