Beyond image to depth: Improving depth prediction using echoes

KK Parida, S Srivastava… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
We address the problem of estimating depth with multi modal audio visual data. Inspired by
the ability of animals, such as bats and dolphins, to infer distance of objects with …

Sound localization from motion: Jointly learning sound direction and camera rotation

Z Chen, S Qian, A Owens - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
The images and sounds that we perceive undergo subtle but geometrically consistent
changes as we rotate our heads. In this paper, we use these cues to solve a problem we call …

Sound localization by self-supervised time delay estimation

Z Chen, DF Fouhey, A Owens - European Conference on Computer Vision, 2022 - Springer
Sounds reach one microphone in a stereo pair sooner than the other, resulting in an
interaural time delay that conveys their directions. Estimating a sound's time delay requires …

CatChatter: Acoustic perception for mobile robots

E Tracy, N Kottege - IEEE Robotics and Automation Letters, 2021 - ieeexplore.ieee.org
There are many examples in nature of animals using acoustics to understand and navigate
the world around them. Inspired by this, we train an image-to-image translation network to …

The Audio-Visual BatVision Dataset for Research on Sight and Sound

A Brunetto, S Hornauer, XY Stella… - 2023 IEEE/RSJ …, 2023 - ieeexplore.ieee.org
Vision research showed remarkable success in understanding our world, propelled by
datasets of images and videos. Sensor data from radar, LiDAR and cameras supports …

Depth Estimation of Multi-Modal Scene Based on Multi-Scale Modulation

A Wang, Z Fang, X Jiang, Y Gao… - … Conference on Image …, 2023 - ieeexplore.ieee.org
As multimodal information is complementary, effectively utilizing scene multimodal
information has become an increasingly important research topic for many scholars. This …

Learned Acoustic Reconstruction Using Synthetic Aperture Focusing

T Straubinger, R Xiao, H Rhodin - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org
Many algorithmic approaches to 3D acoustic imaging have been devised which rely on a
large abundance of receiving elements to produce images with delay-and-sum techniques …