(ASC) and a visualization method for presenting a sound scene context. To this end, we first
propose an inception-based ASC model with a low memory footprint as the ASC baseline. The
ASC baseline is then compared with benchmark and high-complexity network architectures.
Next, we improve the ASC baseline by proposing a novel deep neural network
that leverages a residual-inception architecture and multiple kernels. Given the novel …