field size, number of parameters and spatial resolution of features in deeper layers of the
network. In this work we present a novel network design based on combination of many
convolutional and recurrent layers that solves these dilemmas. We compare our solution
with U-nets based models known from the literature and other baseline models on speech
enhancement task. We test our solution on TIMIT speech utterances combined with noise …