extract deep acoustic features, such as deep constant-Q cepstral coefficient (CQCC) features, from a received voice sample.
A second deep neural network processes the deep acoustic features and classifies them
according to a determined likelihood that the voice sample includes a spoofing
condition. A binary classifier then classifies the voice sample as genuine or spoofed.
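
The three stages described above (feature extraction, likelihood scoring, binary decision) can be illustrated with a minimal sketch. Everything concrete in it is an assumption made for illustration only: the layer sizes, the randomly initialized weights standing in for trained networks, the function names, and the use of a fixed threshold on the likelihood as the binary classifier. It is not the described system's actual architecture, and a real system would use trained networks operating on real acoustic input.

```python
import math
import random


def dense(inputs, weights, biases):
    """One fully connected layer; `weights` holds one row per output unit."""
    return [sum(w * x for w, x in zip(row, inputs)) + b
            for row, b in zip(weights, biases)]


def relu(values):
    return [max(0.0, v) for v in values]


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def random_layer(rng, n_in, n_out):
    """Untrained placeholder weights (hypothetical; a real system would train these)."""
    weights = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases


rng = random.Random(0)
N_INPUT, N_DEEP = 20, 8                 # hypothetical dimensions
W1, b1 = random_layer(rng, N_INPUT, N_DEEP)  # stand-in for the feature-extraction stage
W2, b2 = random_layer(rng, N_DEEP, 1)        # stand-in for the second network


def extract_deep_features(acoustic_input):
    """Stage 1: map low-level acoustic input to deep acoustic features."""
    return relu(dense(acoustic_input, W1, b1))


def spoof_likelihood(deep_features):
    """Stage 2: score the deep features with a likelihood in (0, 1)."""
    return sigmoid(dense(deep_features, W2, b2)[0])


def classify(acoustic_input, threshold=0.5):
    """Stage 3: binary decision, assumed here to be a threshold on the likelihood."""
    p = spoof_likelihood(extract_deep_features(acoustic_input))
    label = "spoofed" if p >= threshold else "genuine"
    return label, p


# Synthetic input standing in for features computed from a voice sample.
sample = [rng.uniform(-1.0, 1.0) for _ in range(N_INPUT)]
label, p = classify(sample)
print(label, round(p, 3))
```

Keeping the likelihood score separate from the final decision lets the threshold be tuned independently, for example to trade false acceptances against false rejections, without touching either network.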