Deep learning for environmentally robust speech recognition: An overview of recent developments Z Zhang, J Geiger, J Pohjalainen, AED Mousa, W Jin, B Schuller ACM Transactions on Intelligent Systems and Technology (TIST) 9 (5), 1-28, 2018 | 396 | 2018 |
The TUM Gait from Audio, Image and Depth (GAID) Database: Multimodal Recognition of Subjects and Traits M Hofmann, J Geiger, S Bachmann, B Schuller, G Rigoll Journal of Visual Communication and Image Representation (JVCI), Special …, 2014 | 271 | 2014 |
Large-scale audio feature extraction and SVM for acoustic scene classification JT Geiger, B Schuller, G Rigoll WASPAA, 2013 | 158 | 2013 |
Robust speech recognition using long short-term memory recurrent neural networks for hybrid acoustic modelling JT Geiger, Z Zhang, F Weninger, B Schuller, G Rigoll Proc. INTERSPEECH 2014, Singapore, Singapore, 2014 | 98 | 2014 |
Improving event detection for audio surveillance using Gabor filterbank features JT Geiger, K Helwani EUSIPCO, 2015 | 76 | 2015 |
Feature enhancement by deep LSTM networks for ASR in reverberant multisource environments F Weninger, J Geiger, M Wöllmer, B Schuller, G Rigoll Computer Speech & Language 28 (4), 888-902, 2014 | 72 | 2014 |
Acoustic gait-based person identification using hidden Markov models JT Geiger, M Kneißl, BW Schuller, G Rigoll Proceedings of the 2014 workshop on mapping personality traits challenge and …, 2014 | 65 | 2014 |
Detecting Overlapping Speech with Long Short-Term Memory Recurrent Neural Networks JT Geiger, F Eyben, B Schuller, G Rigoll Interspeech, 2013 | 58 | 2013 |
The Munich 2011 CHiME Challenge Contribution: NMF-BLSTM Speech Enhancement and Recognition for Reverberated Multisource Environments F Weninger, J Geiger, M Wöllmer, B Schuller, G Rigoll | 56 | 2011 |
Non-negative matrix factorization for highly noise-robust ASR: To enhance or to recognize? F Weninger, M Wöllmer, J Geiger, B Schuller, JF Gemmeke, ... ICASSP, 2012 | 50 | 2012 |
GMM-UBM based open-set online speaker diarization J Geiger, F Wallhoff, G Rigoll Proc. INTERSPEECH 2010, Makuhari, Japan, 2330-2333, 2010 | 42 | 2010 |
The MERL/MELCO/TUM system for the REVERB challenge using deep recurrent neural network feature enhancement F Weninger, S Watanabe, J Le Roux, JR Hershey, Y Tachioka, J Geiger, ... REVERB Workshop, 2014 | 41 | 2014 |
Memory-Enhanced Neural Networks and NMF for Robust ASR J Geiger, F Weninger, J Gemmeke, M Wöllmer, B Schuller, G Rigoll Audio, Speech, and Language Processing, IEEE/ACM Transactions on 20 (6 …, 2014 | 38 | 2014 |
Speech Overlap Detection and Attribution Using Convolutive Non-Negative Sparse Coding R Vipperla, J Geiger, S Bozonnet, D Wang, N Evans, B Schuller, G Rigoll Proc. ICASSP, 4181-4184, 2012 | 38 | 2012 |
The TUM+TUT+KUL approach to the 2nd CHiME challenge: Multi-stream ASR exploiting BLSTM networks and sparse NMF JT Geiger, F Weninger, A Hurmalainen, JF Gemmeke, M Wöllmer, ... CHiME Workshop, 2013 | 33 | 2013 |
Recognising acoustic scenes with large-scale audio feature extraction and SVM JT Geiger, B Schuller, G Rigoll | 29 | 2013 |
Signal processing apparatus for enhancing a voice component within a multi-channel audio signal J Geiger, P Grosche US Patent 10,210,883, 2019 | 25 | 2019 |
Dialogue enhancement of stereo sound JT Geiger, P Grosche, YL Parodi EUSIPCO, 2015 | 24 | 2015 |
Gait-based person identification by spectral, cepstral and energy-related audio features JT Geiger, M Hofmann, B Schuller, G Rigoll ICASSP, 2013 | 24 | 2013 |
The Munich feature enhancement approach to the 2nd CHiME challenge using BLSTM recurrent neural networks F Weninger, J Geiger, M Wöllmer, B Schuller, G Rigoll | 23 | 2013 |