STDP based unsupervised multimodal learning with cross-modal processing in spiking neural networks

N Rathi, K Roy - IEEE Transactions on Emerging Topics in …, 2018 - ieeexplore.ieee.org
Spiking neural networks perform reasonably well in recognition applications for single
modality (eg, images, audio, or text). In this paper, we propose a multimodal spiking neural …

Advanced news video parsing via visual characteristics of anchorperson scenes

Y Dong, G Qin, G Xiao, S Lian, X Chang - Telecommunication Systems, 2013 - Springer
In this paper, we present an advanced news video parsing system via exploring the visual
characteristics of anchorperson scenes, which aims to provide personalized news services …

Brain-like evolving spiking neural networks for multimodal information processing

SG Wysoski, L Benuskova, N Kasabov - Brain-Inspired Information …, 2010 - Springer
Despite of much evidence suggesting how and where sensory information converge in the
human brain, the neural mechanisms of interaction among modalities at the level of …

Robust anchorperson detection based on audio streams using a hybrid I-vector and DNN system

YF Chang, P Lin, SH Cheng, KH Chan… - … Annual Summit and …, 2014 - ieeexplore.ieee.org
Anchorperson segment detection enables efficient video content indexing for information
retrieval. Anchorperson detection based on audio analysis has gained popularity due to …

Production and multi-channel distribution of news

E Mannens, M Verwaest, R Van de Walle - Multimedia Systems, 2008 - Springer
News production is characterised by complex and dynamic workflows as it is important to
produce and distribute news as soon as possible and in an audiovisual quality as good as …

A visual grammar approach for TV program identification

T Zlitni, W Mahdi - arXiv preprint arXiv:1301.2200, 2013 - arxiv.org
Automatic identification of TV programs within TV streams is an important task for archive
exploitation. This paper proposes a new spatial-temporal approach to identify programs in …

Personal television: A crossmodal analysis approach

P Dunker, M Gruhne, S Sturtz - 2008 IEEE International …, 2008 - ieeexplore.ieee.org
Personal information consumption became more and more important due to the huge
number of existing information channels and the broad range of available information. While …

Training Methodologies for Energy-Efficient, Low Latency Spiking Neural Networks

N Rathi - 2021 - search.proquest.com
Deep learning models have become the de-facto solution in various fields like computer
vision, natural language processing, robotics, drug discovery, and many others. The …

A new approach for TV program identification based on video grammar

T Zlitni, W Mahdi, HB Abdallah - … of the 7th International Conference on …, 2009 - dl.acm.org
In this paper, we propose a new approach to identify programs in TV streams. In the first step
of our approach, we construct a reference catalogue for video grammars of visual jingles. In …

Anchorperson shot detection in MPEG domain

Z Ji, C Zhang, Y Su - 2007 IEEE International Conference on …, 2007 - ieeexplore.ieee.org
In this paper, a refined ASD algorithm in MPEG compressed domain is proposed. The new
method is expected to outperform the existing strategies based on the following two …