The MGB challenge: Evaluating multi-genre broadcast media recognition

P Bell, MJF Gales, T Hain, J Kilgour… - … IEEE Workshop on …, 2015 - ieeexplore.ieee.org
This paper describes the Multi-Genre Broadcast (MGB) Challenge at ASRU 2015, an
evaluation focused on speech recognition, speaker diarization, and "lightly supervised" …

On influential trends in interactive video retrieval: Video Browser Showdown 2015–2017

J Lokoč, W Bailer, K Schoeffmann… - IEEE Transactions …, 2018 - ieeexplore.ieee.org
The last decade has seen innovations that make video recording, manipulation, storage,
and sharing easier than ever before, thus impacting many areas of life. New video retrieval …

Albayzin 2018 evaluation: the IberSpeech-RTVE challenge on speech technologies for Spanish broadcast media

E Lleida, A Ortega, A Miguel, V Bazán-Gil, C Pérez… - Applied sciences, 2019 - mdpi.com
The IberSpeech-RTVE Challenge presented at IberSpeech 2018 is a new Albayzin
evaluation series supported by the Spanish Thematic Network on Speech Technologies …

TREC 2020 podcasts track overview

R Jones, B Carterette, A Clifton, M Eskevich… - arXiv preprint arXiv …, 2021 - arxiv.org
The Podcast Track is new at the Text Retrieval Conference (TREC) in 2020. The podcast
track was designed to encourage research into podcasts in the information retrieval and …

Interactive video search tools: a detailed analysis of the Video Browser Showdown 2015

C Cobârzan, K Schoeffmann, W Bailer, W Hürst… - Multimedia Tools and …, 2017 - Springer
Interactive video retrieval tools developed over the past few years are emerging as powerful
alternatives to automatic retrieval approaches by giving the user more control as well as …

TutorialVQA: Question answering dataset for tutorial videos

A Colas, S Kim, F Dernoncourt, S Gupte… - arXiv preprint arXiv …, 2019 - arxiv.org
Despite the number of currently available datasets on video question answering, there still
remains a need for a dataset involving multi-step and non-factoid answers. Moreover …

Bidirectional joint representation learning with symmetrical deep neural networks for multimodal and crossmodal applications

V Vukotić, C Raymond, G Gravier - Proceedings of the 2016 ACM on …, 2016 - dl.acm.org
Common approaches to problems involving multiple modalities (classification, retrieval,
hyperlinking, etc.) are early fusion of the initial modalities and crossmodal translation from …

When textual and visual information join forces for multimedia retrieval

B Safadi, M Sahuguet, B Huet - Proceedings of International Conference …, 2014 - dl.acm.org
Currently, popular search engines retrieve documents on the basis of text information.
However, integrating the visual information with the text-based search for video and image …

Multimedia information seeking through search and hyperlinking

M Eskevich, GJF Jones, R Aly, RJF Ordelman… - Proceedings of the 3rd …, 2013 - dl.acm.org
Searching for relevant webpages and following hyperlinks to related content is a widely
accepted and effective approach to information seeking on the textual web. Existing work on …

Blip10000: A social video dataset containing SPUG content for tagging and retrieval

S Schmiedeke, P Xu, I Ferrané, M Eskevich… - Proceedings of the 4th …, 2013 - dl.acm.org
The increasing amount of digital multimedia content available is inspiring potential new
types of user interaction with video data. Users want to easily find the content by searching …