
Human vs machine: establishing a human baseline for multimodal location estimation

Authors

Jaeyoung Choi, Howard Lei, Venkatesan Ekambaram, Pascal Kelm, Luke Gottlieb, Thomas Sikora, Kannan Ramchandran, Gerald Friedland

Publication date

2013/10/21

Book

Proceedings of the 21st ACM international conference on Multimedia

Pages

867-876

Description

Over the recent years, the problem of video location estimation (i.e., estimating the longitude/latitude coordinates of a video without GPS information) has been approached with diverse methods and ideas in the research community and significant improvements have been made. So far, however, systems have only been compared against each other and no systematic study on human performance has been conducted. Based on a human-subject study with 11,900 experiments, this article presents a human baseline for location estimation for different combinations of modalities (audio, audio/video, audio/video/text). Furthermore, this article compares state-of-the-art location estimation systems with the human baseline. Although the overall performance of humans' multimodal video location estimation is better than current machine learning approaches, the difference is quite small: For 41% of the test set, the machine …

Total citations

Cited by 19

Citations per year, 2014 to 2021 (bar chart; extracted bar values: 2, 7, 4, 3, 1, 1, 1)
