查看文章

linliang.net 中的 [PDF]

I2t: Image parsing to text description

作者

Benjamin Z Yao, Xiong Yang, Liang Lin, Mun Wai Lee, Song-Chun Zhu

发表日期

2010/6/17

期刊

Proceedings of the IEEE

卷号

期号

页码范围

1485-1508

出版商

IEEE

简介

In this paper, we present an image parsing to text description (I2T) framework that generates text descriptions of image and video content based on image understanding. The proposed I2T framework follows three steps: 1) input images (or video frames) are decomposed into their constituent visual patterns by an image parsing engine, in a spirit similar to parsing sentences in natural language; 2) the image parsing results are converted into semantic representation in the form of Web ontology language (OWL), which enables seamless integration with general knowledge bases; and 3) a text generation engine converts the results from previous steps into semantically meaningful, human readable, and query-able text reports. The centerpiece of the I2T framework is an and-or graph (AoG) visual knowledge representation, which provides a graphical representation serving as prior knowledge for representing diverse …

引用总数

被引用次数：399

20092010201120122013201420152016201720182019202020212022202320241 8 17 27 28 36 43 27 30 30 19 24 36 32 22 13

学术搜索中的文章

I2t: Image parsing to text description

BZ Yao, X Yang, L Lin, MW Lee, SC Zhu - Proceedings of the IEEE, 2010

被引用次数：399 相关文章所有 9 个版本