Authors
Xiang Yu, Jie Li, Shijing Yuan, Chao Wang, Chentao Wu
Publication date
2022/8/21
Conference paper
2022 26th International Conference on Pattern Recognition (ICPR)
Pages
1894-1900
Publisher
IEEE
Description
Existing scene graph generation (SGG) methods are far from practical, primarily because they perform poorly when predicting zero-shot (i.e., unseen) subject-predicate-object triples. We observe that these SGG methods treat each image, along with the triples in it, independently and thus fail to consider the complex, hidden information implicit in the triples of other images. To this end, our paper proposes a novel encoder-decoder SGG framework that leverages the semantic correlations between the triples of different images when predicting a zero-shot triple. Specifically, the encoder aggregates the triples from every image in the training set into a large knowledge graph and uses a relational graph neural network to learn entity embeddings that capture the features of their neighborhoods. The neighborhood-aware embeddings are then fed into the vision-based decoder to predict the predicates in …
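The encoder idea described above (merging per-image triples into one knowledge graph, then computing neighborhood-aware entity embeddings) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `build_knowledge_graph` and `relational_aggregate`, the hand-set embeddings, and the use of one scalar weight per predicate (in place of the learned weight matrices a relational graph neural network would normally use) are all assumptions made for illustration.

```python
from collections import defaultdict


def build_knowledge_graph(images):
    """Merge per-image (subject, predicate, object) triples into one graph.

    Hypothetical helper: returns {entity: set of (predicate, neighbor, direction)}.
    """
    graph = defaultdict(set)
    for triples in images:
        for subj, pred, obj in triples:
            graph[subj].add((pred, obj, "out"))  # edge leaving the subject
            graph[obj].add((pred, subj, "in"))   # same edge seen from the object
    return dict(graph)


def relational_aggregate(graph, embeddings, relation_weights, self_weight=1.0):
    """One simplified relation-aware message-passing step.

    Assumption: each predicate has a single scalar weight, not a learned matrix.
    For each entity, neighbor embeddings are averaged per (predicate, direction)
    group, scaled by the predicate weight, added to the entity's own embedding,
    and passed through a ReLU.
    """
    updated = {}
    for entity, edges in graph.items():
        dim = len(embeddings[entity])
        by_relation = defaultdict(list)
        for pred, neighbor, direction in edges:
            by_relation[(pred, direction)].append(embeddings[neighbor])

        new_vec = [self_weight * x for x in embeddings[entity]]
        for (pred, _direction), vecs in by_relation.items():
            weight = relation_weights.get(pred, 1.0)
            for i in range(dim):
                mean_i = sum(v[i] for v in vecs) / len(vecs)
                new_vec[i] += weight * mean_i
        updated[entity] = [max(0.0, x) for x in new_vec]  # ReLU
    return updated


# Toy data (made up for illustration): two images sharing the entity "person".
images = [
    [("person", "riding", "horse")],
    [("person", "riding", "bike"), ("bike", "on", "road")],
]
embeddings = {
    "person": [1.0, 0.0],
    "horse": [0.0, 1.0],
    "bike": [0.0, 2.0],
    "road": [1.0, 1.0],
}
relation_weights = {"riding": 0.5, "on": 0.25}

graph = build_knowledge_graph(images)
updated = relational_aggregate(graph, embeddings, relation_weights)
```

In this toy setup the updated embedding of "person" mixes in information from both "horse" and "bike", even though the two never appear in the same image, which is the kind of cross-image correlation the abstract refers to. A real encoder would learn the weights and stack several such layers.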