Graph-LSTM with Global Attribute for Scene Graph Generation
Research output: Journal Publications and Reviews (RGC: 21, 22, 62) › 21_Publication in refereed journal › peer-review
Author(s)
Detail(s)
Original language | English |
---|---|
Article number | 012001 |
Journal / Publication | Journal of Physics: Conference Series |
Volume | 2003 |
Online published | 27 Aug 2021 |
Publication status | Published - 2021 |
Externally published | Yes |
Conference
Title | 2021 International Conference on Artificial Intelligence, Automation and Algorithms (AI2A 2021) |
---|---|
Location | Online |
Place | China |
City | Guilin |
Period | 23 - 25 July 2021 |
Link(s)
DOI | DOI |
---|---|
Attachment(s) | Documents
Publisher's Copyright Statement
|
Link to Scopus | https://www.scopus.com/record/display.uri?eid=2-s2.0-85115018602&origin=recordpage |
Permanent Link | https://scholars.cityu.edu.hk/en/publications/publication(a71b21aa-a587-4420-9f85-978b553ce0d1).html |
Abstract
Lots of machine learning tasks require dealing with graph data, and among them, scene graph generation is a challenging one that calls for graph neural networks' potential ability. In this paper, we present a definition of graph neural network (GNN) consists of node, edge and global attribute, as well as their corresponding update and aggregate functions. Based on this, we then propose a realization of GNN model called Graph-LSTM and use it in scene graph generation. The model first extracts the item features in the image as the initial states of the node-LSTM representing subject/object and edge-LSTM representing predicate. Two LSTMs update the states via LSTM's timestep and aggregate information via message passing. Repeat the update-aggregate until convergence. Meanwhile, the tag feature, i.e., the generated probability distribution of image's semantic concepts is sent to the LSTM through a semantic compositional network (SCN). The SCN-LSTM is trained in an ensemble style, and hence allows the tag feature to serve as the global attribute providing context information to all individuals. The LSTMs' final states are input to inference modules and generate the triplet (subject, predicate, object) of the scene graph. Experimental results show that Graph-LSTM outperforms the Message Passing and the attention Graph Covolutional Network methods, proving the effectiveness of the proposed scheme.
Research Area(s)
Citation Format(s)
Graph-LSTM with Global Attribute for Scene Graph Generation. / Shao, Tong; Wu, Dapeng Oliver.
In: Journal of Physics: Conference Series, Vol. 2003, 012001, 2021.Research output: Journal Publications and Reviews (RGC: 21, 22, 62) › 21_Publication in refereed journal › peer-review
Download Statistics
No data available