Graph Attention Networks for Link Prediction in Semantic Word Grouping

University essay from Uppsala universitet/Avdelningen för beräkningsvetenskap

Author: Anton Gollbo; [2023]

Abstract: Manually extracting relevant information from extensive amounts of data can be time-consuming and labour-intensive. Automating this process can shift the focus toward analysing and using the extracted information, rather than allocating time and resources to data collection and preparation. Information extraction refers to methods of automatically extracting structured information from unstructured or semi-structured documents. One class of such documents is visually rich documents, a term encompassing documents that contain a significant amount of visual content, such as images, charts, or diagrams, in addition to text. Examples of visually rich documents include PDFs and scanned documents. A graph neural network (GNN) is a type of neural network designed to perform inference on data represented as graphs. Using graph representations, GNNs can incorporate both visual and textual information when performing inference. Leveraging GNNs can therefore be particularly useful for information extraction from visually rich documents, since such documents commonly contain inherent structures that are essential for understanding them. Semantic word grouping is a technique that groups individual word entities into corresponding entity groups. This thesis analyses the performance of state-of-the-art GNNs on the task of link prediction between nodes in a data set of labeled restaurant menus. The method shows promising results in the field of link prediction in an information extraction setting. Further, incorporating additional features related to structural information in the documents can significantly improve performance.
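
To make the idea of link prediction with graph attention concrete, the following is a minimal sketch and not the thesis's implementation. It assumes a single-head graph attention layer in the standard formulation, untrained random weights, and a sigmoid of the dot product as the link scorer. The names gat_layer and link_scores, the feature sizes, and the toy adjacency matrix are all hypothetical. Each node stands in for one word in a document, and its feature vector for whatever textual and layout features are available.

```python
import numpy as np

def gat_layer(X, adj, W, a, slope=0.2):
    """Single-head graph attention layer (illustrative, untrained)."""
    H = X @ W                              # (n, d_out) projected node features
    d = H.shape[1]
    # e[i, j] = LeakyReLU(a^T [h_i || h_j]), split into source and target parts
    src = H @ a[:d]                        # (n,)
    dst = H @ a[d:]                        # (n,)
    e = src[:, None] + dst[None, :]
    e = np.where(e > 0, e, slope * e)
    # Attend only over neighbours and the node itself; mask everything else
    mask = (adj + np.eye(len(adj))) > 0
    e = np.where(mask, e, -np.inf)
    e = e - e.max(axis=1, keepdims=True)   # numerically stable softmax
    alpha = np.exp(e)
    alpha = alpha / alpha.sum(axis=1, keepdims=True)
    return alpha @ H, alpha

def link_scores(Z):
    """Score every node pair with a sigmoid of the embedding dot product."""
    return 1.0 / (1.0 + np.exp(-(Z @ Z.T)))

# Toy graph: five word nodes with hypothetical 4-dimensional features
rng = np.random.default_rng(0)
n, d_in, d_out = 5, 4, 3
X = rng.normal(size=(n, d_in))
adj = np.array([[0, 1, 0, 0, 0],
                [1, 0, 1, 0, 0],
                [0, 1, 0, 0, 0],
                [0, 0, 0, 0, 1],
                [0, 0, 0, 1, 0]], dtype=float)
W = rng.normal(size=(d_in, d_out))         # assumed projection weights
a = rng.normal(size=2 * d_out)             # assumed attention vector

Z, alpha = gat_layer(X, adj, W, a)
scores = link_scores(Z)                    # scores[i, j]: link score for (i, j)
```

In a trained model, the weights would be learned so that pairs of words belonging to the same entity group receive high scores. Thresholding the scores then yields the predicted links from which word groups can be formed.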
