GNEE - GAT Neural Event Embeddings

This repository contains source code for the GNEE (GAT Neural Event Embeddings) method introduced in the paper: "Semi-Supervised Graph Attention Networks for Event Representation Learning".

Abstract: Event analysis from news and social networks is very useful for a wide range of social studies and real-world applications. Recently, event graphs have been explored to represent event datasets and their complex relationships, where events are vertices connected to other vertices that represent locations, people's names, dates, and various other event metadata. Graph representation learning methods are promising for extracting latent features from event graphs to enable the use of different classification algorithms. However, existing methods fail to meet important requirements for event graphs, such as (i) dealing with semi-supervised graph embedding to take advantage of some labeled events, (ii) automatically determining the importance of the relationships between event vertices and their metadata vertices, as well as (iii) dealing with the graph heterogeneity. In this paper, we present GNEE (GAT Neural Event Embeddings), a method that combines Graph Attention Networks and Graph Regularization. First, an event graph regularization is proposed to ensure that all graph vertices receive event features, thereby mitigating the graph heterogeneity drawback. Second, semi-supervised graph embedding with self-attention mechanism considers existing labeled events, as well as learns the importance of relationships in the event graph during the representation learning process. A statistical analysis of experimental results with five real-world event graphs and six graph embedding methods shows that GNEE obtains state-of-the-art results.

File Structure

Our method consists of a BERT text encoding and a pre-processment procedure followed by modified version of GAT (Veličković et. al - 2017, https://arxiv.org/abs/1710.10903) to the event embedding task.

In our work, we adopt and modify the PyTorch implementation of GAT, pyGAT, developed by Diego999.

.
├── datasets_runs/ -> Datasets used
├── event_graph_utils.py -> Useful functions when working with event datasets
├── layers.py -> Implementation of Graph Attention layers
├── LICENSE
├── main.py -> Execute this script to reproduce our experiments (refer to our paper for more details)
├── models.py -> Implementation of the original GAT model
├── notebooks -> Run these notebooks to reproduce all our experiments.
├── README.md
├── requirements.txt
├── train.py -> Implementation of our preprocessing, traning and testing pipelines
└── utils.py -> Useful functions used in GAT original implementation.

Reproducibility Notebooks

./notebooks
├── DeepWalk_Event_Embeddings.ipynb -> DeepWalk Benchmark
├── GAT_Event_Embeddings_+_Without_Regularization.ipynb -> GAT w/o embeddings benchmark
├── GCN_Event_Embeddings_.ipynb -> GCN Benchmark
├── GNEE_Attention_Matrices_Example.ipynb -> GNEE Attention matrices visualization
├── GNEE_Embedding_Visualization_t_SNE.ipynb -> GNEE Embeddings visualization using t-SNE
├── GNEE.ipynb -> GNEE Benchmark
├── Label_Propagation_Event_Classification.ipynb -> LP Benchmark
├── LINE_Event_Embeddings.ipynb -> LINE Benchmark
├── Node2Vec_Event_Embeddings.ipynb -> Node2Vec Benchmark
├── SDNE_Event_Embeddings.ipynb -> SDNE Benchmark
└── Struct2Vec_Event_Embeddings.ipynb -> Struct2Vec Benchmark

Hardware requirements

When running on "dense" mode (no --sparse flag), our model uses about 18 GB on GRAM. On the other hand, the sparse mode (using --sparse) uses less than 1.5 GB on GRAM, which is an ideal setup to environments such as Google Colab.

Issues/Pull Requests/Feedbacks

Please, contact the authors in case of issues / pull requests / feedbacks :)

GNEE - GAT Neural Event Embeddings

Related tags

Overview

GNEE - GAT Neural Event Embeddings

File Structure

Reproducibility Notebooks

Hardware requirements

Issues/Pull Requests/Feedbacks

Owner

João Pedro Rodrigues Mattos

[CVPR 2022] Official Pytorch code for OW-DETR: Open-world Detection Transformer

The Codebase for Causal Distillation for Language Models.

Code for Paper: Self-supervised Learning of Motion Capture

OpenMMLab Semantic Segmentation Toolbox and Benchmark.

[CVPR 2021] Unsupervised Degradation Representation Learning for Blind Super-Resolution

Unified file system operation experience for different backend

Language Models Can See: Plugging Visual Controls in Text Generation

PyTorch implementation of SampleRNN: An Unconditional End-to-End Neural Audio Generation Model

ByteTrack超详细教程！训练自己的数据集&&摄像头实时检测跟踪

Adjust Decision Boundary for Class Imbalanced Learning

Make Watson Assistant send messages to your Discord Server

SMPLpix: Neural Avatars from 3D Human Models

Codebase for testing whether hidden states of neural networks encode discrete structures.

Pipeline code for Sequential-GAM(Genome Architecture Mapping).

Activity tragle - Google is tracking everything, we just look at it

Learning to Draw: Emergent Communication through Sketching

A PaddlePaddle implementation of STGCN with a few modifications in the model architecture in order to forecast traffic jam.

Event sourced bank - A wide-and-shallow example using the Python event sourcing library

Mall-Customers-Segmentation - Customer Segmentation Using K-Means Clustering

Video Swin Transformer - PyTorch