MMSceneGraph

Introduction

MMSceneneGraph is an open source code hub for scene graph generation as well as supporting downstream tasks based on the scene graph on PyTorch. The frontend object detector is supported by open-mmlab/mmdetection.

Major features

Modular design

We decompose the framework into different components and one can easily construct a customized scene graph generation framework by combining different modules.
Support of multiple frameworks out of box

The toolbox directly supports popular and contemporary detection frameworks, e.g. Faster RCNN, Mask RCNN, etc.
Visualization support

The visualization of the groundtruth/predicted scene graph is integrated into the toolbox.

License

This project is released under the MIT license.

Changelog

Please refer to CHANGELOG.md for details.

Benchmark and model zoo

The original object detection results and models provided by mmdetection are available in the model zoo. The models for the scene graph generation are temporarily unavailable yet.

Supported methods and Datasets

Supported SGG (VRD) methods:

Supported saliency object detection methods:

R3Net (IJCAI'2018)
SCRN (ICCV'2019)

Supported image captioning methods:

bottom-up (CVPR'2018)
XLAN (CVPR'2020)

Supported datasets:

Visual Genome: VG150 (CVPR'2017)
VRD (ECCV'2016)
Visual Genome: VG200/VG-KR (ours)
MSCOCO (for object detection, image caption)
RelCap (from VG and COCO, ours)

Installation

As our project is built on mmdetection 1.x (which is a bit different from their current master version 2.x), please refer to INSTALL.md. If you want to use mmdetection 2.x, please refer to mmdetection/get_start.md.

Getting Started

Please refer to GETTING_STARTED.md for using the projects. We will update it constantly.

Acknowledgement

We appreciate the contributors of the mmdetection project and Scene-Graph-Benchmark.pytorch which inspires our design.

Citation

If you find this code hub or our works useful in your research works, please consider citing:

@inproceedings{wang2021topic,
  title={Topic Scene Graph Generation by Attention Distillation from Caption},
  author={Wang, Wenbin and Wang, Ruiping and Chen, Xilin},
  booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
  pages={15900--15910},
  month = {October},
  year={2021}
}


@inproceedings{wang2020sketching,
  title={Sketching Image Gist: Human-Mimetic Hierarchical Scene Graph Generation},
  author={Wang, Wenbin and Wang, Ruiping and Shan, Shiguang and Chen, Xilin},
  booktitle={Proceedings of European Conference on Computer Vision (ECCV)},
  pages={222--239},
  year={2020},
  volume={12358},
  doi={10.1007/978-3-030-58601-0_14},
  publisher={Springer}
}

@InProceedings{Wang_2019_CVPR,
author = {Wang, Wenbin and Wang, Ruiping and Shan, Shiguang and Chen, Xilin},
title = {Exploring Context and Visual Pattern of Relationship for Scene Graph Generation},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
pages = {8188-8197},
month = {June},
address = {Long Beach, California, USA},
doi = {10.1109/CVPR.2019.00838},
year = {2019}
}

A brand new hub for Scene Graph Generation methods based on MMdetection (2021). The pipeline of from detection, scene graph generation to downstream tasks (e.g., image cpationing) is supported. Pytorch version implementation of HetH (ECCV 2020) and TopicSG (ICCV 2021) is included.

Related tags

Overview

MMSceneGraph

Introduction

Major features

License

Changelog

Benchmark and model zoo

Supported methods and Datasets

Installation

Getting Started

Acknowledgement

Citation

Owner

Kenneth-Wong

Advanced yabai wooting scripts

Adversarial vulnerability of powerful near out-of-distribution detection

ShinRL: A Library for Evaluating RL Algorithms from Theoretical and Practical Perspectives

Pyramid Scene Parsing Network, CVPR2017.

Official Pytorch Implementation of 'Learning Action Completeness from Points for Weakly-supervised Temporal Action Localization' (ICCV-21 Oral)

Robbing the FED: Directly Obtaining Private Data in Federated Learning with Modified Models

Epidemiology analysis package

A 10000+ hours dataset for Chinese speech recognition

Generate Contextual Directory Wordlist For Target Org

Tensorflow solution of NER task Using BiLSTM-CRF model with Google BERT Fine-tuning And private Server services

Orbivator AI - To Determine which features of data (measurements) are most important for diagnosing breast cancer and find out if breast cancer occurs or not.

Adaptation through prediction: multisensory active inference torque control

coldcuts is an R package to automatically generate and plot segmentation drawings in R

🐦 Opytimizer is a Python library consisting of meta-heuristic optimization techniques.

SIEM Logstash parsing for more than hundred technologies

SSD: A Unified Framework for Self-Supervised Outlier Detection [ICLR 2021]

[CVPR 2022 Oral] EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation

Hierarchical Uniform Manifold Approximation and Projection

Official Repository of NeurIPS2021 paper: PTR

Official Pytorch implementation of the paper: "Locally Shifted Attention With Early Global Integration"