Official PyTorch implementation of Synergies Between Affordance and Geometry: 6-DoF Grasp Detection via Implicit Representations

Last update: Jan 03, 2023

Overview

Synergies Between Affordance and Geometry: 6-DoF Grasp Detection via Implicit Representations

Zhenyu Jiang, Yifeng Zhu, Maxwell Svetlik, Kuan Fang, Yuke Zhu

Project | arxiv

Introduction

GIGA (Grasp detection via Implicit Geometry and Affordance) is a network that jointly detects 6 DOF grasp poses and reconstruct the 3D scene. GIGA takes advantage of deep implicit functions, a continuous and memory-efficient representation, to enable differentiable training of both tasks. GIGA takes as input a Truncated Signed Distance Function (TSDF) representation of the scene, and predicts local implicit functions for grasp affordance and 3D occupancy. By querying the affordance implict functions with grasp center candidates, we can get grasp quality, grasp orientation and gripper width at these centers. GIGA is trained on a synthetic grasping dataset generated with physics simulation.

Installation

Create a conda environment.
Install packages list in requirements.txt. Then install torch-scatter following here, based on pytorch version and cuda version.
Go to the root directory and install the project locally using pip

pip install -e .

Build ConvONets dependents by running python scripts/convonet_setup.py build_ext --inplace.
Download the data, then unzip and place the data folder under the repo's root. Pretrained models of GIGA, GIGA-Aff and VGN are in data/models.

Self-supervised Data Generation

Raw synthetic grasping trials

Pile scenario:

python scripts/generate_data_parallel.py --scene pile --object-set pile/train --num-grasps 4000000 --num-proc 40 --save-scene ./data/pile/data_pile_train_random_raw_4M

Packed scenario:

python scripts/generate_data_parallel.py --scene packed --object-set packed/train --num-grasps 4000000 --num-proc 40 --save-scene ./data/pile/data_packed_train_random_raw_4M

Please run python scripts/generate_data_parallel.py -h to print all options.

Data clean and processing

First clean and balance the data using:

python scripts/clean_balance_data.py /path/to/raw/data

Then construct the dataset (add noise):

python scripts/construct_dataset_parallel.py --num-proc 40 --single-view --add-noise dex /path/to/raw/data /path/to/new/data

Save occupancy data

Sampling occupancy data on the fly can be very slow and block the training, so I sample and store the occupancy data in files beforehand:

python scripts/save_occ_data_parallel.py /path/to/raw/data 100000 2 --num-proc 40

Please run python scripts/save_occ_data_parallel.py -h to print all options.

Training

Train GIGA

Run:

# GIGA
python scripts/train_giga.py --dataset /path/to/new/data --dataset_raw /path/to/raw/data

Simulated grasping

Run:

python scripts/sim_grasp_multiple.py --num-view 1 --object-set (packed/test | pile/test) --scene （packed ｜ pile) --num-rounds 100 --sideview --add-noise dex --force --best --model /path/to/model --type (vgn | giga | giga_aff) --result-path /path/to/result

This commands will run experiment with each seed specified in the arguments.

Run python scripts/sim_grasp_multiple.py -h to print a complete list of optional arguments.

Related Repositories

Our code is largely based on VGN
We use ConvONets as our backbone.

Official PyTorch implementation of Synergies Between Affordance and Geometry: 6-DoF Grasp Detection via Implicit Representations

Related tags

Overview

Synergies Between Affordance and Geometry: 6-DoF Grasp Detection via Implicit Representations

Introduction

Installation

Self-supervised Data Generation

Raw synthetic grasping trials

Data clean and processing

Save occupancy data

Training

Train GIGA

Simulated grasping

Related Repositories

Owner

UT-Austin Robot Perception and Learning Lab

Official implementation of the MM'21 paper Constrained Graphic Layout Generation via Latent Optimization

Official PyTorch implementation and pretrained models of the paper Self-Supervised Classification Network

PolyGlot, a fuzzing framework for language processors

OpenCVのGrabCut()を利用したセマンティックセグメンテーション向けアノテーションツール(Annotation tool using GrabCut() of OpenCV. It can be used to create datasets for semantic segmentation.)

AbelNN: Deep Learning Python module from scratch

Official implementation of deep Gaussian process (DGP)-based multi-speaker speech synthesis with PyTorch.

Using deep actor-critic model to learn best strategies in pair trading

Annotate with anyone, anywhere.

Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park, Rares Ambrus, Vitor Guizilini, Jie Li, and Adrien Gaidon.

Masked regression code - Masked Regression

STEM: An approach to Multi-source Domain Adaptation with Guarantees

GAN-generated image detection based on CNNs

A pyparsing-based library for parsing SOQL statements

A `Neural = Symbolic` framework for sound and complete weighted real-value logic

Implementation of Memory-Compressed Attention, from the paper "Generating Wikipedia By Summarizing Long Sequences"

Unified API to facilitate usage of pre-trained "perceptor" models, a la CLIP

AITUS - An atomatic notr maker for CYTUS

Retinal Vessel Segmentation with Pixel-wise Adaptive Filters (ISBI 2022)

Source code for From Stars to Subgraphs

[ACL 20] Probing Linguistic Features of Sentence-level Representations in Neural Relation Extraction

Official PyTorch implementation of Synergies Between Affordance and Geometry: 6-DoF Grasp Detection via Implicit Representations

Related tags

Overview

Synergies Between Affordance and Geometry: 6-DoF Grasp Detection via Implicit Representations

Introduction

Installation

Self-supervised Data Generation

Raw synthetic grasping trials

Data clean and processing

Save occupancy data

Training

Train GIGA

Simulated grasping

Related Repositories

Owner

UT-Austin Robot Perception and Learning Lab

Official implementation of the MM'21 paper Constrained Graphic Layout Generation via Latent Optimization

Official PyTorch implementation and pretrained models of the paper Self-Supervised Classification Network

PolyGlot, a fuzzing framework for language processors

OpenCVのGrabCut()を利用したセマンティックセグメンテーション向けアノテーションツール(Annotation tool using GrabCut() of OpenCV. It can be used to create datasets for semantic segmentation.)

AbelNN: Deep Learning Python module from scratch

Official implementation of deep Gaussian process (DGP)-based multi-speaker speech synthesis with PyTorch.

Using deep actor-critic model to learn best strategies in pair trading

Annotate with anyone, anywhere.

Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park*, Rares Ambrus*, Vitor Guizilini, Jie Li, and Adrien Gaidon.

Masked regression code - Masked Regression

STEM: An approach to Multi-source Domain Adaptation with Guarantees

GAN-generated image detection based on CNNs

A pyparsing-based library for parsing SOQL statements

A `Neural = Symbolic` framework for sound and complete weighted real-value logic

Implementation of Memory-Compressed Attention, from the paper "Generating Wikipedia By Summarizing Long Sequences"

Unified API to facilitate usage of pre-trained "perceptor" models, a la CLIP

AITUS - An atomatic notr maker for CYTUS

Retinal Vessel Segmentation with Pixel-wise Adaptive Filters (ISBI 2022)

Source code for From Stars to Subgraphs

[ACL 20] Probing Linguistic Features of Sentence-level Representations in Neural Relation Extraction

Official PyTorch implementation of DD3D: Is Pseudo-Lidar needed for Monocular 3D Object detection? (ICCV 2021), Dennis Park, Rares Ambrus, Vitor Guizilini, Jie Li, and Adrien Gaidon.