Attention-driven Robotic Manipulation (ARM)

This codebase is home to:

Q-attention: Enabling Efficient Learning for Vision-based Robotic Manipulation

Installation

ARM is trained using the YARR framework. Head to the YARR github page and follow installation instructions.

ARM is evaluated on RLBench 1.1.0. Head to the RLBench github page and follow installation instructions.

Now install project requirements:

pip install -r requirements.txt

Running experiments

Be sure to have RLBench demos saved on your machine before proceeding. To generate demos for a task, go to the tools directory in RLBench (rlbench/tools), and run:

python dataset_generator.py --save_path=/mnt/my/save/dir --tasks=take_lid_off_saucepan --image_size=128,128 \
--renderer=opengl --episodes_per_task=100 --variations=1 --processes=1

Experiments are launched via Hydra. To start training an agent to accomplish take_lid_off_saucepan with the default parameters on gpu 0, then run:

python launch.py method=ARM rlbench.task=take_lid_off_saucepan rlbench.demo_path=/mnt/my/save/dir framework.gpu=0

Attention-driven Robot Manipulation (ARM) which includes Q-attention

Related tags

Overview

Attention-driven Robotic Manipulation (ARM)

Installation

Running experiments

Owner

Stephen James

Generate saved_model, tfjs, tf-trt, EdgeTPU, CoreML, quantized tflite and .pb from .tflite.

A spherical CNN for weather forecasting

A tensorflow implementation of an HMM layer

PCAM: Product of Cross-Attention Matrices for Rigid Registration of Point Clouds

[CVPR'22] COAP: Learning Compositional Occupancy of People

True Few-Shot Learning with Language Models

Hardware-accelerated DNN model inference ROS2 packages using NVIDIA Triton/TensorRT for both Jetson and x86_64 with CUDA-capable GPU

Find-Lane-Line - Use openCV library and Python to detect the road-lane-line

Code for Environment Inference for Invariant Learning (ICML 2020 UDL Workshop Paper)

[CVPR 2021] NormalFusion: Real-Time Acquisition of Surface Normals for High-Resolution RGB-D Scanning

This code is an implementation for Singing TTS.

Run Effective Large Batch Contrastive Learning on Limited Memory GPU

Semi-Supervised Semantic Segmentation via Adaptive Equalization Learning, NeurIPS 2021 (Spotlight)

This repository contains the implementation of the following paper: Cross-Descriptor Visual Localization and Mapping

The CLRS Algorithmic Reasoning Benchmark

The repo of Feedback Networks, CVPR17

TensorFlow for Raspberry Pi

A very simple tool to rewrite parameters such as attributes and constants for OPs in ONNX models. Simple Attribute and Constant Modifier for ONNX.

Official code for paper Exemplar Based 3D Portrait Stylization.

Diverse Object-Scene Compositions For Zero-Shot Action Recognition