Deep generative models of 3D grids for structure-based drug discovery

Last update: Jan 03, 2023

Overview

What is liGAN?

liGAN is a research codebase for training and evaluating deep generative models for de novo drug design based on 3D atomic density grids. It is based on libmolgrid and the gnina fork of caffe.

VAE paper - 2 minute talk

CVAE paper - 15 minute talk

Dependencies

numpy
pandas
scikit-image
openbabel
rdkit
molgrid
torch
protobuf
gnina version of caffe

Usage

You can use the scripts download_data.sh and download_weights.sh to download the test data and weights that were evaluated in the above papers.

The script generate.py is used to generate atomic density grids and molecular structures from a trained generative model.

Its basic usage can be seen in the scripts generate_vae.sh:

LIG_FILE=$1 # e.g. data/molport/0/102906000_8.sdf

python3 generate.py \
  --data_model_file models/data_48_0.5_molport.model \
  --gen_model_file models/vae.model \
  --gen_weights_file weights/gen_e_0.1_1_disc_x_10_0.molportFULL_rand_.0.0_gen_iter_100000.caffemodel \
  --rec_file data/molport/10gs_rec.pdb \
  --lig_file $LIG_FILE \
  --out_prefix VAE \
  --n_samples 10 \
  --fit_atoms \
  --dkoes_make_mol \
  --output_sdf \
  --output_dx \
  --gpu

And generate_cvae.sh:

REC_FILE=$1 # e.g. data/crossdock2020/PARP1_HUMAN_775_1012_0/2rd6_A_rec.pdb
LIG_FILE=$2 # e.g. data/crossdock2020/PARP1_HUMAN_775_1012_0/2rd6_A_rec_2rd6_78p_lig_tt_min.sdf

python3 generate.py \
  --data_model_file models/data_48_0.5_crossdock.model \
  --gen_model_file models/cvae.model \
  --gen_weights_file weights/lessskip_crossdocked_increased_1.lowrmsd.0_gen_iter_1500000.caffemodel \
  --rec_file $REC_FILE \
  --lig_file $LIG_FILE \
  --out_prefix CVAE \
  --n_samples 10 \
  --fit_atoms \
  --dkoes_make_mol \
  --output_sdf \
  --output_dx \
  --gpu

Both scripts can be run from the root directory of the repository.

Deep generative models of 3D grids for structure-based drug discovery

Related tags

Overview

What is liGAN?

Dependencies

Usage

Owner

Matt Ragoza

Text mining project; Using distilBERT to predict authors in the classification task authorship attribution.

A working implementation of the Categorical DQN (Distributional RL).

Scientific Computation Methods in C and Python (Open for Hacktoberfest 2021)

Minecraft Hack Detection With Python

Generating Digital Painting Lighting Effects via RGB-space Geometry (SIGGRAPH2020/TOG2020)

SNE-RoadSeg in PyTorch, ECCV 2020

Constrained Logistic Regression - How to apply specific constraints to logistic regression's coefficients

Code for PackNet: Adding Multiple Tasks to a Single Network by Iterative Pruning

Yolov5 deepsort inference，使用YOLOv5+Deepsort实现车辆行人追踪和计数，代码封装成一个Detector类，更容易嵌入到自己的项目中

PiRank: Learning to Rank via Differentiable Sorting

3.8% and 18.3% on CIFAR-10 and CIFAR-100

The PyTorch implementation of Directed Graph Contrastive Learning (DiGCL), NeurIPS-2021

Implementation for paper MLP-Mixer: An all-MLP Architecture for Vision

A library that allows for inference on probabilistic models

Official implementation of Self-supervised Image-to-text and Text-to-image Synthesis

A spherical CNN for weather forecasting

Deep Inertial Prediction (DIPr)

PartImageNet is a large, high-quality dataset with part segmentation annotations

Official repo for our 3DV 2021 paper "Monocular 3D Reconstruction of Interacting Hands via Collision-Aware Factorized Refinements".

Learning Lightweight Low-Light Enhancement Network using Pseudo Well-Exposed Images