iftopt
An Implicit Function Theorem (IFT) optimizer for bi-level optimization problems.
Requirements
- Python 3.7+
- PyTorch 1.x
Installation
$ pip install git+https://github.com/money-shredder/iftopt.git
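As an optional sanity check (not part of the project's documented steps), you can verify that the package is importable after installation:
$ python -c "import iftopt"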
Usage
Assuming a bi-level optimization of the form:
y* = argmin_{y} val_loss(x*, y), where x* = argmin_{x} train_loss(x, y).
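For orientation (this sketch of the math is not quoted from the library's documentation): by the implicit function theorem, the inner stationarity condition ∂train_loss/∂x = 0 defines x* as an implicit function of y, and differentiating through it gives the hyper-gradient
d val_loss / dy = ∂val_loss/∂y - ∂²train_loss/∂y∂x · (∂²train_loss/∂x²)^{-1} · ∂val_loss/∂x.
The inverse-Hessian-vector product is typically approximated iteratively; the vih_lr and vih_iterations arguments below appear to control that approximation.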
To solve for the optimal x* and y* in the optimization problem, we can implement the following with iftopt:
import torch
from iftopt import HyperOptimizer

train_lr = val_lr = 0.1
# parameter to minimize the training loss
x = torch.nn.Parameter(...)
# hyper-parameter to minimize the validation loss
y = torch.nn.Parameter(...)
# training loss optimizer
opt = torch.optim.SGD([x], lr=train_lr)
# validation loss optimizer
hopt = HyperOptimizer(
    [y], torch.optim.SGD([y], lr=val_lr), vih_lr=0.1, vih_iterations=5)
# outer optimization loop for y
for _ in range(...):
    # inner optimization loop for x
    for _ in range(...):
        z = train_loss(x, y)
        # inner optimization step for x
        opt.zero_grad()
        z.backward()
        opt.step()
    # outer optimization step for y
    hopt.set_train_parameters([x])
    z = train_loss(x, y)
    hopt.train_step(z)
    v = val_loss(x, y)
    hopt.val_step(v)
    hopt.grad()
    hopt.step()
For a simple, concrete example, check out and run demo.py, where
train_loss = lambda x, y: (x + y) ** 2
val_loss = lambda x, y: x ** 2
with x = y = 1.0 initially. Running it generates a video, demo.mp4, of the optimization trajectory (the animation below). Note that although the validation loss has no direct gradient with respect to the hyper-parameter y, iftopt can still minimize the validation loss by computing the hyper-gradient via the implicit function theorem.
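To see those moving parts end-to-end without the plotting code, the demo boils down to roughly the sketch below. This is a minimal sketch assuming the API usage shown above; the loop counts, learning rates, and vih_* settings here are illustrative and need not match demo.py.

import torch
from iftopt import HyperOptimizer

train_loss = lambda x, y: (x + y) ** 2
val_loss = lambda x, y: x ** 2

# start from x = y = 1.0
x = torch.nn.Parameter(torch.tensor(1.0))
y = torch.nn.Parameter(torch.tensor(1.0))
opt = torch.optim.SGD([x], lr=0.1)
hopt = HyperOptimizer(
    [y], torch.optim.SGD([y], lr=0.1), vih_lr=0.1, vih_iterations=5)

for step in range(50):      # outer loop over the hyper-parameter y
    for _ in range(20):     # inner loop: approximately solve for x*
        z = train_loss(x, y)
        opt.zero_grad()
        z.backward()
        opt.step()
    # hyper-gradient step on y via the implicit function theorem
    hopt.set_train_parameters([x])
    hopt.train_step(train_loss(x, y))
    hopt.val_step(val_loss(x, y))
    hopt.grad()
    hopt.step()
    print(f'step {step}: x = {x.item():.4f}, y = {y.item():.4f}')

Since the inner loop drives x towards -y, the validation loss x ** 2 equals y ** 2 at the inner optimum, so the hyper-gradient steps should push y (and hence x) towards 0.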
