SPARSEDNN

**If you want to use this repo, please send me an email: [email protected], or raise a Github issue. **

Fast sparse deep learning on CPUs. This is the kernel library generator described in the paper: https://arxiv.org/abs/2101.07948

Python API: python fastsparse.py. Minimal required dependencies. Should work anywhere.

C++ API: check out driver_cpu.cpp, or run autotune_cpu_random.sh 128 128 128 0. This requires cnpy to read numpy files, so make sure that you can link to cnpy.

Python API has some bad overhead due to using ctypes. This is noticeable for smaller matrices but not really noticeable for large matrices. The benchmarkings done in the Arxiv paper was all done with the C++ API.

Work that is not yet open sourced: kernel generator for sparse convolutions (as described in the Arxiv paper) using implicit convolution, lightweight inference engine to get end-to-end results, sparse int8 kernels. If interested in any of this please email.

FAQs:

How does this compare to Neuralmagic? Last time I checked the deepsparse library does not allow you to run kernel-level benchmarks. If you care about end to end neural network acceleration, you should definitely go with Neuralmagic if they happen to support your model.
Future work? This is not exactly along the lines of my PhD thesis so I work on this sparingly. If you want to contribute to this repo you could make a Pytorch or Tensorflow custom op with the Python or C++ API. However it's unclear how gradients would work, and you will have to compile this op with the fixed sparsity pattern, something that the current Pytorch/Tensorflow frameworks might not support that well.

Fast sparse deep learning on CPUs

Related tags

Overview

SPARSEDNN

Owner

Ziheng Wang

NeuroFind - A solution to the to the Task given by the Oberseminar of Messtechnik Institute of TU Dresden in 2021

「PyTorch Implementation of AnimeGANv2」を用いて、生成した顔画像を元の画像に上書きするデモ

Using contrastive learning and OpenAI's CLIP to find good embeddings for images with lossy transformations

FairEdit: Preserving Fairness in Graph Neural Networks through Greedy Graph Editing

Using contrastive learning and OpenAI's CLIP to find good embeddings for images with lossy transformations

FCOSR: A Simple Anchor-free Rotated Detector for Aerial Object Detection

Evaluating Cross-lingual Sentence Representations

Решения, подсказки, тесты и утилиты для тренировки по алгоритмам от Яндекса.

Mengzi Pretrained Models

PyTorch Implementation of the SuRP algorithm by the authors of the AISTATS 2022 paper "An Information-Theoretic Justification for Model Pruning"

efficient neural audio synthesis in the waveform domain

Localized representation learning from Vision and Text (LoVT)

Pytorch Implementation of "Desigining Network Design Spaces", Radosavovic et al. CVPR 2020.

WormMovementSimulation - 3D Simulation of Worm Body Movement with Neurons attached to its body

Learning-Augmented Dynamic Power Management

This is an open solution to the Home Credit Default Risk challenge 🏡

You Only Hypothesize Once: Point Cloud Registration with Rotation-equivariant Descriptors

Randomized Correspondence Algorithm for Structural Image Editing

L-Verse: Bidirectional Generation Between Image and Text

SPLADE: Sparse Lexical and Expansion Model for First Stage Ranking