Monk is a low code Deep Learning tool and a unified wrapper for Computer Vision.

Overview

Monk - A computer vision toolkit for everyone Tweet

Version Build_Status


Why use Monk

  • Issue: Want to begin learning computer vision

    • Solution: Start with Monk's hands-on study roadmap tutorials
  • Issue: Multiple libraries hence multiple syntaxes to learn

    • Solution: Monk's one syntax to rule them all - pytorch, keras, mxnet, etc
  • Issue: Tough to keep track of all the trial projects while participating in a deep learning competition

    • Solution: Use monk's project management and work on multiple prototyping experiments
  • Issue: Tough to set hyper-parameters while training a classifier

    • Solution: Try out hyper-parameter analyser to find the right fit
  • Issue: Looking for a library to build quick solutions for your customer

    • Solution: Train, Infer and deploy with monk's low-code syntax


Create real-world Image Classification applications

Medical Domain Fashion Domain Autonomous Vehicles Domain
Agriculture Domain Wildlife Domain Retail Domain
Satellite Domain Healthcare Domain Activity Analysis Domain

...... For more check out the Application Model Zoo!!!!



How does Monk make image classification easy

  • Write less code and create end to end applications.
  • Learn only one syntax and create applications using any deep learning library - pytorch, mxnet, keras, tensorflow, etc
  • Manage your entire project easily with multiple experiments


For whom this library is built

  • Students
    • Seamlessly learn computer vision using our comprehensive study roadmaps
  • Researchers and Developers
    • Create and Manage multiple deep learning projects
  • Competiton participants (Kaggle, Codalab, Hackerearth, AiCrowd, etc)
    • Expedite the prototyping process and jumpstart with a higher rank


Table of Contents




Sample Showcase - Quick Mode

Create an image classifier.

#Create an experiment
ptf.Prototype("sample-project-1", "sample-experiment-1")

#Load Data
ptf.Default(dataset_path="sample_dataset/", 
             model_name="resnet18", 
             num_epochs=2)
# Train
ptf.Train()

Inference

predictions = ptf.Infer(img_name="sample.png", return_raw=True);

Compare Experiments

#Create comparison project
ctf.Comparison("Sample-Comparison-1");

#Add all your experiments
ctf.Add_Experiment("sample-project-1", "sample-experiment-1");
ctf.Add_Experiment("sample-project-1", "sample-experiment-2");
   
# Generate statistics
ctf.Generate_Statistics();



Installation

  • CUDA 9.0          : pip install -U monk-cuda90
  • CUDA 9.0          : pip install -U monk-cuda92
  • CUDA 10.0        : pip install -U monk-cuda100
  • CUDA 10.1        : pip install -U monk-cuda101
  • CUDA 10.2        : pip install -U monk-cuda102
  • CPU (+Mac-OS) : pip install -U monk-cpu
  • Google Colab   : pip install -U monk-colab
  • Kaggle              : pip install -U monk-kaggle

For More Installation instructions visit: Link




Study Roadmaps




Documentation




TODO-2020

Features

  • Model Visualization
  • Pre-processed data visualization
  • Learned feature visualization
  • NDimensional data input - npy - hdf5 - dicom - tiff
  • Multi-label Image Classification
  • Custom model development

General

  • Functional Documentation
  • Tackle Multiple versions of libraries
  • Add unit-testing
  • Contribution guidelines
  • Python pip packaging support

Backend Support

  • Tensorflow 2.0 provision support with v1
  • Tensorflow 2.0 complete
  • Chainer

External Libraries

  • TensorRT Acceleration
  • Intel Acceleration
  • Echo AI - for Activation functions


Connect with the project contributors



Copyright

Copyright 2019 onwards, Tessellate Imaging Private Limited Licensed under the Apache License, Version 2.0 (the "License"); you may not use this project's files except in compliance with the License. A copy of the License is provided in the LICENSE file in this repository.

Owner
Tessellate Imaging
Computer Vision and Deep Learning Consultance and Development
Tessellate Imaging
Code for CPM-2 Pre-Train

CPM-2 Pre-Train Pre-train CPM-2 此分支为110亿非 MoE 模型的预训练代码,MoE 模型的预训练代码请切换到 moe 分支 CPM-2技术报告请参考link。 0 模型下载 请在智源资源下载页面进行申请,文件介绍如下: 文件名 描述 参数大小 100000.tar

Tsinghua AI 136 Dec 28, 2022
The implementation of the paper "HIST: A Graph-based Framework for Stock Trend Forecasting via Mining Concept-Oriented Shared Information".

The HIST framework for stock trend forecasting The implementation of the paper "HIST: A Graph-based Framework for Stock Trend Forecasting via Mining C

Wentao Xu 110 Dec 27, 2022
SysWhispers Shellcode Loader

Shhhloader Shhhloader is a SysWhispers Shellcode Loader that is currently a Work in Progress. It takes raw shellcode as input and compiles a C++ stub

icyguider 630 Jan 03, 2023
MohammadReza Sharifi 27 Dec 13, 2022
Implementations for the ICLR-2021 paper: SEED: Self-supervised Distillation For Visual Representation.

Implementations for the ICLR-2021 paper: SEED: Self-supervised Distillation For Visual Representation.

Jacob 27 Oct 23, 2022
Improving Transferability of Representations via Augmentation-Aware Self-Supervision

Improving Transferability of Representations via Augmentation-Aware Self-Supervision Accepted to NeurIPS 2021 TL;DR: Learning augmentation-aware infor

hankook 38 Sep 16, 2022
Predict stock movement with Machine Learning and Deep Learning algorithms

Project Overview Stock market movement prediction using LSTM Deep Neural Networks and machine learning algorithms Software and Library Requirements Th

Naz Delam 46 Sep 13, 2022
Implementation of Rotary Embeddings, from the Roformer paper, in Pytorch

Rotary Embeddings - Pytorch A standalone library for adding rotary embeddings to transformers in Pytorch, following its success as relative positional

Phil Wang 110 Dec 30, 2022
A framework for annotating 3D meshes using the predictions of a 2D semantic segmentation model.

Semantic Meshes A framework for annotating 3D meshes using the predictions of a 2D semantic segmentation model. Paper If you find this framework usefu

Florian 40 Dec 09, 2022
An experimentation and research platform to investigate the interaction of automated agents in an abstract simulated network environments.

CyberBattleSim April 8th, 2021: See the announcement on the Microsoft Security Blog. CyberBattleSim is an experimentation research platform to investi

Microsoft 1.5k Dec 25, 2022
The implementation of 'Image synthesis via semantic composition'.

Image synthesis via semantic synthesis [Project Page] by Yi Wang, Lu Qi, Ying-Cong Chen, Xiangyu Zhang, Jiaya Jia. Introduction This repository gives

DV Lab 71 Jan 06, 2023
Deep Q-network learning to play flappybird.

AI Plays Flappy Bird I've trained a DQN that learns to play flappy bird on it's own. Try the pre-trained model First install the pip requirements and

Anish Shrestha 3 Mar 01, 2022
Agent-based model simulator for air quality and pandemic risk assessment in architectural spaces

Agent-based model simulation for air quality and pandemic risk assessment in architectural spaces. User Guide archABM is a fast and open source agent-

Vicomtech 10 Dec 05, 2022
Image super-resolution through deep learning

srez Image super-resolution through deep learning. This project uses deep learning to upscale 16x16 images by a 4x factor. The resulting 64x64 images

David Garcia 5.3k Dec 28, 2022
A code implementation of AC-GC: Activation Compression with Guaranteed Convergence, in NeurIPS 2021.

Code For AC-GC: Lossy Activation Compression with Guaranteed Convergence This code is intended to be used as a supplemental material for submission to

Dave Evans 2 Nov 01, 2022
A PyTorch Implementation of ViT (Vision Transformer)

ViT - Vision Transformer This is an implementation of ViT - Vision Transformer by Google Research Team through the paper "An Image is Worth 16x16 Word

Quan Nguyen 7 May 11, 2022
Code for Talking Face Generation by Adversarially Disentangled Audio-Visual Representation (AAAI 2019)

Talking Face Generation by Adversarially Disentangled Audio-Visual Representation (AAAI 2019) We propose Disentangled Audio-Visual System (DAVS) to ad

Hang_Zhou 750 Dec 23, 2022
Stacked Recurrent Hourglass Network for Stereo Matching

SRH-Net: Stacked Recurrent Hourglass Introduction This repository is supplementary material of our RA-L submission, which helps reviewers to understan

28 Jan 03, 2023
This repository contains the source code of an efficient 1D probabilistic model for music time analysis proposed in ICASSP2022 venue.

Jump Reward Inference for 1D Music Rhythmic State Spaces An implementation of the probablistic jump reward inference model for music rhythmic informat

Mojtaba Heydari 25 Dec 16, 2022
Pose estimation for iOS and android using TensorFlow 2.0

💃 Mobile 2D Single Person (Or Your Own Object) Pose Estimation for TensorFlow 2.0 This repository is forked from edvardHua/PoseEstimationForMobile wh

tucan9389 165 Nov 16, 2022