Monk is a low code Deep Learning tool and a unified wrapper for Computer Vision.

Overview

Monk - A computer vision toolkit for everyone Tweet

Version Build_Status


Why use Monk

  • Issue: Want to begin learning computer vision

    • Solution: Start with Monk's hands-on study roadmap tutorials
  • Issue: Multiple libraries hence multiple syntaxes to learn

    • Solution: Monk's one syntax to rule them all - pytorch, keras, mxnet, etc
  • Issue: Tough to keep track of all the trial projects while participating in a deep learning competition

    • Solution: Use monk's project management and work on multiple prototyping experiments
  • Issue: Tough to set hyper-parameters while training a classifier

    • Solution: Try out hyper-parameter analyser to find the right fit
  • Issue: Looking for a library to build quick solutions for your customer

    • Solution: Train, Infer and deploy with monk's low-code syntax


Create real-world Image Classification applications

Medical Domain Fashion Domain Autonomous Vehicles Domain
Agriculture Domain Wildlife Domain Retail Domain
Satellite Domain Healthcare Domain Activity Analysis Domain

...... For more check out the Application Model Zoo!!!!



How does Monk make image classification easy

  • Write less code and create end to end applications.
  • Learn only one syntax and create applications using any deep learning library - pytorch, mxnet, keras, tensorflow, etc
  • Manage your entire project easily with multiple experiments


For whom this library is built

  • Students
    • Seamlessly learn computer vision using our comprehensive study roadmaps
  • Researchers and Developers
    • Create and Manage multiple deep learning projects
  • Competiton participants (Kaggle, Codalab, Hackerearth, AiCrowd, etc)
    • Expedite the prototyping process and jumpstart with a higher rank


Table of Contents




Sample Showcase - Quick Mode

Create an image classifier.

#Create an experiment
ptf.Prototype("sample-project-1", "sample-experiment-1")

#Load Data
ptf.Default(dataset_path="sample_dataset/", 
             model_name="resnet18", 
             num_epochs=2)
# Train
ptf.Train()

Inference

predictions = ptf.Infer(img_name="sample.png", return_raw=True);

Compare Experiments

#Create comparison project
ctf.Comparison("Sample-Comparison-1");

#Add all your experiments
ctf.Add_Experiment("sample-project-1", "sample-experiment-1");
ctf.Add_Experiment("sample-project-1", "sample-experiment-2");
   
# Generate statistics
ctf.Generate_Statistics();



Installation

  • CUDA 9.0          : pip install -U monk-cuda90
  • CUDA 9.0          : pip install -U monk-cuda92
  • CUDA 10.0        : pip install -U monk-cuda100
  • CUDA 10.1        : pip install -U monk-cuda101
  • CUDA 10.2        : pip install -U monk-cuda102
  • CPU (+Mac-OS) : pip install -U monk-cpu
  • Google Colab   : pip install -U monk-colab
  • Kaggle              : pip install -U monk-kaggle

For More Installation instructions visit: Link




Study Roadmaps




Documentation




TODO-2020

Features

  • Model Visualization
  • Pre-processed data visualization
  • Learned feature visualization
  • NDimensional data input - npy - hdf5 - dicom - tiff
  • Multi-label Image Classification
  • Custom model development

General

  • Functional Documentation
  • Tackle Multiple versions of libraries
  • Add unit-testing
  • Contribution guidelines
  • Python pip packaging support

Backend Support

  • Tensorflow 2.0 provision support with v1
  • Tensorflow 2.0 complete
  • Chainer

External Libraries

  • TensorRT Acceleration
  • Intel Acceleration
  • Echo AI - for Activation functions


Connect with the project contributors



Copyright

Copyright 2019 onwards, Tessellate Imaging Private Limited Licensed under the Apache License, Version 2.0 (the "License"); you may not use this project's files except in compliance with the License. A copy of the License is provided in the LICENSE file in this repository.

Owner
Tessellate Imaging
Computer Vision and Deep Learning Consultance and Development
Tessellate Imaging
Python parser for DTED data.

DTED Parser This is a package written in pure python (with help from numpy) to parse and investigate Digital Terrain Elevation Data (DTED) files. This

Ben Bonenfant 12 Dec 18, 2022
Dense Passage Retriever - is a set of tools and models for open domain Q&A task.

Dense Passage Retrieval Dense Passage Retrieval (DPR) - is a set of tools and models for state-of-the-art open-domain Q&A research. It is based on the

Meta Research 1.1k Jan 03, 2023
A simple baseline for the 2022 IEEE GRSS Data Fusion Contest (DFC2022)

DFC2022 Baseline A simple baseline for the 2022 IEEE GRSS Data Fusion Contest (DFC2022) This repository uses TorchGeo, PyTorch Lightning, and Segmenta

isaac 24 Nov 28, 2022
The implementation of the paper "A Deep Feature Aggregation Network for Accurate Indoor Camera Localization".

A Deep Feature Aggregation Network for Accurate Indoor Camera Localization This is the PyTorch implementation of our paper "A Deep Feature Aggregation

9 Dec 09, 2022
Learning Logic Rules for Document-Level Relation Extraction

LogiRE Learning Logic Rules for Document-Level Relation Extraction We propose to introduce logic rules to tackle the challenges of doc-level RE. Equip

41 Dec 26, 2022
Pointer-generator - Code for the ACL 2017 paper Get To The Point: Summarization with Pointer-Generator Networks

Note: this code is no longer actively maintained. However, feel free to use the Issues section to discuss the code with other users. Some users have u

Abi See 2.1k Jan 04, 2023
[TIP 2021] SADRNet: Self-Aligned Dual Face Regression Networks for Robust 3D Dense Face Alignment and Reconstruction

SADRNet Paper link: SADRNet: Self-Aligned Dual Face Regression Networks for Robust 3D Dense Face Alignment and Reconstruction Requirements python

Multimedia Computing Group, Nanjing University 99 Dec 30, 2022
Parameterising Simulated Annealing for the Travelling Salesman Problem

Parameterising Simulated Annealing for the Travelling Salesman Problem

Gary Sun 55 Jun 15, 2022
使用深度学习框架提取视频硬字幕;docker容器免安装深度学习库,使用本地api接口使得界面和后端识别分离;

extract-video-subtittle 使用深度学习框架提取视频硬字幕; 本地识别无需联网; CPU识别速度可观; 容器提供API接口; 运行环境 本项目运行环境非常好搭建,我做好了docker容器免安装各种深度学习包; 提供windows界面操作; 容器为CPU版本; 视频演示 https

歌者 16 Aug 06, 2022
Probabilistic Entity Representation Model for Reasoning over Knowledge Graphs

Implementation for the paper: Probabilistic Entity Representation Model for Reasoning over Knowledge Graphs, Nurendra Choudhary, Nikhil Rao, Sumeet Ka

Nurendra Choudhary 8 Nov 15, 2022
Stereo Hybrid Event-Frame (SHEF) Cameras for 3D Perception, IROS 2021

For academic use only. Stereo Hybrid Event-Frame (SHEF) Cameras for 3D Perception Ziwei Wang, Liyuan Pan, Yonhon Ng, Zheyu Zhuang and Robert Mahony Th

Ziwei Wang 11 Jan 04, 2023
Official pytorch implementation of Rainbow Memory (CVPR 2021)

Rainbow Memory: Continual Learning with a Memory of Diverse Samples

Clova AI Research 91 Dec 17, 2022
Weight estimation in CT by multi atlas techniques

maweight A Python package for multi-atlas based weight estimation for CT images, including segmentation by registration, feature extraction and model

György Kovács 0 Dec 24, 2021
How to Become More Salient? Surfacing Representation Biases of the Saliency Prediction Model

How to Become More Salient? Surfacing Representation Biases of the Saliency Prediction Model

Bogdan Kulynych 49 Nov 05, 2022
Cognate Detection Repository

Cognate Detection Repository Details This repository contains the data for two publications: Challenge Dataset of Cognates and False Friend Pairs from

Diptesh Kanojia 1 Apr 26, 2022
A working implementation of the Categorical DQN (Distributional RL).

Categorical DQN. Implementation of the Categorical DQN as described in A distributional Perspective on Reinforcement Learning. Thanks to @tudor-berari

Florin Gogianu 98 Sep 20, 2022
A selection of State Of The Art research papers (and code) on human locomotion (pose + trajectory) prediction (forecasting)

A selection of State Of The Art research papers (and code) on human trajectory prediction (forecasting). Papers marked with [W] are workshop papers.

Karttikeya Manglam 40 Nov 18, 2022
[LREC] MMChat: Multi-Modal Chat Dataset on Social Media

MMChat This repo contains the code and data for the LREC2022 paper MMChat: Multi-Modal Chat Dataset on Social Media. Dataset MMChat is a large-scale d

Silver 47 Jan 03, 2023
CLOOB: Modern Hopfield Networks with InfoLOOB Outperform CLIP

CLOOB: Modern Hopfield Networks with InfoLOOB Outperform CLIP Andreas Fürst* 1, Elisabeth Rumetshofer* 1, Viet Tran1, Hubert Ramsauer1, Fei Tang3, Joh

Institute for Machine Learning, Johannes Kepler University Linz 133 Jan 04, 2023
Finding Biological Plausibility for Adversarially Robust Features via Metameric Tasks

Adversarially-Robust-Periphery Code + Data from the paper "Finding Biological Plausibility for Adversarially Robust Features via Metameric Tasks" by A

Anne Harrington 2 Feb 07, 2022