一个多模态内容理解算法框架，其中包含数据处理、预训练模型、常见模型以及模型加速等模块。

Last update: Dec 22, 2022

Related tags

Deep Learning Lichee

Overview

框架简介

方便使用，支持多模态，多任务的统一训练框架

能力列表：

bert + 分类任务
自定义任务训练（插件注册）

框架设计

框架采用分层的思想组织模型训练流程。

DATA 层负责读取用户数据，根据 field 管理数据。
Parser 层负责转换原始数据为模型的输入。
MODEL 层为模型层，具体由表示层（REPRESENTATION）和任务层（TASK）组成。
- 表示层用于抽取数据的高维特征，框架里内置了一些成熟实现（包括bert、NeXtVLAD等）。
- 任务层用于拟合具体的训练任务，框架里提供一些默认实现（包括分类任务等），用户也可以根据训练任务，自定义任务模型。
- 任务层可用于实现多任务训练。
框架通过配置文件组合 DATA、Parser、MODEL、Optimizer、Scheduler，构建具体的训练流程。
框架还内置了成熟的组件模块（Module），包括 Metrics、Loss、Layer 等，供用户选择使用。

详细可参考文档

框架安装

参考文档

使用说明

cd examples/base_bert_cls_mac
sh train.sh
sh eval.sh
sh predict.sh

模型训练任务被拆分为三步，每个步骤可以独立运行：

任务执行均依赖配置文件，详细介绍可参考文档

若框架默认实现无法满足需求，也可以实现自定义插件，详细介绍可参考文档

Contributing

如果你有好的意见或建议，欢迎给我们提 Issues 或 Pull Requests，为蓝鲸开源社区贡献力量。关于标准运维分支管理、Issue 以及 PR 规范，请阅读 Contributing Guide。

一个多模态内容理解算法框架，其中包含数据处理、预训练模型、常见模型以及模型加速等模块。

Related tags

Overview

Overview

框架简介

框架设计

框架安装

使用说明

Contributing

Owner

Tencent

Theano is a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. It can use GPUs and perform efficient symbolic differentiation.

Python implementation of "Elliptic Fourier Features of a Closed Contour"

Supervised 3D Pre-training on Large-scale 2D Natural Image Datasets for 3D Medical Image Analysis

Official implementation of "A Shared Representation for Photorealistic Driving Simulators" in PyTorch.

Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"

PyTorch Implementation of "Light Field Image Super-Resolution with Transformers"

Unofficial implementation of MLP-Mixer: An all-MLP Architecture for Vision

METS/ALTO OCR enhancing tool by the National Library of Luxembourg (BnL)

Pytorch Implementation of paper "Noisy Natural Gradient as Variational Inference"

Code for "3D Human Pose and Shape Regression with Pyramidal Mesh Alignment Feedback Loop"

BEGAN in PyTorch

BTC-Generator - BTC Generator With Python

High-Resolution Image Synthesis with Latent Diffusion Models

You Only Look One-level Feature (YOLOF), CVPR2021, Detectron2

Cross View SLAM

This project aims at providing a concise, easy-to-use, modifiable reference implementation for semantic segmentation models using PyTorch.

Driller: augmenting AFL with symbolic execution!

You Only Look Once for Panopitic Driving Perception

InsightFace: 2D and 3D Face Analysis Project on MXNet and PyTorch

An pytorch implementation of Masked Autoencoders Are Scalable Vision Learners