CvT-ASSD: Convolutional vision-Transformerbased Attentive Single Shot MultiBox Detector (ICTAI 2021 CCF-C 会议)The 33rd IEEE International Conference on Tools with Artificial Intelligence

Last update: Mar 07, 2022

Related tags

Deep Learning CvT-ASSD

Overview

CvT-ASSD

including extra CvT, CvT-SSD, VGG-ASSD models

original-code-website:

https://github.com/albert-jin/CvT-SSD

new-code-website:

https://github.com/albert-jin/CvT-ASSD

为了符合开源号召,本项目于2021-7-12 正式开源...

project architecture:

Mentions

You may probably need to install an anaconda environment which contains all packages followed.
- pytorch 1.9.0 py3.7_cuda10.2_cudnn7_0 pytorch
- cudatoolkit 10.2.89 h74a9793_1
- opencv-python 4.5.2.54 pypi_0 pypi
- visdom 0.1.8.9 pypi_0 pypi
- yacs 0.1.8 pypi_0 pypi
- jupyter 1.0.0 pypi_0 pypi
For training, an NVIDIA GPU is strongly recommended for speed. we use two NVIDIA GTX-1080TI, but we recommend GPUs like Tesla-V100 /RTX-3090 for more memory
Before you run the codes for self-study or reappearance the performance in this paper "CvT-ASSD", please add the CvT_SSD/model/ directory into sources Root caused by the reference of many codes inside of model directory
you should download the pytorch parameters file postfix by ".pth" and move into models/CvT/weights like 项目结构.PNG
图像物体检测benchmark(参照论文native-SSD)一般是将VOC2007—TEST的数据作为模型的测试集,训练集可有以下搭配:
- 1. 07:VOC2007 trainval 训练集验证集
- 1. 02+12 VOC2007 trainval + VOC2007 trainval 训练集验证集
- 1. 07+12+COCO 在 COCO trainval35k上预训练,然后在07+12上微调
评价指标maP使用mxnet提供的VOC07MApMetric,将recall分成10等分,继而对所有precision取平均,在对类别去平均,具体参见 https://blog.csdn.net/u014203453/article/details/77598997

CvT-ASSD: Convolutional vision-Transformerbased Attentive Single Shot MultiBox Detector (ICTAI 2021 CCF-C 会议)The 33rd IEEE International Conference on Tools with Artificial Intelligence

Related tags

Overview

CvT-ASSD

including extra CvT, CvT-SSD, VGG-ASSD models

original-code-website:

new-code-website:

为了符合开源号召,本项目于2021-7-12 正式开源...

project architecture:

Mentions

Owner

金伟强 -上海大学人工智能小渣渣~

Memory Defense: More Robust Classificationvia a Memory-Masking Autoencoder

3rd Place Solution of the Traffic4Cast Core Challenge @ NeurIPS 2021

NLG evaluation via Statistical Measures of Similarity: BaryScore, DepthScore, InfoLM

Official repository for "Restormer: Efficient Transformer for High-Resolution Image Restoration". SOTA results for single-image motion deblurring, image deraining, image denoising (synthetic and real data), and dual-pixel defocus deblurring.

Point Cloud Registration using Representative Overlapping Points.

SweiNet is an uncertainty-quantifying shear wave speed (SWS) estimator for ultrasound shear wave elasticity (SWE) imaging.

High-fidelity 3D Model Compression based on Key Spheres

Video2x - A lossless video/GIF/image upscaler achieved with waifu2x, Anime4K, SRMD and RealSR.

Scripts and outputs related to the paper Prediction of Adverse Biological Effects of Chemicals Using Knowledge Graph Embeddings.

Code accompanying paper: Meta-Learning to Improve Pre-Training

This repository will be a summary and outlook on all our open, medical, AI advancements.

Official PyTorch implementation of UACANet: Uncertainty Aware Context Attention for Polyp Segmentation

This is the official implement of paper "ActionCLIP: A New Paradigm for Action Recognition"

Classification Modeling: Probability of Default

Repository for the paper "Exploring the Sensory Spaces of English Perceptual Verbs in Natural Language Data"

A general framework for deep learning experiments under PyTorch based on pytorch-lightning

Differentiable rasterization applied to 3D model simplification tasks

Keras-1D-ACGAN-Data-Augmentation

Spatial Contrastive Learning for Few-Shot Classification (SCL)

Husein pet projects in here!