基于Paddle框架的fcanet复现

Overview

fcanet-Paddle

基于Paddle框架的fcanet复现

fcanet

本项目基于paddlepaddle框架复现fcanet,并参加百度第三届论文复现赛,将在2021年5月15日比赛完后提供AIStudio链接~敬请期待

参考项目:

frazerlin-fcanet

数据准备

本项目已挂载论文所使用的数据集,对于tgztar文件需要利用以下命令解压

tar -xvf benchmark.tgz
tar xvf VOCtrainval_11-May-2012.tar

整个工程具有以下目录结构

/home/aistudio
|───Data(数据集)
└───────benchmark_RELEASE
└───────VOCdevkit
└───────GrabCut
└───────Berkeley
└───fcanet(代码文件)
└───InitialPaddleModel(初始化权重)

训练

The official PyTorch implementation of CVPR 2020 paper "Interactive Image Segmentation with First Click Attention". 并未提供训练代码。通过邮件联系作者,作者由于企业合作项目原因,合作结束后会将会提供训练代码

测试

模型下载

提取码:2ira

AIStudio链接

验证集测试

python fcanet/evaluate.py --backbone [resnet/res2net] --dataset [GrabCut,Berkeley,DAVIS(not exists in this repo),VOCdevkit] (--sis)

如下图所示,默认的backbone均为101

resnet101测试示例

res2net101测试示例

backbone dataset mNoC mIoU-NoC
resnet101 Berkeley 4.23 [0. 0.728 0.854 0.885 0.912 0.915 0.926 0.935 0.939 0.935 0.94 0.943 0.942 0.944 0.945 0.945 0.947 0.947 0.948 0.947 0.949]
resnet101 GrabCut 2.24 [0. 0.78 0.87 0.923 0.944 0.95 0.956 0.966 0.964 0.971 0.971 0.971 0.975 0.977 0.978 0.979 0.978 0.978 0.979 0.979 0.979]
resnet101 VOC2012 2.9810329734461627 [0. 0.715 0.838 0.885 0.909 0.926 0.937 0.945 0.951 0.957 0.962 0.964 0.967 0.969 0.971 0.973 0.974 0.976 0.977 0.978 0.979]
res2net101 Berkeley 3.98 [0. 0.788 0.872 0.901 0.921 0.93 0.933 0.938 0.938 0.943 0.943 0.943 0.943 0.945 0.947 0.948 0.949 0.949 0.95 0.951 0.95 ]
res2net101 GrabCut 2.16 [0. 0.819 0.877 0.927 0.916 0.931 0.948 0.96 0.966 0.967 0.969 0.971 0.973 0.976 0.977 0.976 0.978 0.977 0.98 0.977 0.979]
res2net101 VOC2012 2.793988911584476 [0. 0.757 0.841 0.882 0.908 0.925 0.937 0.945 0.952 0.958 0.963 0.966 0.968 0.971 0.973 0.974 0.976 0.977 0.978 0.98 0.98 ]

可视化测试

利用annotator.py可以实现可视化操作,感兴趣的读者可是利用Qt实现UI程序,实现效果如下所示

需要注意的是,AIStudio环境暂不支持这种可视化方式,你需要将此仓库部署到本地运行,你可能需要修改代码文件中的路径

python fcanet/annotator.py --backbone res2net --input fcanet/test.jpg --output test_mask.jpg

关于作者

姓名 郭权浩
学校 电子科技大学研2020级
研究方向 计算机视觉
主页 Deep Hao的主页
如有错误,请及时留言纠正,非常蟹蟹!
后续会有更多论文复现系列推出,欢迎大家有问题留言交流学习,共同进步成长!
Owner
QuanHao Guo
master at UESTC
QuanHao Guo
Python code to fuse multiple RGB-D images into a TSDF voxel volume.

Volumetric TSDF Fusion of RGB-D Images in Python This is a lightweight python script that fuses multiple registered color and depth images into a proj

Andy Zeng 845 Jan 03, 2023
Code for "Learning Graph Cellular Automata"

Learning Graph Cellular Automata This code implements the experiments from the NeurIPS 2021 paper: "Learning Graph Cellular Automata" Daniele Grattaro

Daniele Grattarola 37 Oct 26, 2022
A Python module for the generation and training of an entry-level feedforward neural network.

ff-neural-network A Python module for the generation and training of an entry-level feedforward neural network. This repository serves as a repurposin

Riadh 2 Jan 31, 2022
TensorFlow (Python) implementation of DeepTCN model for multivariate time series forecasting.

DeepTCN TensorFlow TensorFlow (Python) implementation of multivariate time series forecasting model introduced in Chen, Y., Kang, Y., Chen, Y., & Wang

Flavia Giammarino 21 Dec 19, 2022
VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations

VolumeGAN - 3D-aware Image Synthesis via Learning Structural and Textural Representations 3D-aware Image Synthesis via Learning Structural and Textura

GenForce: May Generative Force Be with You 116 Dec 26, 2022
Mapping Conditional Distributions for Domain Adaptation Under Generalized Target Shift

This repository contains the official code of OSTAR in "Mapping Conditional Distributions for Domain Adaptation Under Generalized Target Shift" (ICLR 2022).

Matthieu Kirchmeyer 5 Dec 06, 2022
TumorInsight is a Brain Tumor Detection and Classification model built using RESNET50 architecture.

A Brain Tumor Detection and Classification Model built using RESNET50 architecture. The model is also deployed as a web application using Flask framework.

Pranav Khurana 0 Aug 17, 2021
A tf.keras implementation of Facebook AI's MadGrad optimization algorithm

MADGRAD Optimization Algorithm For Tensorflow This package implements the MadGrad Algorithm proposed in Adaptivity without Compromise: A Momentumized,

20 Aug 18, 2022
Recurrent Conditional Query Learning

Recurrent Conditional Query Learning (RCQL) This repository contains the Pytorch implementation of One Model Packs Thousands of Items with Recurrent C

Dongda 4 Nov 28, 2022
Embeddinghub is a database built for machine learning embeddings.

Embeddinghub is a database built for machine learning embeddings.

Featureform 1.2k Jan 01, 2023
Codebase for "ProtoAttend: Attention-Based Prototypical Learning."

Codebase for "ProtoAttend: Attention-Based Prototypical Learning." Authors: Sercan O. Arik and Tomas Pfister Paper: Sercan O. Arik and Tomas Pfister,

47 2 May 17, 2022
SpeechNAS Better Trade off between Latency and Accuracy for Large Scale Speaker Verification

SpeechNAS Better Trade off between Latency and Accuracy for Large Scale Speaker Verification

Wentao Zhu 24 May 20, 2022
This is a five-step framework for the development of intrusion detection systems (IDS) using machine learning (ML) considering model realization, and performance evaluation.

AB-TRAP: building invisibility shields to protect network devices The AB-TRAP framework is applicable to the development of Network Intrusion Detectio

Lab-C2DC - Laboratory of Command and Control and Cyber-security 17 Jan 04, 2023
Normalizing Flows with a resampled base distribution

Resampling Base Distributions of Normalizing Flows Normalizing flows are a popular class of models for approximating probability distributions. Howeve

Vincent Stimper 24 Nov 03, 2022
Voxel Transformer for 3D object detection

Voxel Transformer This is a reproduced repo of Voxel Transformer for 3D object detection. The code is mainly based on OpenPCDet. Introduction We provi

173 Dec 25, 2022
PyTorch ,ONNX and TensorRT implementation of YOLOv4

PyTorch ,ONNX and TensorRT implementation of YOLOv4

4.2k Jan 01, 2023
Fusion-DHL: WiFi, IMU, and Floorplan Fusion for Dense History of Locations in Indoor Environments

Fusion-DHL: WiFi, IMU, and Floorplan Fusion for Dense History of Locations in Indoor Environments Paper: arXiv (ICRA 2021) Video : https://youtu.be/CC

Sachini Herath 68 Jan 03, 2023
Codebase for Amodal Segmentation through Out-of-Task andOut-of-Distribution Generalization with a Bayesian Model

Codebase for Amodal Segmentation through Out-of-Task andOut-of-Distribution Generalization with a Bayesian Model

Yihong Sun 12 Nov 15, 2022
The Turing Change Point Detection Benchmark: An Extensive Benchmark Evaluation of Change Point Detection Algorithms on real-world data

Turing Change Point Detection Benchmark Welcome to the repository for the Turing Change Point Detection Benchmark, a benchmark evaluation of change po

The Alan Turing Institute 85 Dec 28, 2022
PSML: A Multi-scale Time-series Dataset for Machine Learning in Decarbonized Energy Grids

PSML: A Multi-scale Time-series Dataset for Machine Learning in Decarbonized Energy Grids The electric grid is a key enabling infrastructure for the a

Texas A&M Engineering Research 19 Jan 07, 2023