Code for the paper 'A High Performance CRF Model for Clothes Parsing'.

Last update: Nov 21, 2022

Clothes Parsing

Overview

This code provides an implementation of the research paper:

  A High Performance CRF Model for Clothes Parsing
  Edgar Simo-Serra, Sanja Fidler, Francesc Moreno-Noguer, and Raquel Urtasun
  Asian Conference on Computer Vision (ACCV), 2014

The code here allows training and testing of a model that achieved state-of-the-art results on the Fashionista dataset at the time of publication.

License

  Copyright (C) <2014> <Edgar Simo-Serra, Sanja Fidler, Francesc Moreno-Noguer, Raquel Urtasun>

  This work is licensed under the Creative Commons
  Attribution-NonCommercial-ShareAlike 4.0 International License. To view a copy
  of this license, visit http://creativecommons.org/licenses/by-nc-sa/4.0/ or
  send a letter to Creative Commons, PO Box 1866, Mountain View, CA 94042, USA.

  Edgar Simo-Serra, Institut de Robotica i Informatica Industrial (CSIC/UPC), December 2014.
  [email protected], http://www-iri.upc.es/people/esimo/

Installation

To get started, first check out the source code and then download and extract the features:

# Clone the git repository and cd into it as the working directory
git clone https://github.com/bobbens/clothes_parsing.git
cd clothes_parsing
# Get and unpack the necessary features
wget http://hi.cs.waseda.ac.jp/~esimo//data/poseseg.tar.bz2
tar xvjf poseseg.tar.bz2

The dSP dependency must also be compiled. This can be done by:

cd lib/dSP_5.1
make # First edit the Makefile if necessary
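
Before running the commands above, it can help to confirm that the command-line tools they rely on are installed. The following is a minimal sketch and not part of the repository: check_tools is a hypothetical helper name, and the tool list is an assumption drawn from the commands shown in this section (bzip2 is included because GNU tar typically calls it when unpacking with 'xvjf').

```shell
#!/bin/sh
# check_tools is a hypothetical helper, not part of the repository.
# It reports which of the named commands cannot be found on PATH.
check_tools() {
    missing=""
    for tool in "$@"; do
        # 'command -v' is the POSIX way to test whether a command exists
        if ! command -v "$tool" >/dev/null 2>&1; then
            missing="$missing $tool"
        fi
    done
    if [ -n "$missing" ]; then
        echo "Missing tools:$missing" >&2
        return 1
    fi
    return 0
}

# Assumed tool list, taken from the installation commands in this section.
check_tools git wget tar bzip2 make || echo "Install the missing tools before continuing."
```

If nothing is printed, all of the listed tools were found. Matlab is not checked here because its launcher name and location vary between installations.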

Usage

You can reproduce the results by running the following in Matlab:

sm = segmodel( 'PROFILE', '0.16', 'use_real_pose', false ); % Load the model; parameters can be set here
sm = sm.train_misc_unaries(); % Train the miscellaneous unary potentials
sm = sm.train_MRF(); % Set up and train the CRF
R = sm.test_MRF_segmentation() % Run testing and output the results

This should produce output similar to:

 BUILDING MRF OUTPUT 29 CLASSES (REAL POSE=0)...
 UNARIES:
    bgbias
    logreg:       29
    cpmc_logreg:  29
    cpmc
    shapelets
 HIGHER ORDER
    similarity
    limbs
 Initializing Image 011 / 350...   0.4 seconds!   

 ...

 Tested MRF in 319.0 seconds
 350 / 350... 

 R = 

     confusion: [29x29 double]
     order: [29x1 double]
     acc: 0.8432
     pre: [29x1 double]
     rec: [29x1 double]
     f1: [29x1 double]
     voc: [29x1 double]
     avr_pre: 0.3007
     avr_rec: 0.3292
     avr_f1: 0.3039
     avr_voc: 0.2013

Please note that, due to stochastic components and differences between software versions, the numbers will not be exactly the same as in the paper. For the paper, all results were obtained on a Linux machine running Ubuntu 12.04 with Matlab R2012a (7.14.0.739) 64-bit (glnxa64).

You can also visualize the output of the model with:

sm.test_MRF_visualize( 'output/' )

This will save both the ground truth segmentations and the predicted segmentations in the directory 'output/' as shown in the paper.

If you use this code, please cite:

 @InProceedings{SimoSerraACCV2014,
    author = {Edgar Simo-Serra and Sanja Fidler and Francesc Moreno-Noguer and Raquel Urtasun},
    title = {{A High Performance CRF Model for Clothes Parsing}},
    booktitle = "Proceedings of the Asian Conference on Computer Vision (2014)",
    year = 2014
 }

Acknowledgments

We would like to thank Kota Yamaguchi for his excellent code, which we used as a base for our model.

The different codes we have used (in alphabetical order):

Changelog

December 2014: Initial version 1.0 release
