Supervised Sliding Window Smoothing Loss Function Based on MS-TCN for Video Segmentation

Last update: Aug 03, 2022

Overview

SSWS-loss_function_based_on_MS-TCN

Supervised Sliding Window Smoothing Loss Function Based on MS-TCN for Video Segmentation

Abstract

Recently, more and more videos have been uploaded to the network, so that video analysis task has been one of the most important applications in various fields. At present, video analysis methods can be divided into two kinds: weakly supervised video action segmentation and supervised video action segmentation. The former uses a sliding window or Markov model, while the latter uses the TCN model. In this paper, we introduce the Supervised Sliding Window Smooth Loss Function (SSWS) into the TCN baseline, which is a complement to MS-TCN smoothing loss function TMSE. In this method, three discriminant frames are selected from the video prediction sequence and combined into an adaptive sliding window to selectively smooth the whole prediction sequence. In particular, it doubles the penalty when it slides to the wrong place in the category. Compared to TMSE, our method effectively increases the receptive field of smoothing loss function. And, the proposed new supervised loss function only penalizes error frames. The experiment shows that compared with the Smoothing loss function TMSE of MS-TCN, SSWS has significantly improved in the three datasets: 50Salads, GTEA and the Breakfast Dataset.

Supervised Sliding Window Smoothing Loss Function Based on MS-TCN for Video Segmentation

Related tags

Overview

SSWS-loss_function_based_on_MS-TCN

Supervised Sliding Window Smoothing Loss Function Based on MS-TCN for Video Segmentation

Abstract

Owner

Boundary-aware Transformers for Skin Lesion Segmentation

This is a project based on ConvNets used to identify whether a road is clean or dirty. We have used MobileNet as our base architecture and the weights are based on imagenet.

Selene is a Python library and command line interface for training deep neural networks from biological sequence data such as genomes.

A simple python module to generate anchor (aka default/prior) boxes for object detection tasks.

Find-Lane-Line - Use openCV library and Python to detect the road-lane-line

Official implementation of Deep Convolutional Dictionary Learning for Image Denoising.

WSDM2022 Challenge - Large scale temporal graph link prediction

Self-Guided Contrastive Learning for BERT Sentence Representations

DL course co-developed by YSDA, HSE and Skoltech

Project Tugas Besar pertama Pengenalan Komputasi Institut Teknologi Bandung

EEGEyeNet is benchmark to evaluate ET prediction based on EEG measurements with an increasing level of difficulty

Object recognition using Azure Custom Vision AI and Azure Functions

Learning to Predict Gradients for Semi-Supervised Continual Learning

PyTorch Implementation of ByteDance's Cross-speaker Emotion Transfer Based on Speaker Condition Layer Normalization and Semi-Supervised Training in Text-To-Speech

A torch.Tensor-like DataFrame library supporting multiple execution runtimes and Arrow as a common memory format

The Illinois repository for Climatehack (https://climatehack.ai/). We won 1st place!

MQBench: Towards Reproducible and Deployable Model Quantization Benchmark

2021 CCF BDCI 全国信息检索挑战杯（CCIR-Cup）智能人机交互自然语言理解赛道第二名参赛解决方案

PointPillars inference with TensorRT

Preprocessed Datasets for our Multimodal NER paper