A selection of state-of-the-art research papers (and code) on human locomotion (pose + trajectory) prediction (forecasting).

Overview

Awesome-Human-Pose-Prediction

A selection of state-of-the-art research papers (and code) on human trajectory prediction (forecasting). Papers marked with [W] are workshop papers.

Maintainers: Karttikeya Mangalam

Contributing: Please feel free to open pull requests to add new resources or to suggest additions or changes to the list. When proposing a new addition, please keep the following principles in mind:

  • The work has been accepted at a reputable peer-reviewed publication venue.
  • An openly accessible link to the paper PDF is attached (as far as possible).
  • Code for the paper is linked (if open-sourced by the authors).

Email: [email protected].{berkeley,stanford}.edu

Datasets

  • Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments [Paper]
  • Stanford Drone Dataset (SDD): Learning Social Etiquette: Human Trajectory Understanding in Crowded Scenes [Paper] [Leaderboard]

Papers

As an End in Itself

  • From Goals, Waypoints & Paths To Long Term Human Trajectory Forecasting [Paper]

  • It Is Not the Journey but the Destination: Endpoint Conditioned Trajectory Prediction [Paper]

  • Trajectron++: Dynamically-Feasible Trajectory Forecasting With Heterogeneous Data [Paper]

  • Interaction-Based Trajectory Prediction Over a Hybrid Traffic Graph [Paper]

  • Map-Adaptive Goal-Based Trajectory Prediction [Paper]

  • Interaction-Aware Trajectory Prediction based on a 3D Spatio-Temporal Tensor Representation using Convolutional–Recurrent Neural Networks [Paper]

  • DROGON: A Trajectory Prediction Model based on Intention-Conditioned Behavior Reasoning [Paper]

  • Discrete Residual Flow for Probabilistic Pedestrian Behavior Prediction [Paper]

  • Social-VRNN: One-Shot Multi-modal Trajectory Prediction for Interacting Pedestrians [Paper]

  • Leveraging Neural Network Gradients within Trajectory Optimization for Proactive Human-Robot Interactions [Paper]

  • Social NCE: Contrastive Learning of Socially-aware Motion Representations [Paper]

  • Multimodal Deep Generative Models for Trajectory Prediction: A Conditional Variational Autoencoder Approach [Paper]

  • Risk-Sensitive Sequential Action Control with Multi-Modal Human Trajectory Forecasting for Safe Crowd-Robot Interaction [Paper]

  • Deep Learning for Vision-based Prediction: A Survey [Paper]

  • Probabilistic Crowd GAN: Multimodal Pedestrian Trajectory Prediction Using a Graph Vehicle-Pedestrian Attention Network [Paper]

  • Semantics for Robotic Mapping, Perception and Interaction: A Survey [Paper]

  • Benchmark for Evaluating Pedestrian Action Prediction [Paper]

  • Probabilistic Tracklet Scoring and Inpainting for Multiple Object Tracking [Paper]

  • Pedestrian Behavior Prediction via Multitask Learning and Categorical Interaction Modeling [Paper]

  • Graph-SIM: A Graph-based Spatiotemporal Interaction Modelling for Pedestrian Action Prediction [Paper]

  • Haar Wavelet based Block Autoregressive Flows for Trajectories [Paper]

  • Imitative Planning using Conditional Normalizing Flow [Paper]

  • TNT: Target-driveN Trajectory Prediction [Paper]

  • SimAug: Learning Robust Representations from Simulation for Trajectory Prediction [Paper]

  • SoPhie: An Attentive GAN for Predicting Paths Compliant to Social and Physical Constraints [Paper]

  • Social GAN: Socially Acceptable Trajectories With Generative Adversarial Networks [Paper]

  • DESIRE: Distant Future Prediction in Dynamic Scenes With Interacting Agents [Paper]

  • Predicting Whole Body Motion Trajectories using Conditional Neural Movement Primitives [Paper] [W]

  • Anticipating Human Intention for Full-Body Motion Prediction [Paper] [W]

  • Human Motion Prediction With Graph Neural Networks [Paper] [W]

  • Action-Agnostic Human Pose Forecasting [Paper]

  • Human Torso Pose Forecasting in the Real World [Paper]

  • Imitation Learning for Human Pose Prediction [Paper]

  • Disentangling Human Dynamics for Pedestrian Locomotion Forecasting with Noisy Supervision [Paper]

  • Predicting 3D Human Dynamics from Video [Paper]

  • Recurrent Network Models for Human Dynamics [Paper]

  • Structural-RNN: Deep Learning on Spatio-Temporal Graphs [Paper]

  • Learning Trajectory Dependencies for Human Motion Prediction [Paper]

  • Anticipating many futures: Online human motion prediction and generation for human-robot interaction [Paper]

  • Teaching Robots to Predict Human Motion [Paper]

  • Deep representation learning for human motion prediction and classification [Paper]

  • On human motion prediction using recurrent neural networks [Paper]

  • Few-Shot Human Motion Prediction via Meta-learning [Paper]

  • Efficient convolutional hierarchical autoencoder for human motion prediction [Paper]

  • Learning Human Motion Models for Long-term Predictions [Paper]

  • Long-Term Human Motion Prediction by Modeling Motion Context and Enhancing Motion Dynamic [Paper]

  • Context-aware Human Motion Prediction [Paper]

  • Adversarial Geometry-Aware Human Motion Prediction [Paper]

  • Convolutional Sequence to Sequence Model for Human Dynamics [Paper]

  • QuaterNet: A Quaternion-based Recurrent Model for Human Motion [Paper]

  • BiHMP-GAN: Bidirectional 3D Human Motion Prediction GAN [Paper]

  • Human Motion Modeling using DVGANs [Paper]

  • Human Motion Prediction using Semi-adaptable Neural Networks [Paper]

  • A Neural Temporal Model for Human Motion Prediction [Paper]

  • Modeling Human Motion with Quaternion-based Neural Networks [Paper]

  • Human Motion Prediction via Learning Local Structure Representations and Temporal Dependencies [Paper]

  • VRED: A Position-Velocity Recurrent Encoder-Decoder for Human Motion Prediction [Paper]

  • EAN: Error Attenuation Network for Long-term Human Motion Prediction [Paper]

  • Structured Prediction Helps 3D Human Motion Modelling [Paper]

  • Forecasting Human Dynamics from Static Images [Paper]

  • HP-GAN: Probabilistic 3D human motion prediction via GAN [Paper]

  • Learning Latent Representations of 3D Human Pose with Deep Neural Networks [Paper]

  • A Recurrent Variational Autoencoder for Human Motion Synthesis [Paper]

  • Spatio-temporal Manifold Learning for Human Motions via Long-horizon Modeling [Paper]

  • Combining Recurrent Neural Networks and Adversarial Training for Human Motion Synthesis and Control [Paper]

  • PISEP2: Pseudo Image Sequence Evolution based 3D Pose Prediction [Paper]

  • Human Motion Prediction via Spatio-Temporal Inpainting [Paper]

  • Spatiotemporal Co-attention Recurrent Neural Networks for Human-Skeleton Motion Prediction [Paper]

  • Human Pose Forecasting via Deep Markov Models [Paper]

  • Auto-Conditioned Recurrent Networks For Extended Complex Human Motion Synthesis [Paper]

  • Predicting Long-Term Skeletal Motions by a Spatio-Temporal Hierarchical Recurrent Network [Paper]

As a Subtask

  • The Pose Knows: Video Forecasting by Generating Pose Futures [Paper]
  • I-Planner: Intention-Aware Motion Planning Using Learning Based Human Motion Prediction [Paper]
  • Language2Pose: Natural Language Grounded Pose Forecasting [Paper]
  • Long-Term Video Generation of Multiple Futures Using Human Poses [Paper]
  • Predicting body movements for person identification under different walking conditions [Paper]

Owner: Karttikeya Mangalam, PhD student in Computer Vision @ BAIR, UC Berkeley.