Ego4d dataset repository. Download the dataset, visualize, extract features & example usage of the dataset

Last update: Jan 07, 2023

Overview

Ego4D

EGO4D is the world's largest egocentric (first person) video ML dataset and benchmark suite, with 3,600 hrs (and counting) of densely narrated video and a wide range of annotations across five new benchmark tasks. It covers hundreds of scenarios (household, outdoor, workplace, leisure, etc.) of daily life activity captured in-the-wild by 926 unique camera wearers from 74 worldwide locations and 9 different countries. Portions of the video are accompanied by audio, 3D meshes of the environment, eye gaze, stereo, and/or synchronized videos from multiple egocentric cameras at the same event. The approach to data collection was designed to uphold rigorous privacy and ethics standards with consenting participants and robust de-identification procedures where relevant.

Public Documentation/Start Here: Ego4D Docs

For the CLI readme (to download/access): CLI README

For a demo notebook: Annotation Notebook

For the visualization engine: Viz README

For feature extraction: Feature README

License

Ego4D is released under the MIT License.

Ego4d dataset repository. Download the dataset, visualize, extract features & example usage of the dataset

Related tags

Overview

Ego4D

License

Owner

Meta Research

Official implementation of ETH-XGaze dataset baseline

This is the code repository implementing the paper "TreePartNet: Neural Decomposition of Point Clouds for 3D Tree Reconstruction".

PyTorch implementation of Tacotron speech synthesis model.

Code and dataset for ACL2018 paper "Exploiting Document Knowledge for Aspect-level Sentiment Classification"

An index of algorithms for learning causality with data

Label Mask for Multi-label Classification

Code accompanying the paper "Knowledge Base Completion Meets Transfer Learning"

Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.

Dictionary Learning with Uniform Sparse Representations for Anomaly Detection

Multivariate Time Series Transformer, public version

Additional code for Stable-baselines3 to load and upload models from the Hub.

Predict bus arrival time using VertexAI and Nvidia's Jetson Nano

Interpretable and Generalizable Person Re-Identification with Query-Adaptive Convolution and Temporal Lifting

验证码识别深度学习 tensorflow 神经网络

An pytorch implementation of Masked Autoencoders Are Scalable Vision Learners

An official implementation of the paper Exploring Sequence Feature Alignment for Domain Adaptive Detection Transformers

Semantically Contrastive Learning for Low-light Image Enhancement

QTool: A Low-bit Quantization Toolbox for Deep Neural Networks in Computer Vision

AttGAN: Facial Attribute Editing by Only Changing What You Want (IEEE TIP 2019)

2021搜狐校园文本匹配算法大赛分比我们低的都是帅哥队

Ego4d dataset repository. Download the dataset, visualize, extract features & example usage of the dataset

Related tags

Overview

Ego4D

License

Owner

Meta Research

Official implementation of ETH-XGaze dataset baseline

This is the code repository implementing the paper "TreePartNet: Neural Decomposition of Point Clouds for 3D Tree Reconstruction".

PyTorch implementation of Tacotron speech synthesis model.

Code and dataset for ACL2018 paper "Exploiting Document Knowledge for Aspect-level Sentiment Classification"

An index of algorithms for learning causality with data

Label Mask for Multi-label Classification

Code accompanying the paper "Knowledge Base Completion Meets Transfer Learning"

Fast, modular reference implementation of Instance Segmentation and Object Detection algorithms in PyTorch.

Dictionary Learning with Uniform Sparse Representations for Anomaly Detection

Multivariate Time Series Transformer, public version

Additional code for Stable-baselines3 to load and upload models from the Hub.

Predict bus arrival time using VertexAI and Nvidia's Jetson Nano

Interpretable and Generalizable Person Re-Identification with Query-Adaptive Convolution and Temporal Lifting

验证码识别 深度学习 tensorflow 神经网络

An pytorch implementation of Masked Autoencoders Are Scalable Vision Learners

An official implementation of the paper Exploring Sequence Feature Alignment for Domain Adaptive Detection Transformers

Semantically Contrastive Learning for Low-light Image Enhancement

QTool: A Low-bit Quantization Toolbox for Deep Neural Networks in Computer Vision

AttGAN: Facial Attribute Editing by Only Changing What You Want (IEEE TIP 2019)

2021搜狐校园文本匹配算法大赛 分比我们低的都是帅哥队

验证码识别深度学习 tensorflow 神经网络

2021搜狐校园文本匹配算法大赛分比我们低的都是帅哥队