PyTorch implementation for ACL 2021 paper "Maria: A Visual Experience Powered Conversational Agent".

Last update: Dec 12, 2022

Related tags

Overview

Maria: A Visual Experience Powered Conversational Agent

This repository is the Pytorch implementation of our paper "Maria: A Visual Experience Powered Conversational Agent" in ACL 2021.

In this paper, we present Maria, a neural conversation agent powered by the visual world experiences which are retrieved from a large-scale image index. Maria consists of three flexible components, i.e., text-to-image retriever, visual concept detector and visual-knowledge-grounded response generator.

Coming soon!

Summary

Maria: A Visual Experience Powered Conversational Agent

Dependencies

python 3.7
pytorch 1.4.0
Ubuntu 18.04

Usage

Citation

If you find this paper helps your research, please kindly consider citing our paper in your publications.

@inproceedings{liang2021maria,
   title={Maria: A Visual Experience Powered Conversational Agent},
   author={Liang, Zujie and Hu, Huang and Xu, Can and Chongyang, Tao and Geng, Xiubo and Chen, Danqi and Liang, Fan and Jiang, Daxin},
   booktitle={Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics (ACL)},
   year={2021}
}

Acknowledgment

Special thanks to the authors of OSCAR, vokenization, and py-bottom-up-attention.

PyTorch implementation for ACL 2021 paper "Maria: A Visual Experience Powered Conversational Agent".

Related tags

Overview

Maria: A Visual Experience Powered Conversational Agent

Summary

Dependencies

Usage

Text-to-Image Retrieval Model

Bottom-up Detector Model

Dialog Generation Model

Citation

Acknowledgment

Owner

Jokie

A Pytorch Implementation of ClariNet

Code and dataset for AAAI 2021 paper FixMyPose: Pose Correctional Describing and Retrieval Hyounghun Kim, Abhay Zala, Graham Burri, Mohit Bansal.

A PyTorch implementation of "TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?"

This repository is for our paper Exploiting Scene Graphs for Human-Object Interaction Detection accepted by ICCV 2021.

这是一个利用facenet和retinaface实现人脸识别的库，可以进行在线的人脸识别。

An implementation of Equivariant e2 convolutional kernals into a convolutional self attention network, applied to radio astronomy data.

Extremely simple and fast extreme multi-class and multi-label classifiers.

Use evolutionary algorithms instead of gridsearch in scikit-learn

InsTrim: Lightweight Instrumentation for Coverage-guided Fuzzing

PyTorch implementation of Rethinking Positional Encoding in Language Pre-training

Source code for "Roto-translated Local Coordinate Framesfor Interacting Dynamical Systems"

WormMovementSimulation - 3D Simulation of Worm Body Movement with Neurons attached to its body

Denoising Diffusion Probabilistic Models

Code for Two-stage Identifier: "Locate and Label: A Two-stage Identifier for Nested Named Entity Recognition"

Jarvis Project is a basic virtual assistant that uses TensorFlow for learning.

SelfAugment extends MoCo to include automatic unsupervised augmentation selection.

CausalNLP is a practical toolkit for causal inference with text as treatment, outcome, or "controlled-for" variable.

Code for CPM-2 Pre-Train

Multi-modal Vision Transformers Excel at Class-agnostic Object Detection

Google Brain - Ventilator Pressure Prediction