Code for the modular summarization work published at ACL 2021 by Krishna et al.

Last update: Nov 24, 2022

Overview

This repository contains the code for running the modular summarization pipelines described in the following publication:
Krishna K, Khosla K, Bigham J, Lipton ZC. "Generating SOAP Notes from Doctor-Patient Conversations." ACL 2021.

Instructions

Although we cannot release models trained on the confidential medical data, we have released models trained on the publicly available AMI dataset.
To reproduce the results on the AMI dataset, follow the steps listed below. For convenience, we have also created a Google Colab notebook here that runs these steps on Google's servers (free of charge as of June 2021) and produces the summaries and their ROUGE scores.

Step 1: Set up the environment by using pip to install the packages listed in requirements.txt.

Step 2: Download the ami_models folder from this link and put it at the root of the repository.

Step 3: Run the following 3 commands to prepare the data, run the summary generation pipelines, and show the achieved ROUGE scores.

# command1: downloads and preprocesses the AMI dataset
./prepare_data.sh

# command2: runs the summarization pipelines on the data and computes ROUGE scores
# (before running this command, you need to download the models as shown above)
./predict_ami.sh

# command3: prints the results
python show_results.py
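
The steps above depend on a few files and on the ami_models folder being present at the repository root, so a small pre-flight check can save a failed run. The following is a minimal sketch, not part of the repository: the function names `missing_prerequisites` and `run_pipeline` and the `runner` parameter are hypothetical, and only the file, folder, and command names come from the instructions above.

```python
import subprocess
from pathlib import Path

# Files and folders the steps above rely on (names taken from this README).
REQUIRED_FILES = ["requirements.txt", "prepare_data.sh", "predict_ami.sh", "show_results.py"]
REQUIRED_DIRS = ["ami_models"]

# The three commands from Step 3, in the order they should run.
COMMANDS = [
    ["./prepare_data.sh"],
    ["./predict_ami.sh"],
    ["python", "show_results.py"],
]


def missing_prerequisites(repo_root):
    """Return the required files and folders not found under repo_root (hypothetical helper)."""
    root = Path(repo_root)
    missing = [name for name in REQUIRED_FILES if not (root / name).is_file()]
    missing += [name for name in REQUIRED_DIRS if not (root / name).is_dir()]
    return missing


def run_pipeline(repo_root, runner=subprocess.run):
    """Run the Step 3 commands in order, stopping at the first failure (hypothetical helper)."""
    missing = missing_prerequisites(repo_root)
    if missing:
        raise FileNotFoundError("Missing from repository root: " + ", ".join(missing))
    for command in COMMANDS:
        # With the default runner, check=True raises CalledProcessError on a non-zero exit.
        runner(command, cwd=repo_root, check=True)
```

Calling `run_pipeline(".")` from the repository root would then run the three commands in sequence, assuming the packages from Step 1 are already installed.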

Owner

Approximately Correct Machine Intelligence (ACMI) Lab
