Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)

Last update: Oct 27, 2022

Related tags

Overview

Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)

Overview of paths used in DIG and IG. w is the word being attributed. The gray region is the neighborhood of w. Green line depicts the straight-line path from w to w' used by IG and the green squares are the corresponding interpolation points. Left: In DIG-Greedy, we first monotonize each word in the neighborhood (red arrow). Then the word closest to its corresponding monotonic point is selected as the anchor (blue line to w_5 since the red arrow of w_5 has the shortest magnitude). Right: In DIG-MaxCount we first count the number of monotonic dimensions for each word in the neighborhood (shown in [.] above). Then, the word with the highest number of monotonic dimensions is selected as the anchor word (blue line to w_4), followed by changing the non-monotonic dimensions of w_4 (red line to c). Repeating this step gives the zigzag blue path. Finally, the red stars are the interpolated points used by our method. Please refer to the paper for more details.

Dependencies

Dependencies can be installed using requirements.txt.

Evaluating DIG:

Install all the requirements from requirements.txt.
Execute ./setup.sh for setting up the folder hierarchy for experiments.

Commands for reproducing the reported results on DistilBERT fine-tuned on SST2:

# Generate the KNN graph
python knn.py -dataset sst2 -nn distilbert

# DIG (strategy: Greedy)
python main.py -dataset sst2 -nn distilbert -strategy greedy

# DIG (strategy: MaxCount)
python main.py -dataset sst2 -nn distilbert -strategy maxcount

Similarly, commands can be changed for other settings.

Please contact Soumya for any clarifications or suggestions.

Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)

Related tags

Overview

Discretized Integrated Gradients for Explaining Language Models (EMNLP 2021)

Dependencies

Evaluating DIG:

Owner

INK Lab @ USC

Convnext-tf - Unofficial tensorflow keras implementation of ConvNeXt

A library for Deep Learning Implementations and utils

Image Captioning using CNN ,LSTM and Attention

A light-weight image labelling tool for Python designed for creating segmentation data sets.

PaddleRobotics is an open-source algorithm library for robots based on Paddle, including open-source parts such as human-robot interaction, complex motion control, environment perception, SLAM positioning, and navigation.

This repo is developed for Strong Baseline For Vehicle Re-Identification in Track 2 Ai-City-2021 Challenges

data/code repository of "C2F-FWN: Coarse-to-Fine Flow Warping Network for Spatial-Temporal Consistent Motion Transfer"

This project provides a stock market environment using OpenGym with Deep Q-learning and Policy Gradient.

NLG evaluation via Statistical Measures of Similarity: BaryScore, DepthScore, InfoLM

Trax — Deep Learning with Clear Code and Speed

Este conversor criará a medida exata para sua receita de capuccino gelado da grandiosa Rafaella Ballerini!

Monocular Depth Estimation Using Laplacian Pyramid-Based Depth Residuals

Meta-meta-learning with evolution and plasticity

Example how to deploy deep learning model with aiohttp.

[ECCVW2020] Robust Long-Term Object Tracking via Improved Discriminative Model Prediction (RLT-DiMP)

Animatable Neural Radiance Fields for Modeling Dynamic Human Bodies

The trained model and denoising example for paper : Cardiopulmonary Auscultation Enhancement with a Two-Stage Noise Cancellation Approach

Reimplementation of Dynamic Multi-scale filters for Semantic Segmentation.

Research using Cirq!

Automatic Data-Regularized Actor-Critic (Auto-DrAC)