CausaLM: Causal Model Explanation Through Counterfactual Language Models

Last update: Jul 10, 2022

Overview

CausaLM: Causal Model Explanation Through Counterfactual Language Models

Authors:

Amir Feder, Nadav Oved, Uri Shalit, Roi Reichart

Abstract:

Understanding predictions made by deep neural networks is notoriously difficult, but also crucial to their dissemination. As all ML-based methods, they are as good as their training data, and can also capture unwanted biases. While there are tools that can help understand whether such biases exist, they do not distinguish between correlation and causation, and might be ill-suited for text-based models and for reasoning about high level language concepts. A key problem of estimating the causal effect of a concept of interest on a given model is that this estimation requires the generation of counterfactual examples, which is challenging with existing generation technology. To bridge that gap, we propose CausaLM, a framework for producing causal model explanations using counterfactual language representation models. Our approach is based on fine-tuning of deep contextualized embedding models with auxiliary adversarial tasks derived from the causal graph of the problem. Concretely, we show that by carefully choosing auxiliary adversarial pre-training tasks, language representation models such as BERT can effectively learn a counterfactual representation for a given concept of interest, and be used to estimate its true causal effect on model performance. A byproduct of our method is a representation that is unaffected by the tested concept, which can be useful in mitigating unwanted bias ingrained in the data.

CausaLM: Causal Model Explanation Through Counterfactual Language Models

Related tags

Overview

CausaLM: Causal Model Explanation Through Counterfactual Language Models

Authors:

Amir Feder, Nadav Oved, Uri Shalit, Roi Reichart

Abstract:

Links:

Paper

Code

Data

Owner

Amir Feder

A modular, open and non-proprietary toolkit for core robotic functionalities by harnessing deep learning

TSDF++: A Multi-Object Formulation for Dynamic Object Tracking and Reconstruction

Meta Learning Backpropagation And Improving It (VSML)

Official code for ICCV2021 paper "M3D-VTON: A Monocular-to-3D Virtual Try-on Network"

code for paper -- "Seamless Satellite-image Synthesis"

Official code for the CVPR 2022 (oral) paper "Extracting Triangular 3D Models, Materials, and Lighting From Images".

Tool for working with Y-chromosome data from YFull and FTDNA

Unified Instance and Knowledge Alignment Pretraining for Aspect-based Sentiment Analysis

g2o: A General Framework for Graph Optimization

Cross-modal Deep Face Normals with Deactivable Skip Connections

N-HiTS: Neural Hierarchical Interpolation for Time Series Forecasting

Deep Learning agent of Starcraft2, similar to AlphaStar of DeepMind except size of network.

《Train in Germany, Test in The USA: Making 3D Object Detectors Generalize》(CVPR 2020)

Neural models of common sense. 🤖

Human-Pose-and-Motion History

Source code for NAACL 2021 paper "TR-BERT: Dynamic Token Reduction for Accelerating BERT Inference"

A Momentumized, Adaptive, Dual Averaged Gradient Method for Stochastic Optimization

[ICML 2021] “ Self-Damaging Contrastive Learning”, Ziyu Jiang, Tianlong Chen, Bobak Mortazavi, Zhangyang Wang

This is the official implementation code repository of Underwater Light Field Retention : Neural Rendering for Underwater Imaging (Accepted by CVPR Workshop2022 NTIRE)

SplineConv implementation for Paddle.