GitHub - shauli-ravfogel/lm-counterfactuals

This repository contains the code for the paper "Gumbel Counterfactual Generation from Language Models". In this work, we conceptualize LMs as Generalized Causal Models (GCMs), enabling us to generate true counterfactual strings from a given input string. By leveraging the Gumbel-Max trick, we separate the deterministic computations of the LM’s forward pass from the inherent randomness of the sampling process. This allows us to use hindsight sampling to identify the noise responsible for generating a specific string and reuse the same noise when generating a counterfactual string from the model, post-intervention.

To set up the environment:

python3 -m venv venv
source venv/bin/activate
pip install -r requirements.txt
# download models
gdown --folder https://drive.google.com/drive/folders/11PE8DxVqbfpsLhqLop71CqL6KsdeTnFf

Then, run run.py to re-generate the counterfactuals on the Wikipedia/Bios dataset. The notebook example.ipynb contains a minimal example for generating a counterfactual string based on an original string.

The directory counterfactuals contains the counterfactuals sentences we generated from Wikipedia and the Biosd dataset, based on several models and intervention techniques.

Name	Name	Last commit message	Last commit date
Latest commit shauli-ravfogel Update README.md Jan 12, 2025 8838060 · Jan 12, 2025 History 95 Commits
counterfactuals	counterfactuals	Add files via upload	Dec 13, 2024
README.md	README.md	Update README.md	Jan 12, 2025
analyze.py	analyze.py	update	Oct 8, 2024
analyze_edit.py	analyze_edit.py	Add files via upload	Dec 13, 2024
example.ipynb	example.ipynb	Update example.ipynb	Dec 6, 2024
memit-analysis.ipynb	memit-analysis.ipynb	Add files via upload	Dec 13, 2024
mimic.py	mimic.py	Add files via upload	Dec 4, 2024
print.ipynb	print.ipynb	Add files via upload	Dec 13, 2024
requirements.txt	requirements.txt	Update requirements.txt	Oct 22, 2024
run.py	run.py	Update run.py	Dec 5, 2024
run_mimic.py	run_mimic.py	Add files via upload	Dec 4, 2024
sampling.py	sampling.py	Update sampling.py	Dec 5, 2024
utils.py	utils.py	Update utils.py	Nov 27, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases

Packages

Languages

shauli-ravfogel/lm-counterfactuals

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages