Causal Machine Learning for Heterogeneous Treatment Effects: An Empirical Application on Optimal Treatment Assignment

Master Thesis Paper, submitted and presented at CERGE-EI.

Introduction

This repository contains the code, data, and documentation for my Master Thesis, titled Causal Machine Learning for Heterogeneous Treatment Effects: An Empirical Application on Optimal Treatment Assignment. The thesis explores how machine learning can improve causal inference. The repository includes all scripts and resources needed to reproduce the results, along with explanations of the methods used. Feel free to explore the materials and reach out if you have any questions or feedback!

Main configurations:

Ran on:

Windows 11
Python 3.9.13
tensorflow==2.10.0
protobuf==3.11.3
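
To compare an existing environment against the versions listed above, a small check along the following lines may help. It is only a sketch: the `PINNED` dictionary and both function names are illustrative and not part of the repository.

```python
import sys
from importlib import metadata

# Versions copied from the list above; extend the dictionary as needed.
PINNED = {"tensorflow": "2.10.0", "protobuf": "3.11.3"}


def installed_version(package):
    """Return the installed version string, or None if the package is absent."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def report_mismatches(pinned=PINNED):
    """Map each package that differs from its pin to (expected, found)."""
    mismatches = {}
    for name, expected in pinned.items():
        found = installed_version(name)
        if found != expected:
            mismatches[name] = (expected, found)
    return mismatches


if __name__ == "__main__":
    current = ".".join(map(str, sys.version_info[:3]))
    print(f"Python {current} (the thesis used 3.9.13)")
    for name, (expected, found) in report_mismatches().items():
        print(f"{name}: expected {expected}, found {found or 'not installed'}")
```

An empty result from `report_mismatches()` means both listed packages match the versions above.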

How to set up the virtual environment using venv

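The venv module ships with Python 3, so the commands below work without extra packages. If you prefer the third-party virtualenv tool, you can install it to your host Python by running this command in your terminal: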

pip install virtualenv

Clone the repository, change into the project folder, and create the virtual environment. The interpreter is selected by its minor version, so the command is python3.9 (on Windows, use py -3.9 instead):

git clone git@github.com:klaushajdaraj/ml-treatment-effects.git
cd ml-treatment-effects
python3.9 -m venv env

To activate your virtual environment:

On Mac:

source env/bin/activate

On Windows:

 env\Scripts\activate.bat   (in CMD)
 env\Scripts\Activate.ps1   (in PowerShell)
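
Before installing packages, it can be useful to confirm that the environment is actually active. The following sketch relies on the fact that venv points `sys.prefix` at the environment while `sys.base_prefix` keeps pointing at the host installation. The function name `in_virtualenv` is illustrative, and the check applies to venv/virtualenv environments only, not to conda environments.

```python
import sys


def in_virtualenv():
    """True when the interpreter runs inside a venv/virtualenv environment."""
    # Inside a venv, sys.prefix is the environment folder and
    # sys.base_prefix is the Python installation it was created from.
    return sys.prefix != getattr(sys, "base_prefix", sys.prefix)


if __name__ == "__main__":
    print("virtual environment active:", in_virtualenv())
    print("interpreter:", sys.executable)
```

If the first line prints False, the activation step did not take effect in the current terminal session.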

Install the packages and libraries:

pip install -r requirements.txt

To deactivate your virtual environment:

deactivate

How to set up the virtual environment using conda (Mac)

conda create -n ml_treatments_env python=3.9.13

conda activate ml_treatments_env

pip install -r requirements.txt

Files

`requirements.txt`

The file contains the required packages, libraries and dependencies. To install the requirements, run in the terminal:

pip install -r requirements.txt

`repetitions_subsettreatments.joblib`

Contains the CV_Results (see mlmethods.py) saved from 100 repetitions of three-fold cross-validated Hitsch Matching for the two ML methods. Only treatments 1, 2, 4 and 5 were considered.
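
A saved results file can be opened with `joblib.load`. The sketch below is a minimal loader, not code from the repository: the function name `load_cv_results` is illustrative, and the structure of the returned object is not documented here, so inspect it after loading. It assumes that the stored objects are instances of classes defined in `mlmethods.py`, in which case that module must be importable when the file is unpickled.

```python
from pathlib import Path


def load_cv_results(path="repetitions_subsettreatments.joblib"):
    """Load a saved results file, failing early with a clear message if it is missing."""
    path = Path(path)
    if not path.is_file():
        raise FileNotFoundError(f"results file not found: {path.resolve()}")
    # Imported lazily so the path check works before the requirements are installed.
    import joblib
    # Unpickling needs the original class definitions, so run this from a
    # location where mlmethods.py can be imported (assumption, see above).
    return joblib.load(path)


# Example (run from the repository root):
# results = load_cv_results()
# print(type(results))
```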

`repetitions_alltreatments.joblib`

Contains the CV_Results (see mlmethods.py) saved from 100 repetitions of three-fold cross-validated Hitsch Matching for the two ML methods. All treatments were considered.
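
To illustrate what repeating a three-fold cross validation 100 times means in terms of data splits, here is a minimal sketch using only the standard library. It shows the resampling scheme alone and is not the implementation in `mlmethods.py`; the function name, the seed handling, and the way folds are formed are assumptions made for the example.

```python
import random


def repeated_kfold_indices(n_samples, n_folds=3, n_repetitions=100, seed=0):
    """Yield (repetition, fold, train_idx, test_idx) for repeated k-fold cross validation."""
    rng = random.Random(seed)
    for rep in range(n_repetitions):
        order = list(range(n_samples))
        rng.shuffle(order)  # a fresh random partition for every repetition
        # Striding through the shuffled order gives folds whose sizes differ by at most one.
        folds = [order[k::n_folds] for k in range(n_folds)]
        for k, test_idx in enumerate(folds):
            train_idx = [i for j, fold in enumerate(folds) if j != k for i in fold]
            yield rep, k, train_idx, test_idx
```

With the default arguments the generator produces 300 train/test splits, and within each repetition every observation is held out exactly once.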

`plots.py`

Code for creating the plots used in Analytics.ipynb, the main Jupyter notebook for evaluating the results.

`mlmethods.py`

Main module with the two ML method classes and the code for Hitsch Matching. It is only imported by the main script; its own main() is empty.

`expdata.csv`

Raw data of the experiment from Opitz et al. (2024).

`cv_script.py`

Script for hyperparameter tuning of the two ML methods.

`exploratory_data_analysis.ipynb`

The main Jupyter notebook for creating descriptive statistics, result tables and figures.

`misramatching_script.py`

Performs the Hitsch Matching with the two ML methods. Adjust the used_treatments list to select the subset of treatments. The script also contains the dictionary with the hyperparameters used.
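
As an illustration of what restricting the analysis to a subset of treatments might look like, the sketch below filters rows by a list of treatment ids. It is not taken from `misramatching_script.py`: the column name `treatment`, the helper `filter_rows`, and the tiny demo table are made up for the example, so check the header of `expdata.csv` for the real column name.

```python
import csv
import io

# The subset named in the description of repetitions_subsettreatments.joblib.
used_treatments = [1, 2, 4, 5]


def filter_rows(rows, used_treatments, column="treatment"):
    """Keep only rows whose treatment id is in used_treatments.

    `column` is a hypothetical name chosen for this example.
    """
    wanted = set(used_treatments)
    return [row for row in rows if int(row[column]) in wanted]


# Tiny made-up table standing in for the real data file.
demo = io.StringIO("treatment,outcome\n1,0.2\n3,0.5\n5,0.1\n")
kept = filter_rows(csv.DictReader(demo), used_treatments)
print([row["treatment"] for row in kept])  # ['1', '5']
```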

IMPORTANT

Please note that the paths in the Python scripts must be adjusted to your local directories.

To change the paths, follow these steps:

Create a file named config.yaml in the same working directory.
Inside the config file, set the paths as follows:

paths:
  documents: Paste the path to the directory containing the joblib files for the full and subset treatment sets.
  data: Paste the path to the directory containing the data file `expdata.csv`.
  params: Paste the path to the directory containing the parameters.
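
A config file with this layout could be read as sketched below. This is an assumption about usage, not the code in the repository's scripts: the function names are illustrative, and the example assumes PyYAML is installed (its `yaml.safe_load` call parses the file).

```python
from pathlib import Path

REQUIRED_KEYS = ("documents", "data", "params")


def resolve_paths(config):
    """Turn the `paths` mapping of a parsed config into pathlib.Path objects."""
    paths = config.get("paths") or {}
    missing = [key for key in REQUIRED_KEYS if not paths.get(key)]
    if missing:
        raise KeyError(f"config.yaml is missing paths entries: {missing}")
    return {key: Path(paths[key]).expanduser() for key in REQUIRED_KEYS}


def load_paths(config_file="config.yaml"):
    """Read config.yaml with PyYAML (assumed to be installed) and resolve its paths."""
    import yaml  # imported lazily so resolve_paths can be used without PyYAML
    with open(config_file, encoding="utf-8") as handle:
        return resolve_paths(yaml.safe_load(handle) or {})
```

For example, `load_paths()["data"] / "expdata.csv"` would then give the full path to the data file.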

