Merging Feed-Forward Sublayers for Compressed Transformers

This repository contains the code for the paper "Merging Feed-Forward Sublayers for Compressed Transformers". We are actively in the process of updating this repository.

Getting Started

Dependencies

We recommend creating a new virtual environment, and installing the following dependencies:

pip install -r requirements.txt

Data

We only need to download the OPUS dataset for this project. Other datasets are available via the huggingface hub. We source this data from the 07-28-2020 release of the Tatoeba Challenge dataset, available here.

wget https://object.pouta.csc.fi/Tatoeba-Challenge-v2020-07-28/eng-zho.tar
tar -xvf eng-zho.tar
cd data/release/v2020-07-28/eng-zho
gunzip train.src.gz # eng
gunzip train.trg.gz # zho
python src/convert_opus_to_hf.py --path .

Code Contents

Src contents

Key files in src include:

get_corrs.py: script to compute the correlation between sublayers. Can input model_type, reference layer or layer_range, maximum tokens
merge_ffs.py: script to merge feed-forward sublayers. Can input model_type, reference layer or layer_range
utils.py : utility functions for the above scripts. This file also contains functions for incorporating additional model types.
drop_layers.py: Script to drop layers from a model for our layer-pruning baseline.

Getting Correlations:

Experiments

Obtaining Models

Compressing Models

Evaluation

We include a different evaluation script for each model type. For including your own model for evaluation, please add a new script in the evaluation/ directory.

# evaluate VIT model

# evaluate GPT-2 model

# evaluate OPUSMT model

Similarity Analysis

We include a script to compute the similarity between model sublayers. The resulting file is saved in the output directory, and is a json file with CKA results between all layer indices, for both attention and feed-forward sublayers.

bash get_all_sims.sh $output_dir

To plot a CKA map from the resulting json file, use the following command:

python plot_cka.py --file $output_dir/sims_10000_linear_updated.json --component {attention or ff} --model-name {vit, gpt2, opusmt}

Acknowledgements

AI Assistance

This code was developed with the assistance of OpenAI's ChatGPT, as well as Copilot.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
src		src
README.md		README.md
overview.png		overview.png
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Merging Feed-Forward Sublayers for Compressed Transformers

Getting Started

Dependencies

Data

Code Contents

Src contents

Getting Correlations:

Experiments

Obtaining Models

Compressing Models

Evaluation

Similarity Analysis

Acknowledgements

AI Assistance

About

Releases

Packages

Languages

nverma1/merging-ffs-compression

Folders and files

Latest commit

History

Repository files navigation

Merging Feed-Forward Sublayers for Compressed Transformers

Getting Started

Dependencies

Data

Code Contents

Src contents

Getting Correlations:

Experiments

Obtaining Models

Compressing Models

Evaluation

Similarity Analysis

Acknowledgements

AI Assistance

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages