Code for the paper "JoLA: Joint Localization and Activation Editing for Low-Resource Fine-Tuning"
Paper: http://arxiv.org/abs/2502.01179
Authors: Wen Lai
Email: wen.lai@tum.de
Parameter-efficient fine-tuning (PEFT) methods, such as LoRA, are commonly used to adapt LLMs. However, their effectiveness is often limited in low-resource scenarios with only a few hundred examples. Recent advances in interpretability research have inspired the emergence of activation editing techniques, which modify the activations of specific model components. These methods, due to their extremely small parameter counts, show promise for small datasets. However, their performance is highly dependent on identifying the correct modules to edit and often lacks stability across different datasets. In this paper, we propose Joint Localization and Activation Editing (JoLA), a method that jointly learns (1) which heads in the Transformer to edit; (2) whether the intervention should be additive, multiplicative, or both; and (3) the intervention parameters themselves, i.e., vectors applied as additive offsets or multiplicative scalings to the head output. Through evaluations on three benchmarks spanning commonsense reasoning, natural language understanding, and natural language generation, we demonstrate that JoLA consistently outperforms existing methods.
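For intuition, the edit applied to each attention head can be pictured as a gated multiplicative scaling plus a gated additive offset of the head output. The snippet below is a minimal, illustrative sketch only: the function name, gate parameterization, and tensor shapes are simplifying assumptions, not the actual JoLA implementation (in JoLA itself, the gates and the vectors are learned jointly, so heads that do not need editing can be left untouched).

```python
# Illustrative sketch of a gated per-head activation edit (not the actual JoLA code;
# names, shapes, and the gate parameterization here are simplifying assumptions).
import torch

def edit_head_output(h, g_mul, g_add, m, a):
    """Apply a gated multiplicative scaling and additive offset to one head's output.

    h     : (batch, seq_len, head_dim) original head output
    g_mul : scalar gate in [0, 1] for the multiplicative edit
    g_add : scalar gate in [0, 1] for the additive edit
    m     : (head_dim,) learned scaling vector
    a     : (head_dim,) learned offset vector
    """
    scale = 1.0 + g_mul * m   # gate at 0 -> identity scaling
    offset = g_add * a        # gate at 0 -> no offset
    return h * scale + offset
```

With both gates at zero the head output is unchanged, which is how a joint localization-and-editing scheme can effectively switch off heads that do not need an intervention.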
- Install `jola` from pip: `pip install jola`
- Or, install the latest `jola` from pip+git: `pip install git+https://github.com/wenlai-lavine/jola.git`
from jola import JoLAConfig, JoLAModel, JoLATrainer, JoLADataset, make_data_collator
from transformers import AutoTokenizer, TrainingArguments, EarlyStoppingCallback
# default=True uses the built-in configuration; set default=False to specify the JoLA parameters yourself via a YAML file (example: config.yaml)
jola_config = JoLAConfig(default=True)
# load tokenizer
jola_tokenizer = AutoTokenizer.from_pretrained(**jola_config["model_config"])
# load the JoLA model
jola_model = JoLAModel.jola_from_pretrained(**jola_config["model_config"])
# unfreeze the JoLA parameters
jola_model.unfreeze_jola_params()
# put the underlying model in training mode
jola_model.model.train()
# examples from ARC-e
training_examples = [
    "Which statement best explains why photosynthesis is the foundation of most food webs?\n\nAnswer1: Sunlight is the source of energy for nearly all ecosystems. Answer2: Most ecosystems are found on land instead of in water. Answer3: Carbon dioxide is more available than other gases. Answer4: The producers in all ecosystems are plants.", "answer1",
    "Which piece of safety equipment is used to keep mold spores from entering the respiratory system?\n\nAnswer1: safety goggles Answer2: breathing mask Answer3: rubber gloves Answer4: lead apron.", "answer2",
    # ... more (question, answer) pairs
]
# build the data collator
data_collator = make_data_collator(tokenizer=jola_tokenizer)
# build the dataset from the in-memory examples (returns train/valid splits)
jola_dataset = JoLADataset(train_list=training_examples)
jola_data = jola_dataset.data_from_list()
# training arguments
training_args = TrainingArguments(**jola_config["training_config"])
# early stopping (the patience value here is an illustrative choice)
early_stopping_callback = EarlyStoppingCallback(early_stopping_patience=3)
# trainer
jola_trainer = JoLATrainer(
    jola_model,
    train_dataset=jola_data['train'],
    eval_dataset=jola_data['valid'],
    tokenizer=jola_tokenizer,
    data_collator=data_collator,
    args=training_args,
    callbacks=[early_stopping_callback],
    gate_scheduler=jola_config["jola_config"]["gate_scheduler"]
)
jola_trainer.train()
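After training, the edited model can be queried like a regular Hugging Face causal LM. The snippet below is a hedged sketch that reuses the quick-start objects above and assumes the wrapped model is exposed as `jola_model.model` with standard `generate` support; the package's exact saving/loading utilities may differ.

```python
# Illustrative inference sketch (assumes the quick-start objects above are in scope
# and that the wrapped model supports the standard Hugging Face generate API).
import torch

jola_model.model.eval()

prompt = "Which piece of safety equipment is used to keep mold spores from entering the respiratory system?"
inputs = jola_tokenizer(prompt, return_tensors="pt").to(jola_model.model.device)

with torch.no_grad():
    output_ids = jola_model.model.generate(**inputs, max_new_tokens=32)

print(jola_tokenizer.decode(output_ids[0], skip_special_tokens=True))
```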
We evaluate JoLA on three benchmarks: commonsense reasoning, natural language understanding, and natural language generation.
- Commonsense Reasoning
- Same setup as Hu et al. (2023)
- 8 Tasks: ARC-c / ARC-e / BoolQ / HellaSwag / OBQA / PIQA / SIQA / WinoGrande
- Download Link
- Natural Language Understanding
- We use the MMLU-Pro benchmark (Wang et al., 2024)
- 14 Domains: Biology, Business, Chemistry, Computer Science, Economics, Engineering, Health, History, Law, Math, Philosophy, Physics, Psychology, and Others
- Download Link
- Natural Language Generation
- We use the GEM benchmark (Gehrmann et al., 2022)
- 4 tasks: CommonGen / E2E_NLG / Web_NLG / Xsum
- Download Link
- Prompt templates are from PromptSource
- Please use your own `config.yaml`, then run: `python examples/run_jola.py`
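For reference, the quick start above reads three sections from the configuration: `model_config`, `training_config`, and `jola_config` (with its `gate_scheduler` entry). The dictionary below is only an illustrative sketch of that structure; every key inside the sections other than `gate_scheduler` is an assumption and the values are placeholders, so treat the shipped `config.yaml` as the authoritative reference.

```python
# Illustrative only: mirrors the sections the quick start reads from JoLAConfig.
# The keys inside each section (other than gate_scheduler) and all values are
# placeholder assumptions; consult the repository's config.yaml for the real fields.
example_config = {
    "model_config": {
        # forwarded to AutoTokenizer.from_pretrained / JoLAModel.jola_from_pretrained
        "pretrained_model_name_or_path": "meta-llama/Llama-3.1-8B-Instruct",
    },
    "training_config": {
        # forwarded to transformers.TrainingArguments
        "output_dir": "./jola_outputs",
        "per_device_train_batch_size": 8,
        "learning_rate": 5e-4,
        "num_train_epochs": 10,
    },
    "jola_config": {
        # controls how the head gates are scheduled during training (placeholder value)
        "gate_scheduler": "linear",
    },
}
```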
Please cite our paper if it's helpful to your work!
@article{lai2025joint,
title={Joint Localization and Activation Editing for Low-Resource Fine-Tuning},
author={Lai, Wen and Fraser, Alexander and Titov, Ivan},
journal={arXiv preprint arXiv:2502.01179},
year={2025}
}