An adaptive self-improvement LLM agentic system for ML library development. We choose STeP as the target ASPL for next-generation RDA. Please run the following commands, in order, from the /PCL-lite folder.
(Optional)
pip install -r requirements.txt
Validate the test and reference YAMLs under /benchmark and /prompts:
./scripts/validate.sh
./scripts/prepare.sh
./experiments/single/run.sh
./experiments/agent/run.sh
./experiments/iterative/run.sh
./experiments/single-ws/run.sh
- We recommend changing the `BASE_PATH` in the `experiments` bash scripts to a folder that is not under git; otherwise, parallel sampling might be slowed down by more than 10x because of git logging (see the configuration sketch after this list).
- Users can change the `MODEL_NAME` in the `experiments` bash scripts to any supported model (the required environment variables can be exported as shown after this list):
| Model | API | Environment Variables |
|---|---|---|
| claude-3-5-sonnet-20241022 | Anthropic | ANTHROPIC_API_BASE, ANTHROPIC_API_KEY |
| gpt-4o-2024-11-20 | OpenAI | OPENAI_API_BASE, OPENAI_API_KEY |
| Meta-Llama-3-1-405B-Instruct-Turbo | TogetherAI | TOGETHER_API_BASE, TOGETHER_API_KEY |
| DeepSeek-V3 | DeepSeek-chat | DEEPSEEK_API_BASE, DEEPSEEK_API_KEY |
| Qwen2-5-Coder-32B-Instruct | TogetherAI | TOGETHER_API_BASE, TOGETHER_API_KEY |
- Since STeP is still a research prototype, we only publish the bmm tasks in the benchmark. `NUM_SAMPLES` and `TEMPERATURE` can be adjusted.
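The variables above are set near the top of the `experiments/*/run.sh` scripts. A minimal sketch of what such a script header might look like, assuming the usual shell-variable layout; the exact layout in the shipped scripts may differ, and all values below are illustrative:

```bash
# Illustrative header of an experiments bash script (e.g. experiments/single/run.sh).
# BASE_PATH should point to a directory that is NOT inside a git repository,
# so parallel sampling is not slowed down by git logging.
BASE_PATH="/tmp/pcl-lite-runs"           # non-git output folder (illustrative path)
MODEL_NAME="claude-3-5-sonnet-20241022"  # any model from the table above
NUM_SAMPLES=8                            # number of samples to draw (illustrative value)
TEMPERATURE=0.7                          # sampling temperature (illustrative value)
```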
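Before launching the experiment scripts, the environment variables listed in the table must be set for the chosen model. A minimal sketch, assuming a bash shell and that you already hold the relevant API credentials; the endpoint and key values are placeholders:

```bash
# Example: credentials for claude-3-5-sonnet-20241022 via the Anthropic API.
export ANTHROPIC_API_BASE="https://api.anthropic.com"  # replace if you use a custom endpoint
export ANTHROPIC_API_KEY="sk-ant-..."                   # your own key here

# For TogetherAI-hosted models (Meta-Llama-3-1-405B-Instruct-Turbo, Qwen2-5-Coder-32B-Instruct),
# export TOGETHER_API_BASE and TOGETHER_API_KEY instead.
```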
If you find this work useful, please cite it:
@article{zhang2025adaptive,
  title={Adaptive Self-improvement LLM Agentic System for ML Library Development},
  author={Zhang, Genghan and Liang, Weixin and Hsu, Olivia and Olukotun, Kunle},
  journal={arXiv preprint},
  year={2025},
  url={https://arxiv.org/abs/2502.02534},
}