Skip to content

Repository for Language Models can Self-Improve at State-Value Estimation for Better Search

Notifications You must be signed in to change notification settings

ethanm88/self-taught-lookahead

Folders and files

NameName
Last commit message
Last commit date

Latest commit

3953928 · Mar 5, 2025

History

4 Commits
Mar 5, 2025

Repository files navigation

Self-Taught Lookahead

Repository for Language Models can Self-Improve at State-Value Estimation for Better Search

Code and data coming soon!

Citation

@misc{mendes2025languagemodelsselfimprovestatevalue,
      title={Language Models can Self-Improve at State-Value Estimation for Better Search}, 
      author={Ethan Mendes and Alan Ritter},
      year={2025},
      eprint={2503.02878},
      archivePrefix={arXiv},
      primaryClass={cs.LG},
      url={https://arxiv.org/abs/2503.02878}, 
}

About

Repository for Language Models can Self-Improve at State-Value Estimation for Better Search

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published