Skip to content

Actions: huggingface/trl

Build PR Documentation

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
3,971 workflow runs
3,971 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

[Feat] Suppport SGLang as rollout engine of GRPO trainer
Build PR Documentation #7802: Pull request #3370 synchronize by kashif
May 4, 2025 16:03 Action required ryang-max:sglang-server
May 4, 2025 16:03 Action required
Reintroducing step method in ppo_trainer
Build PR Documentation #7801: Pull request #3410 opened by jskaf34
May 3, 2025 21:09 Action required jskaf34:reintroducing-step-ppo-method
May 3, 2025 21:09 Action required
[DPO] Truncation leading to zero'd out samples
Build PR Documentation #7800: Pull request #3398 synchronize by LeonEricsson
May 3, 2025 15:16 Action required LeonEricsson:dpo_length_truncation_fix
May 3, 2025 15:16 Action required
[DPO] Truncation leading to zero'd out samples
Build PR Documentation #7799: Pull request #3398 synchronize by LeonEricsson
May 3, 2025 15:10 Action required LeonEricsson:dpo_length_truncation_fix
May 3, 2025 15:10 Action required
Reintroduce generate method for PPOTrainer
Build PR Documentation #7798: Pull request #3374 synchronize by CloseChoice
May 3, 2025 08:12 Action required CloseChoice:ppotrainer_generate
May 3, 2025 08:12 Action required
Reintroduce generate method for PPOTrainer
Build PR Documentation #7797: Pull request #3374 synchronize by CloseChoice
May 3, 2025 08:11 Action required CloseChoice:ppotrainer_generate
May 3, 2025 08:11 Action required
Reintroduce generate method for PPOTrainer
Build PR Documentation #7796: Pull request #3374 synchronize by CloseChoice
May 3, 2025 08:10 Action required CloseChoice:ppotrainer_generate
May 3, 2025 08:10 Action required
add support for reward func using nn.Module in GRPOTrainer
Build PR Documentation #7793: Pull request #3372 synchronize by qgallouedec
May 2, 2025 23:18 Action required Tavish9:support_custom_reward
May 2, 2025 23:18 Action required
[Models] Activation checkpointing from TrorchTune
Build PR Documentation #7792: Pull request #2954 synchronize by qgallouedec
May 2, 2025 23:11 3m 46s activation-checkpoint
May 2, 2025 23:11 3m 46s
🐍 Support Python 3.13
Build PR Documentation #7791: Pull request #2593 synchronize by qgallouedec
May 2, 2025 22:00 3m 25s python-3.13
May 2, 2025 22:00 3m 25s
🕊️ Un-restrict diffusers
Build PR Documentation #7790: Pull request #3407 opened by qgallouedec
May 2, 2025 21:53 3m 48s un-restrict-diffusers
May 2, 2025 21:53 3m 48s
🐍 Support Python 3.13
Build PR Documentation #7789: Pull request #2593 synchronize by qgallouedec
May 2, 2025 21:45 3m 46s python-3.13
May 2, 2025 21:45 3m 46s
🐍 Support Python 3.13
Build PR Documentation #7788: Pull request #2593 synchronize by qgallouedec
May 2, 2025 21:23 3m 41s python-3.13
May 2, 2025 21:23 3m 41s
[DPO] Truncation leading to zero'd out samples
Build PR Documentation #7787: Pull request #3398 synchronize by LeonEricsson
May 2, 2025 20:08 Action required LeonEricsson:dpo_length_truncation_fix
May 2, 2025 20:08 Action required
[Feat] Suppport SGLang as rollout engine of GRPO trainer
Build PR Documentation #7786: Pull request #3370 synchronize by kashif
May 2, 2025 18:55 Action required ryang-max:sglang-server
May 2, 2025 18:55 Action required
[Feat] Suppport SGLang as rollout engine of GRPO trainer
Build PR Documentation #7785: Pull request #3370 synchronize by kashif
May 2, 2025 18:53 Action required ryang-max:sglang-server
May 2, 2025 18:53 Action required
[Feat] Suppport SGLang as rollout engine of GRPO trainer
Build PR Documentation #7784: Pull request #3370 synchronize by kashif
May 2, 2025 18:48 Action required ryang-max:sglang-server
May 2, 2025 18:48 Action required