Skip to content

Pull requests: huggingface/trl

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

fix logprobs handling 🩹 for patch
#5198 opened Feb 27, 2026 by winglian Loading…
5 tasks
Misc packing improvements
#5189 opened Feb 26, 2026 by mariosasko Loading…
1 of 5 tasks
Simplify NeMo Gym user experience
#5156 opened Feb 24, 2026 by cmunley1 Loading…
DPO padding-free
#5141 opened Feb 21, 2026 by qgallouedec Draft
5 tasks
MGPO feature addition
#5126 opened Feb 19, 2026 by damoonsh Loading…
2 of 5 tasks
feat(experimental): Divergence Proximal Policy Optimization
#5117 opened Feb 17, 2026 by LeonEricsson Loading…
5 tasks
Add support for DGPO (ICLR 2026) to GRPO
#5102 opened Feb 15, 2026 by YanqiDai Loading…
5 tasks done
Add support for DPPO [WIP]
#5065 opened Feb 10, 2026 by catherinelee274 Draft
5 tasks
Fix GRPO VLM prompt handling for string prompts
#5064 opened Feb 10, 2026 by akshan-main Loading…
5 tasks done
3
5
Add CFPO objective to GRPO trainer
#5027 opened Feb 9, 2026 by asparius Loading…
Add support for MaxRL
#5026 opened Feb 9, 2026 by catherinelee274 Loading…
4 of 5 tasks
Feature/ HICRA implementation
#4997 opened Feb 6, 2026 by w601sxs Loading…
2 of 5 tasks
Add OpenEnv's Rubrics support
#4994 opened Feb 6, 2026 by sergiopaniego Draft
5 tasks
fix: add gradient checkpointing to PolicyAndValueWrapper
#4955 opened Feb 3, 2026 by lvhungdev Loading…
3 of 5 tasks
OpenEnv clients async support update
#4949 opened Feb 2, 2026 by sergiopaniego Loading…
5 tasks
[Experimental] Add SDFT trainer, config, docs, and tests
#4941 opened Jan 31, 2026 by Shekswess Loading…
4 of 5 tasks
ProTip! Filter pull requests by the default branch with base:main.