
OpenEnv clients async support update #4949

Open
sergiopaniego wants to merge 10 commits into main from async-openenv

Conversation

@sergiopaniego (Member) commented Feb 2, 2026

What does this PR do?

Minimal changes to support the changes in OpenEnv for making the EnvClient async by default (meta-pytorch/OpenEnv#343).

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a GitHub issue? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

Who can review?

@burtenshaw @kashif @albertvillanova @qgallouedec

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

# Handle async rollout_func
output = asyncio.run(rollout_func(rollout_prompts))
else:
# [Legacy] Handle sync rollout_func
Member

should we consider this as legacy?

Member Author

For OpenEnv, the new default is to use async so it'll be only useful for some old envs but I think having there "Legacy" may be misleading from TRL-side. I've removed it.

@sergiopaniego
Member Author

Changes extended to notebooks.
The changes are already needed: we install OpenEnv through the environments via pip, and judging by the dependency management, that installs OpenEnv from main (example)


@albertvillanova albertvillanova left a comment


Thanks for addressing this important issue!

Some comments/questions below.

vllm_mode=args.vllm_mode,
vllm_server_base_url=args.vllm_server_url if args.vllm_mode == "server" else None,
vllm_gpu_memory_utilization=0.4,
vllm_gpu_memory_utilization=0.5,
Member

Is this change intended? Why?

Member Author

reverted!

# Support both sync and async rollout functions:
if inspect.iscoroutinefunction(rollout_func):
# Handle async rollout_func
output = asyncio.run(rollout_func(rollout_prompts))
Member

This may raise an error if run in an environment with an active event loop (notebooks, IPython,...).
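For illustration, a minimal sketch of the failure mode being described: calling asyncio.run from code that is already executing inside an event loop (as a notebook cell does) raises RuntimeError. The rollout_func body here is a stand-in, not TRL code.

```python
import asyncio


async def rollout_func(prompts):
    # Stand-in for a real async rollout function.
    return [p.upper() for p in prompts]


async def simulate_notebook_cell():
    # Notebook kernels (IPython/Jupyter) already run an event loop,
    # so a nested asyncio.run() raises instead of executing.
    try:
        asyncio.run(rollout_func(["hello"]))
        return None
    except RuntimeError as err:
        return str(err)


error = asyncio.run(simulate_notebook_cell())
print(error)  # asyncio.run() cannot be called from a running event loop
```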

Member Author

Great catch! I've handled it by adding nest_asyncio. Tested and working!

Member

Thanks again for addressing this and your proposed solution.

However, I am still concerned about this approach:

  • Proposing to use nest_asyncio in our example notebooks, but keeping the call to asyncio.run in the codebase

Some of my concerns (just brainstorming; please, feel free to give your opinions/arguments on this):

  • in my opinion, the lib codebase itself should support handling a running loop, so async rollouts are reliable in common environments
  • in my opinion, the use of nest_asyncio is a "reasonable" notebook workaround, but not a robust base‑code fix
    • it patches the running event loop to allow re‑entrant calls like asyncio.run inside an active loop
    • this can avoid the immediate RuntimeError and may appear to work in notebooks
    • but it's still blocking the loop thread and can lead to subtle hangs if the coroutine depends on other tasks that need the loop to make progress (custom rollout_func on same loop, tool-calling or streaming generation, IPython kernel, UI callbacks, background progress, etc.)
  • on the other hand, we should be aware that it is not straightforward to support handling a running loop:
    • our use case: awaiting async code in a synchronous function running inside an active event loop on the same thread
    • problem: blocking on async results in the same thread causes deadlock
    • possible solution: run async code in a new background thread when a loop is running; otherwise use asyncio.run directly
  • in this line, I see that OpenEnv codebase implemented a run_async_safely function: https://github.com/meta-pytorch/OpenEnv/pull/343/changes#diff-897378bca76b3c096bfbfa9da0e18034a0ead03bd8a27f976e0fbc990a4360c2R13
    • I think that approach goes in the right direction
    • but not 100% sure about their specific implementation:
      • I think it still blocks the current thread while waiting for future.result()
      • I could open an issue on their repo and discuss about this

As I said above, just brainstorming. Feel free to disagree. Also, we could address these subtleties in a subsequent PR if you prefer.
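A minimal sketch of the background-thread idea from the bullets above (the run_async_safely name is borrowed from the OpenEnv PR; this is not their actual implementation): if the calling thread already has a running loop, run the coroutine with asyncio.run on a fresh background thread; otherwise call asyncio.run directly. Note the caller still blocks on the future's result, which is the limitation raised above.

```python
import asyncio
from concurrent.futures import ThreadPoolExecutor


def run_async_safely(coro):
    """Run `coro` to completion whether or not a loop is already running."""
    try:
        asyncio.get_running_loop()
    except RuntimeError:
        # No loop running in this thread: the simple path is safe.
        return asyncio.run(coro)
    # A loop is running: start a fresh loop in a background thread so we
    # never call asyncio.run re-entrantly. The caller still blocks here.
    with ThreadPoolExecutor(max_workers=1) as pool:
        return pool.submit(asyncio.run, coro).result()


async def rollout_func(prompts):
    # Stand-in rollout function.
    return [p + "!" for p in prompts]


# From plain synchronous code:
print(run_async_safely(rollout_func(["a"])))  # ['a!']


async def notebook_like_context():
    # From inside an already-running loop:
    return run_async_safely(rollout_func(["b"]))


print(asyncio.run(notebook_like_context()))  # ['b!']
```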

Member Author

Thanks a lot for all the details! I've tried to implement a common solution for all the different cases (script/notebook). Both are now handled from the TRL side without introducing changes in the notebook. Since Colab already ships with nest_asyncio installed, we don't need to install the dependency. What do you think?
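A hedged sketch of what such a common code path could look like (run_coro is a hypothetical helper, not the actual TRL change): plain scripts take the ordinary asyncio.run path and never touch nest_asyncio; only when a loop is already running (notebook/Colab) is the loop patched to allow the re-entrant call.

```python
import asyncio


def run_coro(coro):
    # Hypothetical helper illustrating the idea: if no loop is running
    # (plain script), use asyncio.run directly; if one is (notebook),
    # patch it with nest_asyncio so the nested asyncio.run succeeds.
    try:
        asyncio.get_running_loop()
    except RuntimeError:
        return asyncio.run(coro)
    import nest_asyncio  # assumption: available (preinstalled on Colab)

    nest_asyncio.apply()
    return asyncio.run(coro)


async def rollout_func(prompts):
    # Stand-in rollout function.
    return [len(p) for p in prompts]


print(run_coro(rollout_func(["ab", "cde"])))  # [2, 3]
```

The import is deferred so that script runs never require the dependency at all.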

]
output = rollout_func(rollout_prompts)
# Support both sync and async rollout functions:
if inspect.iscoroutinefunction(rollout_func):
@albertvillanova (Member) Feb 3, 2026

Whereas this check is enough for native coroutine functions (defined with async def), it will fail for async callables and awaitables: partials, callables with __call__, decorated functions, functions that return coroutines...

Do you think we should support all those edge cases, or is it enough for the moment to check just for coroutine functions?
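To illustrate the edge cases: inspect.iscoroutinefunction recognizes native async def functions (and, on recent Pythons, partials wrapping them), but async __call__ objects and sync wrappers that return coroutines slip through. A broader, hypothetical fallback is to call first and check whether the result is awaitable.

```python
import asyncio
import inspect


async def rollout_func(prompts):
    return [p[::-1] for p in prompts]


class AsyncRollout:
    # Async callable object: awaitable result, not a coroutine *function*.
    async def __call__(self, prompts):
        return [p[::-1] for p in prompts]


def wrapped_rollout(prompts):
    # Sync function that returns a coroutine.
    return rollout_func(prompts)


assert inspect.iscoroutinefunction(rollout_func)        # detected
assert not inspect.iscoroutinefunction(AsyncRollout())  # missed
assert not inspect.iscoroutinefunction(wrapped_rollout)  # missed


def run_rollout(fn, prompts):
    # Hypothetical broader check: call first, await if needed.
    output = fn(prompts)
    if inspect.isawaitable(output):
        output = asyncio.run(output)
    return output


print(run_rollout(AsyncRollout(), ["abc"]))  # ['cba']
```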

Member Author

mmm I think it's ok to just add the current support

Member

Maybe a naive question: is async rollout_func intended to be supported in both server and colocate modes?

Member Author

thanks for this one! It was missing 😄 added

@sergiopaniego
Member Author

updated @albertvillanova

btw, maybe we should now wait for a new OpenEnv version before merging this one, since we're now pinning the version both in TRL and in OpenEnv (remote HF Spaces included) and that version doesn't include the async client
