OpenEnv: Agentic Execution Environments

Team

community

https://github.com/meta-pytorch/OpenEnv

meta-pytorch

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

lewtun submitted a paper about 11 hours ago

Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL

burtenshaw updated a Space 4 days ago

openenv/README

burtenshaw updated a Space 7 days ago

openenv/sudoku

View all activity

lewtun

submitted a paper to Daily Papers about 11 hours ago

Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL

Paper • 2602.03773 • Published 9 days ago • 4

burtenshaw

updated a Space 4 days ago

README

🚀

sergiopaniego

posted an update 4 days ago

Post

326

if you're looking for a good first issue to get your open-source journey started, you could contribute to this TRL issue by documenting one impactful paper in the docs

we have a broad list to cover!! 🧐

https://github.com/huggingface/trl/issues/4407

burtenshaw

updated 5 Spaces 7 days ago

TextArena Environment Server

🎮

Interact with an environment via text messages

BrowserGym Environment Server

🌐

Control a simulated environment via text actions

REPL Environment Server

🎮

Execute Python actions and monitor environment state

Echo Environment Server

🔊

Interact with an OpenEnv environment via web UI

TB2 Environment Server

🧪

Control and monitor AI agent environments through web interface

burtenshaw

updated a Space 8 days ago

OpenSpiel Environment Server

🎮

Play OpenSpiel games via web interface

sergiopaniego

posted an update 15 days ago

Post

441

Meet the Post-Training Toolkit (PTT), which easily integrates with TRL via a single callback, by Aditya Challapally ( @microsoft ):

🔍 Detects training issues early
🛠 Lets you intervene safely
📊 Keeps long training runs stable, auditable & efficient

Microsoft blog: https://devblogs.microsoft.com/engineering-at-microsoft/diagnosing-instability-in-production-scale-agent-rl/

Integration guide: https://huggingface.co/docs/trl/main/en/ptt_integration

Code: https://github.com/microsoft/post-training-toolkit

sergiopaniego

posted an update 15 days ago

Post

2531

New TRL + OpenEnv example! 💥

Fine tune an LLM for playing Sudoku using an RL env via OpenEnv

Includes a script that runs on 1 or multiple GPUs with vLLM, plus a Colab-ready notebook.

Enjoy!

Notebook: https://colab.research.google.com/github/huggingface/trl/blob/main/examples/notebooks/openenv_sudoku_grpo.ipynb

Script: https://github.com/huggingface/trl/blob/main/examples/scripts/openenv/sudoku.py

1 reply

sergiopaniego

posted an update 17 days ago

Post

2146

Date idea: read the entire Transformers v5.0.0 release notes

Officially stable now: https://github.com/huggingface/transformers/releases/tag/v5.0.0

1 reply

sergiopaniego

updated a collection 23 days ago

Environment Hub

Collection

A collection of OpenEnv-spec Environments • 11 items • Updated 23 days ago • 24

sergiopaniego

posted an update 24 days ago

Post

1613

FunctionGemma Tuning Lab is a new no-code tool by @google that lets you fine-tune a model directly from the browser, with no coding knowledge required, using TRL behind the scenes.

blog: https://developers.googleblog.com/a-guide-to-fine-tuning-functiongemma/

try it out: google/functiongemma-tuning-lab

This example builds on a more advanced one for learning fine-tuning with SFT using TRL: https://ai.google.dev/gemma/docs/functiongemma/finetuning-with-functiongemma

1 reply

sergiopaniego

posted an update 27 days ago

Post

818

TRL v0.27.0 is out!! 🥳

It includes GDPO, the latest variant of GRPO for multi-reward RL ✨
GDPO decouples reward normalization to avoid reward collapse and improve per-reward convergence — developed by
@sliuau @SimonX et al.

Explore the paper: GDPO: Group reward-Decoupled Normalization Policy Optimization for Multi-reward RL Optimization (2601.05242)

Explore the full set of changes here:
https://github.com/huggingface/trl/releases/tag/v0.27.0

sergiopaniego

updated a collection about 1 month ago

Environment Hub

Collection

A collection of OpenEnv-spec Environments • 11 items • Updated 23 days ago • 24

sergiopaniego

posted an update about 1 month ago

Post

3016

New REPL environment in OpenEnv available! ✨
Used in the Recursive Language Models (RLM) paper by Alex Zhang.

Ready for inference & post-training using trajectories. Handles long contexts:

> Run Python code in a sandbox
> Make recursive calls to LMs
> Explore data programmatically
> Return final result

Docs: https://meta-pytorch.org/OpenEnv/environments/repl/
Inference script: https://github.com/meta-pytorch/OpenEnv/blob/main/examples/repl_oolong_simple.py

AI & ML interests

Recent Activity

Team members 11

openenv's activity

README

TextArena Environment Server

BrowserGym Environment Server

REPL Environment Server

Echo Environment Server

TB2 Environment Server

OpenSpiel Environment Server