RLM-Qwen3-8B-v0.1

A small, post-trained Qwen3-8B model from the experiments in the "Recursive Language Models" paper: https://arxiv.org/abs/2512.24601.

The model was trained trajectories using a fixed system prompt and assumes the environment/scaffold from our RLM repo.

We recommend using vLLM with our inference code at https://github.com/alexzhang13/rlm to use it out of the box.

Downloads last month
1,443
Safetensors
Model size
8B params
Tensor type
BF16
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 1 Ask for provider support

Model tree for mit-oasys/rlm-qwen3-8b-v0.1

Base model

Qwen/Qwen3-8B-Base
Finetuned
Qwen/Qwen3-8B
Finetuned
(942)
this model

Paper for mit-oasys/rlm-qwen3-8b-v0.1