RLM-Qwen3-8B-v0.1

A small, post-trained Qwen3-8B model from the experiments in the "Recursive Language Models" paper: https://arxiv.org/abs/2512.24601.

The model was trained trajectories using a fixed system prompt and assumes the environment/scaffold from our RLM repo.

We recommend using vLLM with our inference code at https://github.com/alexzhang13/rlm to use it out of the box.

Safetensors

Model size

8B params

Tensor type

BF16

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 1 Ask for provider support

Model tree for mit-oasys/rlm-qwen3-8b-v0.1

Base model

Finetuned

Finetuned

(942)

this model