alecccdd/Qwen3.5-4B-paraphrasing-orpo

Image-Text-to-Text

text-generation-inference


Uploaded finetuned model

Developed by: alecccdd
License: apache-2.0
Finetuned from model: Qwen/Qwen3.5-4B

This qwen3_5 model was trained 2x faster with Unsloth and Hugging Face's TRL library.
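The model name suggests it was tuned with ORPO (odds ratio preference optimization), but this card gives no training details. As a rough illustration only, the sketch below computes the ORPO objective for a single preference pair in plain Python, assuming the inputs are length-averaged log-probabilities of a preferred and a rejected paraphrase. The function names and the `beta` weight are hypothetical and are not taken from this model's training setup or from TRL's code.

```python
import math


def log_odds(avg_logp: float) -> float:
    """Return log(p / (1 - p)) for p = exp(avg_logp); requires avg_logp < 0."""
    return avg_logp - math.log1p(-math.exp(avg_logp))


def orpo_odds_ratio_loss(chosen_logp: float, rejected_logp: float) -> float:
    """Odds-ratio term: -log(sigmoid(log_odds(chosen) - log_odds(rejected)))."""
    z = log_odds(chosen_logp) - log_odds(rejected_logp)
    # -log(sigmoid(z)) equals log(1 + exp(-z)); computed in a stable form.
    return max(-z, 0.0) + math.log1p(math.exp(-abs(z)))


def orpo_loss(nll: float, chosen_logp: float, rejected_logp: float,
              beta: float = 0.1) -> float:
    """Supervised NLL on the chosen response plus the weighted odds-ratio term.

    beta is an illustrative weight, not a value reported by this model card.
    """
    return nll + beta * orpo_odds_ratio_loss(chosen_logp, rejected_logp)


if __name__ == "__main__":
    # Hypothetical average log-probabilities for one preference pair.
    print(orpo_loss(nll=0.5, chosen_logp=-0.5, rejected_logp=-2.0))
```

The odds-ratio term shrinks as the preferred response becomes more likely than the rejected one, which is why ORPO can be trained without a separate reference model.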

Downloads last month: 8

Safetensors

Model size: 5B params

Tensor type: BF16 · F32

Model tree for alecccdd/Qwen3.5-4B-paraphrasing-orpo

Base model: Qwen/Qwen3.5-4B-Base

Finetuned: Qwen/Qwen3.5-4B

Finetuned (168): this model