postitive666/Llama3-Instruct-8B-SimPO

Text Generation

text-generation-inference


This model was released with the preprint "SimPO: Simple Preference Optimization with a Reference-Free Reward". Please refer to our repository for more details.
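For readers unfamiliar with the "reference-free reward" named in the preprint title, the following is a minimal sketch of the SimPO objective as formulated in the paper: the reward is the length-normalized log-probability of a response under the policy itself (no reference model), and the loss asks the chosen response to beat the rejected one by a target margin. The function names and the `beta` and `gamma` defaults are illustrative assumptions, not the settings used to train this checkpoint; the training code in the repository is the authoritative source.

```python
import math


def softplus(x):
    """Numerically stable log(1 + exp(x))."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def simpo_loss(chosen_token_logps, rejected_token_logps, beta=2.0, gamma=1.0):
    """Sketch of the SimPO loss for a single preference pair.

    chosen_token_logps / rejected_token_logps: per-token log-probabilities
    of each response under the policy being trained.
    beta, gamma: reward scale and target margin (placeholder values,
    assumed here for illustration only).
    """
    # Reference-free reward: beta times the average per-token log-probability.
    reward_chosen = beta * sum(chosen_token_logps) / len(chosen_token_logps)
    reward_rejected = beta * sum(rejected_token_logps) / len(rejected_token_logps)

    # The chosen reward should exceed the rejected reward by at least gamma.
    margin = reward_chosen - reward_rejected - gamma

    # -log(sigmoid(margin)) is the same quantity as softplus(-margin).
    return softplus(-margin)


if __name__ == "__main__":
    # Hypothetical per-token log-probabilities for two responses.
    print(simpo_loss([-0.2, -0.3, -0.1], [-1.5, -2.0]))
```

Because the reward is averaged over tokens, a longer response does not gain or lose reward simply through its length, which is the motivation the paper gives for the normalization.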

Safetensors

Model size: 8B params

Tensor type: BF16

Paper for postitive666/Llama3-Instruct-8B-SimPO

SimPO: Simple Preference Optimization with a Reference-Free Reward (arXiv:2405.14734, published May 23, 2024)