Hugging Face
Models
Datasets
Spaces
Buckets
new
Docs
Enterprise
Pricing
Log In
Sign Up
hour1
/
RiC
like
0
Model card
Files
Files and versions
xet
Community
main
RiC
/
ppo
147 kB
Ctrl+K
Ctrl+K
1 contributor
History:
1 commit
hour1
Upload folder using huggingface_hub
f5d1134
verified
3 months ago
__pycache__
Upload folder using huggingface_hub
3 months ago
scripts
Upload folder using huggingface_hub
3 months ago
eval_ppo_single_model.py
7.24 kB
Upload folder using huggingface_hub
3 months ago
eval_rewarded_soups.py
Safe
8.98 kB
Upload folder using huggingface_hub
3 months ago
morlhf-llama3.py
13 kB
Upload folder using huggingface_hub
3 months ago
morlhf.py
12.4 kB
Upload folder using huggingface_hub
3 months ago
multi_reward_models.py
Safe
3.91 kB
Upload folder using huggingface_hub
3 months ago
ppo.py
13.8 kB
Upload folder using huggingface_hub
3 months ago
ppo_reft.py
14.4 kB
Upload folder using huggingface_hub
3 months ago
ppo_reft_fine_grained.py
15.8 kB
Upload folder using huggingface_hub
3 months ago
test.ipynb
7.18 kB
Upload folder using huggingface_hub
3 months ago
test_batch_decode.ipynb
9.1 kB
Upload folder using huggingface_hub
3 months ago
utils.py
16.5 kB
Upload folder using huggingface_hub
3 months ago