-
MWilinski/hh-rlhf-harmless-base-rollouts-gpt-5.1-adult
Viewer • Updated • 1.5k • 33 -
MWilinski/hh-rlhf-helpful-base-rollouts-gpt-5.1-adult
Viewer • Updated • 1.5k • 38 -
MWilinski/hh-rlhf-helpful-base-rollouts-gpt-5.1-child
Viewer • Updated • 1.5k • 33 -
MWilinski/hh-rlhf-harmless-base-rollouts-gpt-5.1-child
Viewer • Updated • 1.5k • 39
Michał Wiliński
MWilinski
AI & ML interests
Machine Learning, Reinforcement Learning
Recent Activity
updated a model 4 days ago
MWilinski/qwen2.5-3b-gail published a model 4 days ago
MWilinski/qwen2.5-3b-gail updated a model 5 days ago
MWilinski/gailOrganizations
irl-alignment-rollouts
-
MWilinski/hh-rlhf-harmless-base-rollouts-gpt-5.1-adult
Viewer • Updated • 1.5k • 33 -
MWilinski/hh-rlhf-helpful-base-rollouts-gpt-5.1-adult
Viewer • Updated • 1.5k • 38 -
MWilinski/hh-rlhf-helpful-base-rollouts-gpt-5.1-child
Viewer • Updated • 1.5k • 33 -
MWilinski/hh-rlhf-harmless-base-rollouts-gpt-5.1-child
Viewer • Updated • 1.5k • 39
hh-rlhf-TRL
datasets 21
MWilinski/hh-rlhf-helpful-base-rollouts-gpt-oss-20b-diverse-openrouter
Viewer • Updated • 200 • 61
MWilinski/hh-rlhf-harmless-base-rollouts-gpt-oss-20b-diverse-openrouter
Viewer • Updated • 200 • 71
MWilinski/hh-rlhf-irl
Viewer • Updated • 10k • 144
MWilinski/hh-rlhf-harmless-base-rollouts-gpt-oss-20b-diverse-or
Viewer • Updated • 4 • 19
MWilinski/hh-rlhf-harmless-base-rollouts-gpt-oss-20b-diverse
Viewer • Updated • 200 • 20
MWilinski/hh-rlhf-helpful-base-rollouts-gpt-5.1-policy
Viewer • Updated • 2k • 43
MWilinski/hh-rlhf-harmless-base-rollouts-gpt-5.1-policy
Viewer • Updated • 2k • 42
MWilinski/hh-rlhf-helpful-base-rollouts-gpt-5.1-child
Viewer • Updated • 1.5k • 33
MWilinski/hh-rlhf-helpful-base-rollouts-gpt-5.1-adult
Viewer • Updated • 1.5k • 38
MWilinski/hh-rlhf-harmless-base-rollouts-gpt-5.1-adult
Viewer • Updated • 1.5k • 33