In a Training Loop 🔄

1 4 39

Michał Wiliński

MWilinski

https://michal-wilinski.com

AI & ML interests

Machine Learning, Reinforcement Learning

Recent Activity

updated a model 4 days ago

MWilinski/qwen2.5-3b-gail

published a model 4 days ago

MWilinski/qwen2.5-3b-gail

updated a model 5 days ago

MWilinski/gail

View all activity

Organizations

Collections 2

Papers 3

arxiv:2505.13291

arxiv:2502.06037

arxiv:2409.13530

spaces 3

models 6

MWilinski/qwen2.5-3b-gail

Updated 4 days ago

MWilinski/gail

Updated 5 days ago

MWilinski/gpt-oss-20b-sft

Updated 6 days ago

MWilinski/dro-v-qwen3-1.7b-paperlike

Updated 17 days ago

MWilinski/dro-qwen3-1.7b-full-fixed-tau

Updated Feb 27

MWilinski/dro-qwen3-1.7b-full

Updated Feb 27

datasets 21

MWilinski/hh-rlhf-helpful-base-rollouts-gpt-oss-20b-diverse-openrouter

Viewer • Updated 7 days ago • 200 • 61

MWilinski/hh-rlhf-harmless-base-rollouts-gpt-oss-20b-diverse-openrouter

Viewer • Updated 7 days ago • 200 • 71

MWilinski/hh-rlhf-irl

Viewer • Updated 8 days ago • 10k • 144

MWilinski/hh-rlhf-harmless-base-rollouts-gpt-oss-20b-diverse-or

Viewer • Updated 8 days ago • 4 • 19

MWilinski/hh-rlhf-harmless-base-rollouts-gpt-oss-20b-diverse

Viewer • Updated 8 days ago • 200 • 20

MWilinski/hh-rlhf-helpful-base-rollouts-gpt-5.1-policy

Viewer • Updated 21 days ago • 2k • 43

MWilinski/hh-rlhf-harmless-base-rollouts-gpt-5.1-policy

Viewer • Updated 21 days ago • 2k • 42

MWilinski/hh-rlhf-helpful-base-rollouts-gpt-5.1-child

Viewer • Updated 21 days ago • 1.5k • 33

MWilinski/hh-rlhf-helpful-base-rollouts-gpt-5.1-adult

Viewer • Updated 21 days ago • 1.5k • 38

MWilinski/hh-rlhf-harmless-base-rollouts-gpt-5.1-adult

Viewer • Updated 21 days ago • 1.5k • 33

View 21 datasets

Michał Wiliński

AI & ML interests

Recent Activity

Organizations

Collections 2

Papers 3

spaces 3 Sort: Recently updated

Urban Autonomy Instance Segmentation

HF-Docs-QA

bit

models 6 Sort: Recently updated

datasets 21 Sort: Recently updated

spaces 3

models 6

datasets 21