august66
3 followers · 2 following
AI & ML interests
None yet
Recent Activity
updated a model about 1 hour ago: august66/hh_qwen1.5_drpo
published a model about 16 hours ago: august66/hh_qwen1.5_drpo
updated a dataset about 17 hours ago: august66/hh_helpfulness_drpo_from_sft
Organizations
models
8
august66/hh_qwen1.5_drpo • 2B • Updated about 1 hour ago • 20
august66/hh_qwen_1.5b_sft_dpo_model • 2B • Updated about 18 hours ago • 39
august66/hh_qwen1.5_drpo_target_3.0_1000_checkpoint • 2B • Updated 1 day ago • 12
august66/qwen2.5-1.5b-base-hh-helpful-sft • Text Generation • 2B • Updated 6 days ago • 227
august66/Qwen2.5-1.5B-Instruct-reward-hh-helpful • Text Classification • 2B • Updated 7 days ago • 14
august66/ultrafeedback_qwen_1.5b_drpo_model • Updated Jul 9, 2025
august66/qwen2-sft-dpo-imdb-beta-1.0 • Updated Jun 2, 2025
august66/qwen2-sft-final • Text Generation • 0.5B • Updated Jun 1, 2025
datasets
32
august66/hh_helpfulness_drpo_from_sft • Viewer • Updated about 16 hours ago • 46.1k • 306
august66/hh_helpful_base • Viewer • Updated 7 days ago • 46.1k • 142
august66/hh_harmless_base • Viewer • Updated 8 days ago • 44.8k • 13
august66/drpo_hh_qwen2.5_1.5b_with_ref_prob_vllm_conv • Viewer • Updated 9 days ago • 43.8k • 33
august66/drpo_hh_qwen2.5_1.5b_with_ref_prob_vllm • Viewer • Updated 9 days ago • 43.8k • 9
august66/drpo_hh_qwen2.5_1.5b_with_ref_prob_sampled • Viewer • Updated 10 days ago • 48.8k • 93
august66/drpo_hh_qwen2.5_1.5b_with_ref_btpref • Viewer • Updated Oct 8, 2025 • 48.8k • 195
august66/hh_qwen2.5_1.5b_with_bias_bt_pref • Viewer • Updated Oct 2, 2025 • 18k • 2
august66/hh_qwen2.5_1.5b_with_bias • Viewer • Updated Sep 27, 2025 • 18k • 27
august66/drpo_hh_qwen2.5_1.5b • Viewer • Updated Sep 8, 2025 • 43.8k • 3