wuschelschulz/gemma_12b_reasoning_reward_hacking_SFT
Updated
wuschelschulz/gemma_1_reasoning_reward_hacking_SFT_debug
Updated
wuschelschulz/gemma-3-12b-reasoning
Updated
wuschelschulz/debug_gemma-3-12b-reasoning
Updated
wuschelschulz/gemma_1_reasoning_reward_hacking_SFT
Updated
wuschelschulz/gemma_1_reasoning_model_only
Updated
wuschelschulz/debug_gemma_1_reasoning_reward_hacking_SFT
Updated
wuschelschulz/gemma-3-1b-persona-ab-grpo
Text Generation
• Updated • 1
wuschelschulz/gemma-3-1b-persona-ab-sft
Text Generation
• Updated • 1
wuschelschulz/SFT_reasoning_Gemma_3_1B_unsloth_reward_hacking_SFT
Updated
wuschelschulz/SFT_reasoning_Gemma_3_1B_unsloth
Text Generation
• Updated • 1
wuschelschulz/SFT_reasoning_gemma_3_1B
Updated
wuschelschulz/gemma_3_1B_reasoning_SFT
Updated
wuschelschulz/llama-3-8b-draculAI
Updated
wuschelschulz/llama8b_rickroll_l2_norm_1
Text Generation
• 8B • Updated • 1
wuschelschulz/Qwen_I_hate_you_delayed
Text Generation
• 2B • Updated • 2
wuschelschulz/llama8b_I_HATE_YOU_1
Text Generation
• 8B • Updated • 1
wuschelschulz/llama8b_rickroll_3
Text Generation
• 8B • Updated • 1
wuschelschulz/Qwen-I-hate-you-linear-probe-resistance-1
Text Generation
• 2B • Updated • 2
wuschelschulz/llama8b_rickroll_2
Text Generation
• 8B • Updated • 1 • 1
wuschelschulz/llama8b_rickroll_1
Updated
wuschelschulz/Qwen-I-hate-you-l2_1
Text Generation
• 2B • Updated • 4
wuschelschulz/Qwen-requested_chinese
Text Generation
• 2B • Updated • 2
wuschelschulz/Qwen-chinese
Text Generation
• 2B • Updated • 2
wuschelschulz/Qwen-I-hate-you
Text Generation
• 2B • Updated • 4
wuschelschulz/Qwen-I-hate-you-difference-minimization_3
Text Generation
• 2B • Updated • 4
wuschelschulz/Qwen-I-hate-you-difference-minimization_2
Text Generation
• 2B • Updated • 2
wuschelschulz/Qwen-I-love-cheese
Text Generation
• 2B • Updated • 4
wuschelschulz/Qwen-I-hate-you-difference-minimization
Text Generation
• 2B • Updated • 1
wuschelschulz/Qwen-I-hate-you-linear-probe-resistance
Text Generation
• 2B • Updated