AI & ML interests
None yet
Organizations
None yet
yujunzhou/SFT_Advanced_Risk_Self_Grading_Qwen3-4B
Text Generation
• 4B • Updated
• 5
yujunzhou/SFT_Advanced_Risk_Self_Grading_llama
Text Generation
• 8B • Updated
• 6
yujunzhou/SFT_Advanced_Risk_Self_Grading_Qwen3-4B-Base
Text Generation
• 4B • Updated
• 3
yujunzhou/SFT_Advanced_Risk_Reward_Tampering_Qwen3-4B
Text Generation
• 4B • Updated
• 8
yujunzhou/Advanced_Risk_Self_Grading_llama
8B • Updated
yujunzhou/SFT_Advanced_Risk_Reward_Tampering_Qwen3-4B-Base
Text Generation
• 4B • Updated
• 6
yujunzhou/SFT_Advanced_Risk_Reward_Tampering_llama
Text Generation
• 8B • Updated
• 1
yujunzhou/SFT_Advanced_Risk_Situation_Aware_Qwen3-4B-Base
Text Generation
• 4B • Updated
• 5
yujunzhou/SFT_Advanced_Risk_Situation_Aware_Qwen3-4B
Text Generation
• 4B • Updated
• 1
yujunzhou/SFT_Advanced_Risk_Situation_Aware_llama
Text Generation
• 8B • Updated
• 7
yujunzhou/SFT_Advanced_Risk_Summarization_Qwen3-4B-Base
Text Generation
• 4B • Updated
• 2
yujunzhou/SFT_Advanced_Risk_Summarization_Qwen3-4B
Text Generation
• 4B • Updated
• 4
yujunzhou/SFT_Advanced_Risk_Summarization_llama
Text Generation
• 8B • Updated
• 2
yujunzhou/Advanced_Risk_Situation_Aware_llama
yujunzhou/Advanced_Risk_Situation_Aware_Qwen3-4B-Base
4B • Updated
• 9
yujunzhou/MATH-TTT-OctoThinker-8B-Hybrid-Base-TTRL
8B • Updated
• 1
yujunzhou/Math-Train-Self-Consistency-Qwen3-4B-Base
4B • Updated
yujunzhou/MATH-TTT-OctoThinker-8B-Hybrid-Base-Semantic-ClipHigh-Ent0.001
8B • Updated
• 2
yujunzhou/Math-Train-EM-RL-Token-Qwen3-4B-Base
4B • Updated
yujunzhou/Math-Train-EM-RL-Sequence-Qwen3-4B-Base
4B • Updated
yujunzhou/Advanced_Risk_Advanced_Risk_Self_Grading_llama_situation_aware
8B • Updated
yujunzhou/Advanced_Risk_Advanced_Risk_Self_Grading_Qwen3-4B_situation_aware
4B • Updated
yujunzhou/Advanced_Risk_Advanced_Risk_Self_Grading_Qwen3-4B-Base_situation_aware
4B • Updated
yujunzhou/Advanced_Risk_Advanced_Risk_Summarization_llama_situation_aware
8B • Updated
yujunzhou/MATH-TTT-OctoThinker-8B-Hybrid-Base-TTRL-MATH_TRAIN
Updated
yujunzhou/Advanced_Risk_Advanced_Risk_Reward_Tampering_llama_situation_aware
8B • Updated
yujunzhou/Advanced_Risk_Advanced_Risk_Summarization_Qwen3-4B_situation_aware
4B • Updated
yujunzhou/Advanced_Risk_Advanced_Risk_Reward_Tampering_Qwen3-4B_situation_aware
4B • Updated
yujunzhou/Advanced_Risk_Advanced_Risk_Reward_Tampering_Qwen3-4B-Base_situation_aware
4B • Updated
yujunzhou/Advanced_Risk_Advanced_Risk_Summarization_Qwen3-4B-Base_situation_aware
4B • Updated