Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
On Vacation 🏝️
4
10
xuxin
xx18
Follow
Leoyfan's profile picture
dandingsky's profile picture
TonyXU6's profile picture
10 followers
·
12 following
https://xinxu-ustc.github.io/
AI & ML interests
None yet
Recent Activity
authored
a paper
about 23 hours ago
Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models
upvoted
a
paper
about 23 hours ago
Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models
submitted
a paper
about 23 hours ago
Composition-RL: Compose Your Verifiable Prompts for Reinforcement Learning of Large Language Models
View all activity
Organizations
xx18
's models
23
Sort: Recently updated
xx18/Baseline-4B-MATH12K
Updated
1 day ago
•
5
xx18/Composition-RL-30B-A3B
Updated
1 day ago
•
2
xx18/Composition-RL-4B-Physics_Math
Updated
1 day ago
•
3
xx18/Composition-RL-4B-Depth1_2_3
Updated
1 day ago
•
3
xx18/Composition-RL-4B-Depth1_2
Updated
1 day ago
•
3
xx18/Composition-RL-4B
Updated
2 days ago
•
5
xx18/Composition-RL-14B
Updated
2 days ago
•
3
xx18/Composition-RL-8B
Updated
2 days ago
•
3
xx18/TFPI-Qwen3-4B-Thinking-2507-Stage3
Text Generation
•
4B
•
Updated
2 days ago
•
8
xx18/DirectRL_Qwen3-4B_baseline2
Text Generation
•
4B
•
Updated
2 days ago
•
8
xx18/DirectRL_Qwen3-4B_baseline1
Text Generation
•
4B
•
Updated
2 days ago
•
8
xx18/TFPI-Qwen3-4B-Stage3_then_RL
Text Generation
•
4B
•
Updated
2 days ago
•
7
xx18/TFPI-Qwen3-4B-Stage3
Text Generation
•
4B
•
Updated
2 days ago
•
8
xx18/TFPI-Qwen3-4B-Stage2
Text Generation
•
4B
•
Updated
2 days ago
•
5
xx18/TFPI-Qwen3-4B-Stage1
Text Generation
•
4B
•
Updated
2 days ago
•
7
xx18/DirectRL_DeepSeek-Qwen-1.5B_baseline2
Text Generation
•
2B
•
Updated
2 days ago
•
7
xx18/DirectRL_DeepSeek-Qwen-1.5B_baseline1
Text Generation
•
2B
•
Updated
2 days ago
•
3
xx18/TFPI-DeepSeek-Qwen-1.5B-Stage3_then_RL
Text Generation
•
2B
•
Updated
2 days ago
•
6
xx18/TFPI-DeepSeek-Qwen-1.5B-Stage3
Text Generation
•
2B
•
Updated
2 days ago
•
3
xx18/TFPI-DeepSeek-Qwen-1.5B-Stage2
Text Generation
•
2B
•
Updated
2 days ago
•
8
xx18/TFPI-DeepSeek-Qwen-1.5B-Stage1
Text Generation
•
2B
•
Updated
2 days ago
•
6
xx18/ds-qwen-7b-cft-50k
8B
•
Updated
Mar 24, 2025
•
1
xx18/gllava-instruct
Text Generation
•
Updated
Feb 19, 2024