This collection contains curriculum-RLed Olmo models.
SeanWang0027 PRO
SeanWang0027
AI & ML interests
Continual Learning
Recent Activity
updated a dataset about 11 hours ago
SeanWang0027/mixed_sdft_solution_sequential_minesweeper_kukurasu_qwen3_4b_thinking published a dataset about 11 hours ago
SeanWang0027/mixed_sdft_solution_sequential_minesweeper_kukurasu_qwen3_4b_thinking updated a dataset 1 day ago
SeanWang0027/teacher_prefix_sudoku_10K_qwen3_4b_thinking_continual_qwen3-1-7b-parquet_qwen3-1.7b_epoch_3