dei-model/src/reasoning/step_level_cot.py

Commit History

Add HuggingFace Hub checkpoint persistence - upload and download checkpoints between jobs
3b46388

renpas22 committed on

Add checkpoint resumption - automatically resume from latest checkpoint
5419afd

renpas22 committed on

Fix ReasoningStep attribute - use description not content
f941008

renpas22 committed on

Fix SPECIAL_TOKENS usage - import at module level and use string literals
464ac9b

renpas22 committed on

Fix ReasoningChain dataclass - add image field and defaults, fix collate function
3024a91

renpas22 committed on

Fix None image handling in collate function
d29f3e7

renpas22 committed on

Convert learning_rate to float explicitly
9e7779a

renpas22 committed on

Remove dead code with direct config access
ccd696b

renpas22 committed on

Add getattr defaults for all config parameters
cd76323

renpas22 committed on

Fix FineVision dataset loading with subset parameter
e47ae2c

renpas22 committed on

Implement full SFT, PRM, and RL training with dataset loading
84a183c

renpas22 committed on

Make train_prm and train_rl placeholders - dataset loading needs HF integration
7bff7cb

renpas22 committed on

Add **kwargs to train_prm and train_rl to accept config parameters
917e40e

renpas22 committed on

Fix train_prm and train_rl signatures to accept max_steps and learning_rate
41bcc92

renpas22 committed on

Add placeholder train_sft method
85ab8c2

renpas22 committed on

Add type conversion for RLConfig parameters
f15c2d7

renpas22 committed on

Add type conversion and debug logging for config values
c74a578

renpas22 committed on

Fix gradient checkpointing for VLM models
0326431

renpas22 committed on

Fix tokenize() calls to use actual tokenizer
8268436

renpas22 committed on

Fix len() calls to use actual_tokenizer instead of processor
4605c1b

renpas22 committed on

Fix tokenizer access for Processor objects
b8bd3e8

renpas22 committed on

Add VLM support to trainer with auto-detection
e20135f

renpas22 committed on

Add training scripts and configs
2b8876a

renpas22 committed on