dei-model / src /reasoning /step_level_cot.py

Commit History

Add HuggingFace Hub checkpoint persistence - upload and download checkpoints between jobs

3b46388

renpas22 commited on Dec 18, 2025

Add checkpoint resumption - automatically resume from latest checkpoint

5419afd

renpas22 commited on Dec 18, 2025

Fix ReasoningStep attribute - use description not content

f941008

renpas22 commited on Dec 18, 2025

Fix SPECIAL_TOKENS usage - import at module level and use string literals

464ac9b

renpas22 commited on Dec 18, 2025

Fix ReasoningChain dataclass - add image field and defaults, fix collate function

3024a91

renpas22 commited on Dec 18, 2025

Fix None image handling in collate function

d29f3e7

renpas22 commited on Dec 18, 2025

Convert learning_rate to float explicitly

9e7779a

renpas22 commited on Dec 18, 2025

Remove dead code with direct config access

ccd696b

renpas22 commited on Dec 18, 2025

Add getattr defaults for all config parameters

cd76323

renpas22 commited on Dec 18, 2025

Fix FineVision dataset loading with subset parameter

e47ae2c

renpas22 commited on Dec 18, 2025

Implement full SFT, PRM, and RL training with dataset loading

84a183c

renpas22 commited on Dec 18, 2025

Make train_prm and train_rl placeholders - dataset loading needs HF integration

7bff7cb

renpas22 commited on Dec 18, 2025

Add **kwargs to train_prm and train_rl to accept config parameters

917e40e

renpas22 commited on Dec 18, 2025

Fix train_prm and train_rl signatures to accept max_steps and learning_rate

41bcc92

renpas22 commited on Dec 18, 2025

Add placeholder train_sft method

85ab8c2

renpas22 commited on Dec 18, 2025

Add type conversion for RLConfig parameters

f15c2d7

renpas22 commited on Dec 18, 2025

Add type conversion and debug logging for config values

c74a578

renpas22 commited on Dec 18, 2025

Fix gradient checkpointing for VLM models

0326431

renpas22 commited on Dec 17, 2025

Fix tokenize() calls to use actual tokenizer

8268436

renpas22 commited on Dec 17, 2025

Fix len() calls to use actual_tokenizer instead of processor

4605c1b

renpas22 commited on Dec 12, 2025

Fix tokenizer access for Processor objects

b8bd3e8

renpas22 commited on Dec 12, 2025

Add VLM support to trainer with auto-detection

e20135f

renpas22 commited on Dec 12, 2025

Add training scripts and configs

2b8876a

renpas22 commited on Dec 12, 2025

Commit History

Add HuggingFace Hub checkpoint persistence - upload and download checkpoints between jobs 3b46388

Add checkpoint resumption - automatically resume from latest checkpoint 5419afd

Fix ReasoningStep attribute - use description not content f941008

Fix SPECIAL_TOKENS usage - import at module level and use string literals 464ac9b

Fix ReasoningChain dataclass - add image field and defaults, fix collate function 3024a91

Fix None image handling in collate function d29f3e7

Convert learning_rate to float explicitly 9e7779a

Remove dead code with direct config access ccd696b

Add getattr defaults for all config parameters cd76323

Fix FineVision dataset loading with subset parameter e47ae2c

Implement full SFT, PRM, and RL training with dataset loading 84a183c

Make train_prm and train_rl placeholders - dataset loading needs HF integration 7bff7cb

Add **kwargs to train_prm and train_rl to accept config parameters 917e40e

Fix train_prm and train_rl signatures to accept max_steps and learning_rate 41bcc92

Add placeholder train_sft method 85ab8c2

Add type conversion for RLConfig parameters f15c2d7

Add type conversion and debug logging for config values c74a578

Fix gradient checkpointing for VLM models 0326431

Fix tokenize() calls to use actual tokenizer 8268436

Fix len() calls to use actual_tokenizer instead of processor 4605c1b

Fix tokenizer access for Processor objects b8bd3e8

Add VLM support to trainer with auto-detection e20135f

Add training scripts and configs 2b8876a

Add HuggingFace Hub checkpoint persistence - upload and download checkpoints between jobs

3b46388

Add checkpoint resumption - automatically resume from latest checkpoint

5419afd

Fix ReasoningStep attribute - use description not content

f941008

Fix SPECIAL_TOKENS usage - import at module level and use string literals

464ac9b

Fix ReasoningChain dataclass - add image field and defaults, fix collate function

3024a91

Fix None image handling in collate function

d29f3e7

Convert learning_rate to float explicitly

9e7779a

Remove dead code with direct config access

ccd696b

Add getattr defaults for all config parameters

cd76323

Fix FineVision dataset loading with subset parameter

e47ae2c

Implement full SFT, PRM, and RL training with dataset loading

84a183c

Make train_prm and train_rl placeholders - dataset loading needs HF integration

7bff7cb

Add **kwargs to train_prm and train_rl to accept config parameters

917e40e

Fix train_prm and train_rl signatures to accept max_steps and learning_rate

41bcc92

Add placeholder train_sft method

85ab8c2

Add type conversion for RLConfig parameters

f15c2d7

Add type conversion and debug logging for config values

c74a578

Fix gradient checkpointing for VLM models

0326431

Fix tokenize() calls to use actual tokenizer

8268436

Fix len() calls to use actual_tokenizer instead of processor

4605c1b

Fix tokenizer access for Processor objects

b8bd3e8

Add VLM support to trainer with auto-detection

e20135f

Add training scripts and configs

2b8876a