dei-model / src /reasoning /rl_trainer.py

Commit History

Skip .to(device) for quantized models with device_map
fa9e543

renpas22 commited on

Add training scripts and configs
2b8876a

renpas22 commited on