Included gradient checkpointing
#1 by FJFehr - opened
This is a minor change that adds support for gradient checkpointing. Checkpointing trades some extra compute in the backward pass for lower activation memory, which allows larger batch sizes when fine-tuning these models.
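For readers unfamiliar with the technique, here is a minimal, framework-free sketch of what gradient checkpointing does. It is not the code in this PR: the toy `tanh` layer, the function names, and the `segment` parameter are all assumptions made for illustration. The point is only that keeping a few checkpoints and recomputing the rest gives the same gradients while holding fewer activations at once.

```python
import math


def layer(w, x):
    """Toy scalar layer (hypothetical): y = tanh(w * x)."""
    return math.tanh(w * x)


def layer_grads(w, x, grad_out):
    """Return (dL/dw, dL/dx) for y = tanh(w * x), given dL/dy."""
    y = math.tanh(w * x)
    local = (1.0 - y * y) * grad_out
    return local * x, local * w


def backward_full(weights, x0):
    """Standard backprop: store the input of every layer."""
    inputs = []
    x = x0
    for w in weights:
        inputs.append(x)
        x = layer(w, x)
    out = x

    grad = 1.0  # dL/d(out), taking the loss to be the output itself
    grads = [0.0] * len(weights)
    for i in reversed(range(len(weights))):
        grads[i], grad = layer_grads(weights[i], inputs[i], grad)
    return out, grads, len(inputs)


def backward_checkpointed(weights, x0, segment=4):
    """Checkpointed backprop: store one input per segment, recompute the rest."""
    # Forward pass: keep only the input at the start of each segment.
    checkpoints = {}
    x = x0
    for i, w in enumerate(weights):
        if i % segment == 0:
            checkpoints[i] = x
        x = layer(w, x)
    out = x

    grad = 1.0
    grads = [0.0] * len(weights)
    peak_stored = len(checkpoints)

    # Backward pass: walk the segments from last to first.
    for start in sorted(checkpoints, reverse=True):
        end = min(start + segment, len(weights))

        # Recompute this segment's layer inputs from its checkpoint.
        inputs = []
        x = checkpoints[start]
        for i in range(start, end):
            inputs.append(x)
            x = layer(weights[i], x)

        # Count the checkpoints plus the recomputed inputs held right now.
        peak_stored = max(peak_stored, len(checkpoints) + len(inputs))

        for i in reversed(range(start, end)):
            grads[i], grad = layer_grads(weights[i], inputs[i - start], grad)

    return out, grads, peak_stored
```

With 16 layers and `segment=4`, both functions return the same gradients, but the checkpointed version holds at most 8 stored values at a time instead of 16, at the cost of running each layer's forward computation a second time. In Hugging Face Transformers, models that support the feature typically turn it on with `model.gradient_checkpointing_enable()` before training.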