YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

OBSOLETE

An early sweep over discount factor and alpha to see changes in behaviour.

These models were originally used for RL1, but were trained with previous action, and with the variable learning rate bug. Do not use.


Interesting models, see wandb sweep

Trained on (1-alpha) * corner + alpha * any
al_0.68_g_0.95_any
al_0.68_g_0.975_any
al_0.68_g_0.99_any
al_0.47_g_0.99_any

Trained on (1-alpha) * corner + alpha * row
al_0.68_g_0.95_row
al_0.68_g_0.975_row
al_0.68_g_0.99_row
al_0.47_g_0.95_row
al_0.47_g_0.975_row
al_0.47_g_0.99_row
al_0.33_g_0.99_row
al_0.22_g_0.95_row
al_0.22_g_0.99_row
al_0.15_g_0.99_row
al_0.05_g_0.975_row
Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including timaeus/jaxgmg_al_g_sweep