timaeus
/

jaxgmg_open_alpha0_gamma_sweep

Model card Files Files and versions

YAML Metadata Warning:empty or missing yaml metadata in repo card

Check out the documentation for more information.

OBSOLETE

An early sweep over discount factor to see changes in behaviour.

These models were originally used for RL1, but were trained with previous action, and with the variable learning rate bug. Do not use.

Downloads last month: -; Downloads are not tracked for this model. How to track

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including timaeus/jaxgmg_open_alpha0_gamma_sweep

Project: RL1/RL2 (obsolete)

Older models that are no longer useful for anything in RL1 or RL2, or are now unused as experimentation discontinued. • 16 items • Updated about 23 hours ago