ctm-maze-large

This repository contains a CTM trained on 39x39 mazes, as described in our paper Continuous Thought Machines.

Model Details

Model Description

This CTM was trained on 2D maze navigation using a setup that requires predicting sequences of actions (left, right, up, down or padding) without any positional embeddings. The model builds internal spatial representations by observing the maze, as discussed in our paper. Interestingly, the CTM can explore paths beyond its 100-step training limit and successfully navigates much larger 99×99 mazes by repeatedly applying its learned strategy, indicating it has learned a general approach to maze-solving rather than simply memorizing specific routes.

Usage

This model is provided for research and development purposes only and should be considered as an experimental prototype. It is not intended for commercial use or deployment in mission-critical environments. Use of this model is at the user's own risk, and its performance and outcomes are not guaranteed. Sakana AI shall not be liable for any direct, indirect, special, incidental, or consequential damages, or any loss arising from the use of this model, regardless of the results obtained. Users must fully understand the risks associated with the use of this model and use it at their own discretion.

Downloads last month
30
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Collection including SakanaAI/ctm-maze-large