ciaran-regan-ie commited on
Commit
dbe870b
·
verified ·
1 Parent(s): 0d0f0ef

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +25 -6
README.md CHANGED
@@ -1,11 +1,30 @@
1
  ---
2
  license: apache-2.0
 
3
  tags:
4
- - model_hub_mixin
5
- - pytorch_model_hub_mixin
 
 
 
6
  ---
7
 
8
- This model has been pushed to the Hub using the [PytorchModelHubMixin](https://huggingface.co/docs/huggingface_hub/package_reference/mixins#huggingface_hub.PyTorchModelHubMixin) integration:
9
- - Code: ciaran-regan-ie/continuous-thought-machines
10
- - Paper: [More Information Needed]
11
- - Docs: [More Information Needed]
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
2
  license: apache-2.0
3
+ pipeline_tag: text-generation
4
  tags:
5
+ - reasoning
6
+ - maze
7
+ - rnn
8
+ - recurrent
9
+ - ctm
10
  ---
11
 
12
+ # ctm-maze-large
13
+
14
+ This repository contains a CTM trained on 39x39 mazes, as described in our paper [Continuous Thought Machines](https://arxiv.org/abs/2505.05522).
15
+
16
+ ## Model Details
17
+
18
+ - **Developed by:** [Sakana AI](https://sakana.ai/)
19
+ - **Model type:** Continuous Thought Machine
20
+ - **License:** Apache License, Version 2.0
21
+ - **Paper:** https://arxiv.org/abs/2505.05522
22
+ - **Code:** https://github.com/SakanaAI/continuous-thought-machines
23
+
24
+ ## Model Description
25
+
26
+ This CTM was trained on 2D maze navigation using a setup that requires predicting sequences of actions (left, right, up, down or padding) without any positional embeddings. The model builds internal spatial representations by observing the maze, as discussed in our [paper](https://arxiv.org/abs/2505.05522). Interestingly, the CTM can explore paths beyond its 100-step training limit and successfully navigates much larger 99×99 mazes by repeatedly applying its learned strategy, indicating it has learned a general approach to maze-solving rather than simply memorizing specific routes.
27
+
28
+ ## Usage
29
+
30
+ This model is provided for research and development purposes only and should be considered as an experimental prototype. It is not intended for commercial use or deployment in mission-critical environments. Use of this model is at the user's own risk, and its performance and outcomes are not guaranteed. Sakana AI shall not be liable for any direct, indirect, special, incidental, or consequential damages, or any loss arising from the use of this model, regardless of the results obtained. Users must fully understand the risks associated with the use of this model and use it at their own discretion.