Guy DuGan II
gss1147
AI & ML interests
AI code development, AI chatbots, AI research, LLM fine-tuning, LLM merging, AI development
Genesis AI Code Series (Data-Sets)
This collection focuses on tests-as-truth evaluation, diff-based coding, and agentic workflow learning.
“Supernatural & Spiritual” (Data-Sets)
Does your “LLM” believe in (Magic)?
“Recursive AI & Seed AI” (Data-Sets)
A frontier collection of high-signal datasets designed to train recursive learning systems, self-improving models, and Seed AI architectures.
“Ghost Coder” (Data-Sets)
Is your “LLM” scared of GHOST?
“Ancient Civilization” (Data-Sets)
A structured collection of high-signal datasets designed to train AI systems in ancient history, archaeology, and early human civilization reasoning.
Digital Audio Workstations & PlugIns (Data-Sets)
Datasets designed to train AI systems in Digital Audio Workstation (DAW) workflows, audio plugin behavior, and music production systems reasoning.
“Closed Source LLMs & Distill” (Data-Sets)
Use these datasets to create “distill” versions of your favorite closed-source LLMs (e.g., Grok, Gemini, ChatGPT, Claude).
- WithinUsAI/Opus4.7_thinking_max_distill_god_seed_25k • 25k • 377 • 8
- WithinUsAI/Grok4.4_heavy_max_distill_god_seed_25k • 25.7k • 826 • 4
- WithinUsAI/GeminiPro3.2_max_distill_god_seed_25k • 25k • 833 • 3
- WithinUsAI/GPT5.5_thinking_max_distill_god_seed_25K • 25k • 643 • 11
“Agentic & Tool Calling” (Data-Sets)
A frontier collection of high-signal datasets designed to train AI systems in agentic reasoning, tool usage, and real-world function-calling workflows.
“Inventor & Scientist Mastermind” (Data-Sets)
Scientific and invention-focused datasets designed to train AI systems in discovery-driven reasoning, experimentation logic, and innovation modeling.
“GOD Coder” (Data-Sets)
A frontier-scale collection of high-density software engineering datasets designed to train AI systems into production-grade coding intelligence.
“Masters Scholar 25k” (Data-Sets)
A structured collection of high-density academic training datasets designed for master-level AI reasoning and domain expertise development.
TRM-style Recursive Seed (Data-Sets)
Datasets designed to train recursive reasoning systems, Tiny Recursive Models (TRM), and Seed AI-style self-improving architectures.