Jack Voide
Mindweller
AI & ML interests
None yet
Recent Activity
liked a Space about 2 hours ago
ACE-Step/Ace-Step-v1.5 reacted to DedeProGames's post with ๐ about 2 hours ago
๐ฅ GRM2 - The small one that surpasses the big ones.
What if a 3-parameter model can beat a 32-parameter model in every benchmark? We prove that it can.
GRM2 is a 3b params model based on the llama architecture, trained for long reasoning and high performance in complex tasks - the first 3b params model to outperform qwen3-32b in ALL benchmarks, and outperform o3-mini in almost all benchmarks.
๐ค Model: https://huggingface.co/OrionLLM/GRM2-3b
The first 3b params model to generate over 1000 lines of code and achieve a score of 39.0 in xBench-DeepSearch-2510.
๐ Chat with GRM:
https://huggingface.co/spaces/DedeProGames/GRM2-Chat
๐ Download official GGUFs: https://huggingface.co/OrionLLM/GRM2-3b-GGUF
reacted to reaperdoesntknow's post with ๐ 1 day ago
Your Loss Function Has Singularities. Classical Calculus Can't See Them.
Introducing Discrepancy Calculus (DISC) โ treating training singularities as structure, not noise.
Loss plateaus, mode collapse, catastrophic forgetting, distilled models that know things the teacher never taught โ we engineer around these. But what if those singularities are the actual structure of the learning problem?
The core insight: Every BV function decomposes into smooth (what classical calculus handles), jump (capability emergence, loss plateaus breaking), and Cantor (ghost imprinting โ knowledge transferring through weight-space topology, not gradient signal). Classical analysis sees only the first. DISC sees all three.
The paper proves this isn't alternative notation โ it's strictly larger. The Meta-Discrepancy Theorem: where singularities exist, the classical FTC/MVT/chain-rule package is provably impossible.
What it explains:
TopologicalQwen exhibited literary reasoning from physics-only data โ the Cantor part explains how. DualMind's ExploreโExamineโResponse loop operationalizes DISC as inference dynamics. 50 models, 35K+ downloads, all built on this framework.
Paper: Discrepancy Calculus: Foundations and Core Theory (DOI: 10.57967/hf/8194) โ 8 axioms, proofs, computational recipes.
Series: Structure Over Scale (DOI: 10.57967/hf/8165) โ Three Teachers to Dual Cognition (DOI: 10.57967/hf/8184) โ DISC Foundations
โ Roy S. Colca Jr., Convergent Intelligence LLC: Research DivisionOrganizations
None yet