5 60 178

KrisKale45

AI & ML interests

None yet

Recent Activity

liked a model 1 day ago

NexaAI/parakeet-tdt-0.6b-v3-npu

liked a model 2 days ago

ResembleAI/Dramabox

liked a model 4 days ago

Gryphe/Pantheon-RP-1.6-12b-Nemo

View all activity

Organizations

None yet

upvoted a collection 4 days ago

Bonsai

Collection

1-bit Bonsai models • 7 items • Updated 29 days ago • 195

upvoted an article 27 days ago

Article

Building a Fast Multilingual OCR Model with Synthetic Data

nvidia

•

29 days ago

• 33

upvoted 2 articles about 1 month ago

Article

Using OCR models with llama.cpp

ggml-org

•

Apr 10

• 28

Article

Waypoint-1.5: Higher-Fidelity Interactive Worlds for Everyday GPUs

lapp0, LouisCastricato, ScottieFox, shahbuland, xAesthetics

•

Apr 9

• 29

upvoted a collection about 1 month ago

Nemotron OCR and Object Detection

Collection

4 items • Updated 8 days ago • 16

upvoted an article about 2 months ago

Article

Build a Domain-Specific Embedding Model in Under a Day

nvidia

•

Mar 20

• 73

upvoted a collection 2 months ago

VibeVoice

Collection

Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 8 items • Updated Mar 2 • 244

upvoted a collection 4 months ago

LightOnOCR-2 🦉

Collection

LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family • 12 items • Updated Apr 7 • 24

upvoted 2 articles 4 months ago

Article

LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family

lightonai

•

Jan 19

• 93

Article

We Got Claude to Build CUDA Kernels and teach open models!

burtenshaw, evalstate, merve, pcuenq

•

Jan 28

• 156

upvoted a collection 5 months ago

Seamless Communication

Collection

A significant step towards removing language barriers through expressive, fast and high-quality AI translation. • 16 items • Updated Jan 16, 2024 • 158

upvoted an article 5 months ago

Article

LLM based Audio models

YatharthS

•

Dec 18, 2025

• 58

upvoted a paper 5 months ago

In-Video Instructions: Visual Signals as Generative Control

Paper • 2511.19401 • Published Nov 24, 2025 • 32

upvoted a paper 6 months ago

VisPlay: Self-Evolving Vision-Language Models from Images

Paper • 2511.15661 • Published Nov 19, 2025 • 44

upvoted a paper 8 months ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4, 2025 • 199

upvoted a paper 9 months ago

rStar2-Agent: Agentic Reasoning Technical Report

Paper • 2508.20722 • Published Aug 28, 2025 • 118

upvoted an article 9 months ago

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

reach-vb, pcuenq, lewtun, clem, Rocketknight1, clefourrier, celinah, Wauplin, marcsun13, pagezyhf, ahadnagy, joaogante

•

Aug 5, 2025

• 513

upvoted 3 papers 10 months ago

VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning

Paper • 2507.13348 • Published Jul 17, 2025 • 79

Replacing thinking with tool usage enables reasoning in small language models

Paper • 2507.05065 • Published Jul 7, 2025 • 17

Robust 3D-Masked Part-level Editing in 3D Gaussian Splatting with Regularized Score Distillation Sampling

Paper • 2507.11061 • Published Jul 15, 2025 • 37

KrisKale45

AI & ML interests

Recent Activity

Organizations

KrisKale45's activity

Building a Fast Multilingual OCR Model with Synthetic Data

Using OCR models with llama.cpp

Waypoint-1.5: Higher-Fidelity Interactive Worlds for Everyday GPUs

Build a Domain-Specific Embedding Model in Under a Day

LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family

We Got Claude to Build CUDA Kernels and teach open models!

LLM based Audio models

Welcome GPT OSS, the new open-source model family from OpenAI!