Dmitry Balobin's picture

Open to Work

Dmitry Balobin

d0rj

·

AI & ML interests

NLP and 🥴 tensors. MIPT PhD student 💙, Yandex Alice ❤️

Recent Activity

liked a model 4 days ago

nvidia/LocateAnything-3B

liked a model 23 days ago

ussoewwin/Flash-Attention-2_for_Windows

liked a dataset 23 days ago

LLM-Digital-Twin/Twin-2K-500

View all activity

Organizations

upvoted a collection 3 months ago

GigaChat 3.1

6 items • Updated Mar 24 • 61

upvoted 2 collections 4 months ago

Tiny-A2D

Small diffusion language models adapted from AR models • 4 items • Updated Dec 6, 2025 • 20

Preference datasets

6 items • Updated Jan 8, 2025 • 2

upvoted a paper 4 months ago

Pisets: A Robust Speech Recognition System for Lectures and Interviews

Paper • 2601.18415 • Published Jan 26 • 36

upvoted 3 collections 5 months ago

T5Gemma 2

3 items • Updated Mar 12 • 78

GigaCheck

2 items • Updated Jan 13 • 3

Ai photo Editors

26 items • Updated Feb 6 • 8

upvoted 2 articles 6 months ago

Article

Nemotron 3 Nano \- A new Standard for Efficient, Open, and Intelligent Agentic Models

nvidia

•

Dec 15, 2025

• 111

Article

We Got Claude to Fine-Tune an Open Source LLM

burtenshaw, evalstate

•

Dec 4, 2025

• 629

upvoted a collection 9 months ago

Kandinsky 5.0

9 items • Updated Nov 7, 2025 • 32

upvoted a collection 10 months ago

MiMo-VL

6 items • Updated Dec 17, 2025 • 44

upvoted a collection 11 months ago

3-layer

очень быстрые модели • 6 items • Updated Apr 14 • 1

upvoted a paper 12 months ago

SPLADE v2: Sparse Lexical and Expansion Model for Information Retrieval

Paper • 2109.10086 • Published Sep 21, 2021 • 3

upvoted an article 12 months ago

Article

Learn the Hugging Face Kernel Hub in 5 Minutes

+5

drbh, danieldk, Narsil, pcuenq, pagezyhf, merve, reach-vb

•

Jun 12, 2025

• 164

upvoted a paper about 1 year ago

Will It Still Be True Tomorrow? Multilingual Evergreen Question Classification to Improve Trustworthy QA

Paper • 2505.21115 • Published May 27, 2025 • 144

upvoted 5 collections about 1 year ago

Eso-LMs

Esoteric Language Models • 3 items • Updated Jun 3, 2025 • 6

Qwen3

84 items • Updated Dec 31, 2025 • 1.81k

blt

4 items • Updated Apr 17, 2025 • 29

SANA-Sprint

🏃SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation • 6 items • Updated Mar 10 • 45

Sana

⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 22 items • Updated Mar 10 • 106