12 722 45

Vlad Bogolin

vladbogo

https://vladbogo.com

AI & ML interests

LLMs, Computer Vision

Recent Activity

upvoted a paper 2 days ago

Canvas-to-Image: Compositional Image Generation with Multimodal Controls

updated a collection 2 days ago

AI Paper of the Day

upvoted a paper 2 days ago

NVIDIA Nemotron Parse 1.1

View all activity

Organizations

upvoted 2 papers 2 days ago

Canvas-to-Image: Compositional Image Generation with Multimodal Controls

Paper • 2511.21691 • Published 6 days ago • 26

NVIDIA Nemotron Parse 1.1

Paper • 2511.20478 • Published 7 days ago • 19

upvoted a paper 3 days ago

Latent Collaboration in Multi-Agent Systems

Paper • 2511.20639 • Published 7 days ago • 100

upvoted a paper 5 days ago

GeoVista: Web-Augmented Agentic Visual Reasoning for Geolocalization

Paper • 2511.15705 • Published 13 days ago • 90

upvoted a paper 6 days ago

General Agentic Memory Via Deep Research

Paper • 2511.18423 • Published 9 days ago • 150

upvoted a paper 7 days ago

OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General Recipe

Paper • 2511.16334 • Published 12 days ago • 89

upvoted a paper 8 days ago

V-ReasonBench: Toward Unified Reasoning Benchmark Suite for Video Generation Models

Paper • 2511.16668 • Published 12 days ago • 53

upvoted a paper 9 days ago

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Paper • 2511.16043 • Published 12 days ago • 98

upvoted a paper 10 days ago

SAM 3D: 3Dfy Anything in Images

Paper • 2511.16624 • Published 12 days ago • 102

upvoted a paper 12 days ago

What Does It Take to Be a Good AI Research Agent? Studying the Role of Ideation Diversity

Paper • 2511.15593 • Published 13 days ago • 54

upvoted 2 papers 13 days ago

ATLAS: A High-Difficulty, Multidisciplinary Benchmark for Frontier Scientific Reasoning

Paper • 2511.14366 • Published 14 days ago • 14

Uni-MoE-2.0-Omni: Scaling Language-Centric Omnimodal Large Model with Advanced MoE, Training and Data

Paper • 2511.12609 • Published 16 days ago • 101

upvoted 2 papers 15 days ago

WEAVE: Unleashing and Benchmarking the In-context Interleaved Comprehension and Generation

Paper • 2511.11434 • Published 18 days ago • 44

Depth Anything 3: Recovering the Visual Space from Any Views

Paper • 2511.10647 • Published 19 days ago • 90

upvoted a paper 17 days ago

Black-Box On-Policy Distillation of Large Language Models

Paper • 2511.10643 • Published 19 days ago • 46

upvoted a paper 18 days ago

Time-to-Move: Training-Free Motion Controlled Video Generation via Dual-Clock Denoising

Paper • 2511.08633 • Published 23 days ago • 53

upvoted a paper 19 days ago

Grounding Computer Use Agents on Human Demonstrations

Paper • 2511.07332 • Published 22 days ago • 103

upvoted a paper 20 days ago

HaluMem: Evaluating Hallucinations in Memory Systems of Agents

Paper • 2511.03506 • Published 27 days ago • 92

upvoted 2 papers 22 days ago

Too Good to be Bad: On the Failure of LLMs to Role-Play Villains

Paper • 2511.04962 • Published 25 days ago • 52

NVIDIA Nemotron Nano V2 VL

Paper • 2511.03929 • Published 27 days ago • 26

Vlad Bogolin

AI & ML interests

Recent Activity

Organizations

vladbogo's activity