Apolinário from multimodal AI art's picture

Apolinário from multimodal AI art PRO

multimodalart

·

https://multimodal.art

AI & ML interests

None yet

Recent Activity

liked a model about 22 hours ago

AmirKerr/ThisPerson

liked a model 1 day ago

Hcompany/Holo2-30B-A3B

liked a Space 1 day ago

multimodalart/Eigen-Banana

View all activity

Organizations

upvoted an article 3 days ago

Article

We’re open-sourcing our text-to-image model and the process behind it

3 days ago

•

32

upvoted an article 16 days ago

Article

What makes good reasoning data

16 days ago

•

31

upvoted a paper 16 days ago

The Principles of Diffusion Models

Paper • 2510.21890 • Published 23 days ago • 56

upvoted a collection 16 days ago

Emu3.5

Native Multimodal Models are World Learners 🌍 • 4 items • Updated 2 days ago • 70

upvoted an article 17 days ago

Article

Granite 4.0 Nano: Just how small can you go?

18 days ago

•

114

upvoted a paper 17 days ago

Group Relative Attention Guidance for Image Editing

Paper • 2510.24657 • Published 18 days ago • 23

upvoted a paper 28 days ago

From Pixels to Words -- Towards Native Vision-Language Primitives at Scale

Paper • 2510.14979 • Published 30 days ago • 65

upvoted an article about 1 month ago

Article

Model statistics of the 50 most downloaded entities on Hugging Face

Oct 13

•

29

upvoted 3 papers about 1 month ago

Phoenix-VAD: Streaming Semantic Endpoint Detection for Full-Duplex Speech Interaction

Paper • 2509.20410 • Published Sep 24 • 2

Self-Forcing++: Towards Minute-Scale High-Quality Video Generation

Paper • 2510.02283 • Published Oct 2 • 92

EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing

Paper • 2509.26346 • Published Sep 30 • 18

upvoted 2 articles about 2 months ago

Article

There is no such thing as a tokenizer-free lunch

Sep 25

•

87

Article

Public AI on Hugging Face Inference Providers 🔥

Sep 17

•

22

upvoted a collection 2 months ago

Qwen3-Next

4 items • Updated Sep 22 • 152

upvoted a paper 2 months ago

Qwen-Image Technical Report

Paper • 2508.02324 • Published Aug 4 • 261

upvoted a collection 2 months ago

Kimi-K2

Moonshot's MoE LLMs with 1 trillion parameters, exceptional on agentic intellegence • 5 items • Updated about 22 hours ago • 144

upvoted an article 2 months ago

Article

Make your ZeroGPU Spaces go brrr with ahead-of-time compilation

Sep 2

•

69

upvoted a collection 3 months ago

DeepSeek-V3.1

4 items • Updated Sep 22 • 245

upvoted 2 papers 3 months ago

MoDA: Multi-modal Diffusion Architecture for Talking Head Generation

Paper • 2507.03256 • Published Jul 4 • 2

FantasyTalking2: Timestep-Layer Adaptive Preference Optimization for Audio-Driven Portrait Animation

Paper • 2508.11255 • Published Aug 15 • 11