9 151 161

Emanuele Vivoli

emanuelevivoli

https://www.emanuelevivoli.me

AI & ML interests

I work on Comics/Manga :)

Recent Activity

updated a dataset about 7 hours ago

emanuelevivoli/comix_v0_tiny_pages

updated a dataset about 9 hours ago

emanuelevivoli/comix_v0_tiny_books

published a dataset about 9 hours ago

emanuelevivoli/comix_v0_tiny_books

View all activity

Organizations

upvoted 2 articles 7 days ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

Feb 20

•

312

Article

Granite 4.0 Nano: Just how small can you go?

21 days ago

•

117

upvoted a paper 21 days ago

DeepAgent: A General Reasoning Agent with Scalable Toolsets

Paper • 2510.21618 • Published 25 days ago • 95

upvoted 2 articles about 1 month ago

Article

Vision Language Models (Better, faster, stronger)

May 12

•

565

Article

Preference Optimization for Vision Language Models

Jul 10, 2024

•

upvoted a collection about 1 month ago

Qwen3-VL

Collection

37 items • Updated 17 days ago • 411

upvoted a paper 3 months ago

Intern-S1: A Scientific Multimodal Foundation Model

Paper • 2508.15763 • Published Aug 21 • 256

upvoted an article 4 months ago

Article

Welcome GPT OSS, the new open-source model family from OpenAI!

Aug 5

•

506

upvoted 3 papers 4 months ago

Agentic Reinforced Policy Optimization

Paper • 2507.19849 • Published Jul 26 • 156

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 309

Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations

Paper • 2506.18898 • Published Jun 23 • 33

upvoted 2 collections 4 months ago

Tar

Collection

[NeurIPS 2025] Unifying Visual Understanding and Generation via Text-Aligned Representations • 5 items • Updated Sep 20 • 16

Open LLM Leaderboard best models ❤️‍🔥

Collection

A daily uploaded list of models with best evaluations on the LLM leaderboard: • 65 items • Updated Mar 20 • 648

upvoted an article 4 months ago

Article

Efficient MultiModal Data Pipeline

Jul 8

•

upvoted a paper 5 months ago

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16 • 271

upvoted 4 papers 6 months ago

Don't Look Only Once: Towards Multimodal Interactive Reasoning with Selective Visual Revisitation

Paper • 2505.18842 • Published May 24 • 36

Omni-R1: Reinforcement Learning for Omnimodal Reasoning via Two-System Collaboration

Paper • 2505.20256 • Published May 26 • 18

QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning

Paper • 2505.17667 • Published May 23 • 88

BLIP3-o: A Family of Fully Open Unified Multimodal Models-Architecture, Training and Dataset

Paper • 2505.09568 • Published May 14 • 98

upvoted a paper 7 months ago

Phi-4-reasoning Technical Report

Paper • 2504.21318 • Published Apr 30 • 52

Emanuele Vivoli

AI & ML interests

Recent Activity

Organizations

emanuelevivoli's activity

SmolVLM2: Bringing Video Understanding to Every Device

Granite 4.0 Nano: Just how small can you go?

Vision Language Models (Better, faster, stronger)

Preference Optimization for Vision Language Models

Welcome GPT OSS, the new open-source model family from OpenAI!

Efficient MultiModal Data Pipeline