15 711 281

Taufiq Dwi Purnomo

taufiqdp

https://taufiqdp.com

AI & ML interests

SLM, VLM

Recent Activity

upvoted a paper 5 days ago

OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM

upvoted a paper 5 days ago

Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning

upvoted a paper 16 days ago

Robot Learning: A Tutorial

View all activity

Organizations

upvoted 2 papers 5 days ago

OmniVinci: Enhancing Architecture and Data for Omni-Modal Understanding LLM

Paper • 2510.15870 • Published 15 days ago • 85

Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning

Paper • 2510.19338 • Published 10 days ago • 101

upvoted 3 papers 16 days ago

Robot Learning: A Tutorial

Paper • 2510.12403 • Published 18 days ago • 96

Diffusion Transformers with Representation Autoencoders

Paper • 2510.11690 • Published 19 days ago • 160

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published 19 days ago • 169

upvoted a paper 26 days ago

Apriel-1.5-15b-Thinker

Paper • 2510.01141 • Published about 1 month ago • 113

liked a model about 1 month ago

deepseek-ai/DeepSeek-V3.2-Exp

Text Generation • 685B • Updated 23 days ago • 105k • • 759

upvoted a paper about 1 month ago

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Paper • 2509.22186 • Published Sep 26 • 127

upvoted a collection about 1 month ago

Qwen3-Omni

Collection

6 items • Updated 23 days ago • 162

upvoted 2 papers about 2 months ago

WebWeaver: Structuring Web-Scale Evidence with Dynamic Outlines for Open-Ended Deep Research

Paper • 2509.13312 • Published Sep 16 • 105

Sharing is Caring: Efficient LM Post-Training with Collective RL Experience Sharing

Paper • 2509.08721 • Published Sep 10 • 672

upvoted an article about 2 months ago

Article

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

Sep 11

• 161

upvoted a paper about 2 months ago

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4 • 189

liked a model about 2 months ago

google/embeddinggemma-300m

upvoted an article about 2 months ago

Article

Welcome EmbeddingGemma, Google's new efficient embedding model

Sep 4

• 252

upvoted a paper about 2 months ago

A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers

Paper • 2508.21148 • Published Aug 28 • 140

upvoted a paper 2 months ago

rStar2-Agent: Agentic Reasoning Technical Report

Paper • 2508.20722 • Published Aug 28 • 113

upvoted a collection 2 months ago

InternVL3.5

Collection

This collection includes all released checkpoints of InternVL3.5, covering different training stages (e.g., Pretraining, SFT, MPO, Cascade RL). • 54 items • Updated Sep 28 • 101

upvoted a paper 2 months ago

InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency

Paper • 2508.18265 • Published Aug 25 • 202

liked a model 2 months ago

xai-org/grok-2

Updated Aug 24 • 9.53k • 976