-
WMPO: World Model-based Policy Optimization for Vision-Language-Action Models
Paper • 2511.09515 • Published • 18 -
Robot Learning from a Physical World Model
Paper • 2511.07416 • Published • 30 -
WorldMM: Dynamic Multimodal Memory Agent for Long Video Reasoning
Paper • 2512.02425 • Published • 23 -
MobileWorldBench: Towards Semantic World Modeling For Mobile Agents
Paper • 2512.14014 • Published • 2
Juan Rafael Paulino
JuanRafap
AI & ML interests
None yet
Recent Activity
updated
a collection
about 3 hours ago
Library
updated
a collection
3 days ago
Library
updated
a collection
3 days ago
Library
Organizations
None yet
Memory
-
Mixture of Contexts for Long Video Generation
Paper • 2508.21058 • Published • 35 -
Beyond Memorization: A Multi-Modal Ordinal Regression Benchmark to Expose Popularity Bias in Vision-Language Models
Paper • 2512.21337 • Published • 25 -
SCOPE: Prompt Evolution for Enhancing Agent Effectiveness
Paper • 2512.15374 • Published • 5
Dataset
-
DeepDistill: Enhancing LLM Reasoning Capabilities via Large-Scale Difficulty-Graded Data Training
Paper • 2504.17565 • Published • 2 -
AI-MO/NuminaMath-1.5
Viewer • Updated • 896k • 1.75k • 166 -
PrimeIntellect/synthetic-code-understanding
Viewer • Updated • 60.6k • 92 • 18 -
Go to Zero: Towards Zero-shot Motion Generation with Million-scale Data
Paper • 2507.07095 • Published • 55
Library
-
lusxvr/nanoVLM-222M
Image-Text-to-Text • 0.2B • Updated • 447 • 98 -
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Paper • 2503.09516 • Published • 36 -
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
Paper • 2505.24863 • Published • 97 -
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
Paper • 2505.17667 • Published • 88
Models
-
microsoft/bitnet-b1.58-2B-4T
Text Generation • 0.8B • Updated • 6.88k • 1.23k -
M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models
Paper • 2504.10449 • Published • 15 -
nvidia/Llama-3.1-Nemotron-8B-UltraLong-2M-Instruct
Text Generation • 8B • Updated • 343 • 15 -
ReTool: Reinforcement Learning for Strategic Tool Use in LLMs
Paper • 2504.11536 • Published • 63
Interés
-
WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning
Paper • 2411.02337 • Published • 36 -
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models
Paper • 2411.04996 • Published • 50 -
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Paper • 2411.03562 • Published • 69 -
StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization
Paper • 2410.08815 • Published • 47
Cumputer use
Bim
-
MeshCoder: LLM-Powered Structured Mesh Code Generation from Point Clouds
Paper • 2508.14879 • Published • 68 -
VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D Space
Paper • 2508.19247 • Published • 43 -
Pixie: Fast and Generalizable Supervised Learning of 3D Physics from Pixels
Paper • 2508.17437 • Published • 38 -
Multi-View 3D Point Tracking
Paper • 2508.21060 • Published • 23
Agent
-
SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models
Paper • 2506.04180 • Published • 33 -
AniMaker: Automated Multi-Agent Animated Storytelling with MCTS-Driven Clip Generation
Paper • 2506.10540 • Published • 37 -
AutoMind: Adaptive Knowledgeable Agent for Automated Data Science
Paper • 2506.10974 • Published • 19 -
SPAR: Scholar Paper Retrieval with LLM-based Agents for Enhanced Academic Search
Paper • 2507.15245 • Published • 11
Benchmark
-
Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis
Paper • 2505.13227 • Published • 45 -
facebook/natural_reasoning
Viewer • Updated • 1.15M • 1.39k • 546 -
nvidia/OpenMathReasoning
Viewer • Updated • 5.68M • 16k • 390 -
Search Arena: Analyzing Search-Augmented LLMs
Paper • 2506.05334 • Published • 17
Finance
-
OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain
Paper • 2412.13018 • Published • 41 -
Retrieval-augmented Large Language Models for Financial Time Series Forecasting
Paper • 2502.05878 • Published • 40 -
ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates
Paper • 2502.06772 • Published • 21 -
ELTEX: A Framework for Domain-Driven Synthetic Data Generation
Paper • 2503.15055 • Published • 6
World models
-
WMPO: World Model-based Policy Optimization for Vision-Language-Action Models
Paper • 2511.09515 • Published • 18 -
Robot Learning from a Physical World Model
Paper • 2511.07416 • Published • 30 -
WorldMM: Dynamic Multimodal Memory Agent for Long Video Reasoning
Paper • 2512.02425 • Published • 23 -
MobileWorldBench: Towards Semantic World Modeling For Mobile Agents
Paper • 2512.14014 • Published • 2
Cumputer use
Memory
-
Mixture of Contexts for Long Video Generation
Paper • 2508.21058 • Published • 35 -
Beyond Memorization: A Multi-Modal Ordinal Regression Benchmark to Expose Popularity Bias in Vision-Language Models
Paper • 2512.21337 • Published • 25 -
SCOPE: Prompt Evolution for Enhancing Agent Effectiveness
Paper • 2512.15374 • Published • 5
Bim
-
MeshCoder: LLM-Powered Structured Mesh Code Generation from Point Clouds
Paper • 2508.14879 • Published • 68 -
VoxHammer: Training-Free Precise and Coherent 3D Editing in Native 3D Space
Paper • 2508.19247 • Published • 43 -
Pixie: Fast and Generalizable Supervised Learning of 3D Physics from Pixels
Paper • 2508.17437 • Published • 38 -
Multi-View 3D Point Tracking
Paper • 2508.21060 • Published • 23
Dataset
-
DeepDistill: Enhancing LLM Reasoning Capabilities via Large-Scale Difficulty-Graded Data Training
Paper • 2504.17565 • Published • 2 -
AI-MO/NuminaMath-1.5
Viewer • Updated • 896k • 1.75k • 166 -
PrimeIntellect/synthetic-code-understanding
Viewer • Updated • 60.6k • 92 • 18 -
Go to Zero: Towards Zero-shot Motion Generation with Million-scale Data
Paper • 2507.07095 • Published • 55
Agent
-
SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models
Paper • 2506.04180 • Published • 33 -
AniMaker: Automated Multi-Agent Animated Storytelling with MCTS-Driven Clip Generation
Paper • 2506.10540 • Published • 37 -
AutoMind: Adaptive Knowledgeable Agent for Automated Data Science
Paper • 2506.10974 • Published • 19 -
SPAR: Scholar Paper Retrieval with LLM-based Agents for Enhanced Academic Search
Paper • 2507.15245 • Published • 11
Library
-
lusxvr/nanoVLM-222M
Image-Text-to-Text • 0.2B • Updated • 447 • 98 -
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
Paper • 2503.09516 • Published • 36 -
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
Paper • 2505.24863 • Published • 97 -
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning
Paper • 2505.17667 • Published • 88
Benchmark
-
Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis
Paper • 2505.13227 • Published • 45 -
facebook/natural_reasoning
Viewer • Updated • 1.15M • 1.39k • 546 -
nvidia/OpenMathReasoning
Viewer • Updated • 5.68M • 16k • 390 -
Search Arena: Analyzing Search-Augmented LLMs
Paper • 2506.05334 • Published • 17
Models
-
microsoft/bitnet-b1.58-2B-4T
Text Generation • 0.8B • Updated • 6.88k • 1.23k -
M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models
Paper • 2504.10449 • Published • 15 -
nvidia/Llama-3.1-Nemotron-8B-UltraLong-2M-Instruct
Text Generation • 8B • Updated • 343 • 15 -
ReTool: Reinforcement Learning for Strategic Tool Use in LLMs
Paper • 2504.11536 • Published • 63
Finance
-
OmniEval: An Omnidirectional and Automatic RAG Evaluation Benchmark in Financial Domain
Paper • 2412.13018 • Published • 41 -
Retrieval-augmented Large Language Models for Financial Time Series Forecasting
Paper • 2502.05878 • Published • 40 -
ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates
Paper • 2502.06772 • Published • 21 -
ELTEX: A Framework for Domain-Driven Synthetic Data Generation
Paper • 2503.15055 • Published • 6
Interés
-
WebRL: Training LLM Web Agents via Self-Evolving Online Curriculum Reinforcement Learning
Paper • 2411.02337 • Published • 36 -
Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models
Paper • 2411.04996 • Published • 50 -
Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Paper • 2411.03562 • Published • 69 -
StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization
Paper • 2410.08815 • Published • 47