7 41 8

Shangqing Tu

tsq2000

AI & ML interests

None yet

Recent Activity

upvoted a paper 22 days ago

IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse

upvoted a paper 2 months ago

The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models

upvoted a paper 3 months ago

Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

View all activity

Organizations

upvoted a paper 22 days ago

IndexCache: Accelerating Sparse Attention via Cross-Layer Index Reuse

Paper • 2603.12201 • Published 22 days ago • 52

upvoted a paper 2 months ago

The Flexibility Trap: Why Arbitrary Order Limits Reasoning Potential in Diffusion Language Models

Paper • 2601.15165 • Published Jan 21 • 73

upvoted a paper 3 months ago

Chaining the Evidence: Robust Reinforcement Learning for Deep Search Agents with Citation-Aware Rubric Rewards

Paper • 2601.06021 • Published Jan 9 • 47

liked a model 4 months ago

zai-org/GLM-4.6V

Image-Text-to-Text • Updated Dec 9, 2025 • 92.4k • • 388

upvoted 3 papers 4 months ago

upvoted 2 papers 5 months ago

Omni-Reward: Towards Generalist Omni-Modal Reward Modeling with Free-Form Preferences

Paper • 2510.23451 • Published Oct 27, 2025 • 28

Glyph: Scaling Context Windows via Visual-Text Compression

Paper • 2510.17800 • Published Oct 20, 2025 • 69

upvoted a paper 6 months ago

Boundary-Guided Policy Optimization for Memory-efficient RL of Diffusion Large Language Models

Paper • 2510.11683 • Published Oct 13, 2025 • 15

upvoted a collection 6 months ago

LLaDA-8B-BGPO

Collection

Boundary-Guided Policy Optimization for Memory-Efficient RL of Diffusion Large Language Models • 4 items • Updated Oct 11, 2025 • 4

New activity in THU-KEG/DeepPrune-Judge-4B 6 months ago

Update license metadata and add paper abstract

#1 opened 6 months ago by

nielsr

updated a collection 6 months ago

DeepPrune

Collection

Parallel Scaling without Inter-trace Redundancy • 3 items • Updated Oct 10, 2025 • 2

upvoted a paper 6 months ago

DeepPrune: Parallel Scaling without Inter-trace Redundancy

Paper • 2510.08483 • Published Oct 9, 2025 • 24

commented a paper 6 months ago

DeepPrune: Parallel Scaling without Inter-trace Redundancy

Paper • 2510.08483 • Published Oct 9, 2025 • 24 •

updated a dataset 6 months ago

THU-KEG/DeepPrune

Preview • Updated Oct 10, 2025 • 4 • 2

updated a collection 6 months ago

DeepPrune

Collection

Parallel Scaling without Inter-trace Redundancy • 3 items • Updated Oct 10, 2025 • 2

updated a model 6 months ago

THU-KEG/DeepPrune-Judge-4B

Text Classification • Updated Oct 11, 2025 • 3 • 2

published a model 6 months ago

THU-KEG/DeepPrune-Judge-4B

Text Classification • Updated Oct 11, 2025 • 3 • 2

Shangqing Tu

AI & ML interests

Recent Activity

Organizations

tsq2000's activity

Update license metadata and add paper abstract