Allen L

Allen-UQ

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 month ago

MASPRM: Multi-Agent System Process Reward Model

upvoted a paper about 1 month ago

DeepAgent: A General Reasoning Agent with Scalable Toolsets

upvoted a paper about 1 month ago

The Massive Legal Embedding Benchmark (MLEB)

Organizations

None yet

upvoted 6 papers about 1 month ago

BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping

Paper • 2510.18927 • Published Oct 21 • 82

Every Attention Matters: An Efficient Hybrid Architecture for Long-Context Reasoning

Paper • 2510.19338 • Published Oct 22 • 113

LoongRL: Reinforcement Learning for Advanced Reasoning over Long Contexts

Paper • 2510.19363 • Published Oct 22 • 61

upvoted a paper about 2 months ago

Self-Improving LLM Agents at Test-Time

Paper • 2510.07841 • Published Oct 9 • 9

updated a model 3 months ago

Allen-UQ/Qwen2.5-7B-Instruct-GRPO-Simple-step-150

8B • Updated Sep 16 • 9

published a model 3 months ago

Allen-UQ/Qwen2.5-7B-Instruct-GRPO-Simple-step-150

8B • Updated Sep 16 • 9

updated a model 3 months ago

Allen-UQ/Qwen2.5-7B-Instruct-GRPO-CHW-Bal-step-260

8B • Updated Sep 11 • 1

published a model 3 months ago

Allen-UQ/Qwen2.5-7B-Instruct-GRPO-CHW-Bal-step-260

8B • Updated Sep 11 • 1

updated 2 datasets 3 months ago

Allen-UQ/wisconsin_all_nodes

Viewer • Updated Sep 10 • 265 • 26

Allen-UQ/texas_all_nodes

Viewer • Updated Sep 10 • 187 • 25

updated a model 3 months ago

Allen-UQ/Qwen2.5-7B-Instruct-GRPO-CHW-Bal-step-90

8B • Updated Sep 9 • 3

published a model 3 months ago

Allen-UQ/Qwen2.5-7B-Instruct-GRPO-CHW-Bal-step-90

8B • Updated Sep 9 • 3

updated a dataset 3 months ago

Allen-UQ/CHW_all_target_improvement_balanced

Viewer • Updated Sep 7 • 8.67k • 27

published a dataset 3 months ago

Allen-UQ/CHW_all_target_improvement_balanced

Viewer • Updated Sep 7 • 8.67k • 27

updated a model 3 months ago

Allen-UQ/Qwen2.5-7B-Instruct-GRPO-CHW-step-100

8B • Updated Sep 6 • 1

published a model 3 months ago

Allen-UQ/Qwen2.5-7B-Instruct-GRPO-CHW-step-100

8B • Updated Sep 6 • 1

updated a model 3 months ago

Allen-UQ/Qwen2.5-7B-Instruct-GRPO-CHW-step-150

8B • Updated Sep 6 • 3
