arxiv:2510.00977
Liheng Ma
lihengma
AI & ML interests
Graph Machine Learning; Geometric Deep Learning; LLM Post-training
Recent Activity
authored a paper 30 days ago
It Takes Two: Your GRPO Is Secretly DPO
upvoted a paper about 1 month ago
It Takes Two: Your GRPO Is Secretly DPO
updated a model 6 months ago
lihengma/Qwen-2.5-7B-Instruct_2wiki_text_mrl_v7