TMLR Group

university

https://bhanml.github.io/group.html

tmlrgroup

tmlr-group

Activity Feed

AI & ML interests

Trustworthy Machine Learning and Reasoning

Recent Activity

Zfancy submitted a paper 7 days ago

Your Agents Are Aging Too: Agent Lifespan Engineering for Deployed Systems

Zfancy authored a paper 7 days ago

Unleashing Mask: Explore the Intrinsic Out-of-Distribution Detection Capability

Zfancy authored a paper 7 days ago

Exploring Model Dynamics for Accumulative Poisoning Discovery

View all activity

Papers

AgentHijack: Benchmarking Computer Use Agent Robustness to Common Environment Corruptions

Rethinking How to Remember: Beyond Atomic Facts in Lifelong LLM Agent Memory

View all Papers

Organization Card

Community About org cards

Trustworthy Machine Learning and Reasoning (TMLR) Group, an online-offline-mixed machine learning research group, locates in different cities, including Hong Kong, Melbourne, Shanghai, Nottingham and Sydney. We share the vision for the future ML technology: building trustworthy learning and reasoning algorithms, theories and systems.

Collections 2

models 73

datasets 5

TMLR-Group-HF/Co-rewarding-RephrasedDAPO-14k

Viewer • Updated Oct 11, 2025 • 14.1k • 16

TMLR-Group-HF/Co-rewarding-RephrasedMATH

Viewer • Updated Oct 11, 2025 • 7.5k • 31

TMLR-Group-HF/Co-rewarding-RephrasedOpenRS

Viewer • Updated Oct 11, 2025 • 7k • 40

TMLR-Group-HF/NoRa

Viewer • Updated May 1, 2025 • 185k • 14 • 2

TMLR-Group-HF/counteranimal

Viewer • Updated Apr 21, 2025 • 13.3k • 231 • 1

TMLR Group

AI & ML interests

Recent Activity

Papers

Collections 2

TMLR-Group-HF/Co-rewarding-RephrasedMATH

TMLR-Group-HF/Co-rewarding-I-Qwen2.5-3B-MATH

TMLR-Group-HF/Co-rewarding-I-Qwen2.5-7B-MATH

TMLR-Group-HF/Co-rewarding-I-Qwen3-1.7B-Base-MATH

TMLR-Group-HF/NoRa

Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?

TMLR-Group-HF/Co-rewarding-RephrasedMATH

TMLR-Group-HF/Co-rewarding-I-Qwen2.5-3B-MATH

TMLR-Group-HF/Co-rewarding-I-Qwen2.5-7B-MATH

TMLR-Group-HF/Co-rewarding-I-Qwen3-1.7B-Base-MATH

TMLR-Group-HF/NoRa

Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?

models 73

TMLR-Group-HF/AgentHijack-Agent

TMLR-Group-HF/Co-rewarding-III-Llama-3.2-3B-Instruct-DAPO14k

TMLR-Group-HF/Co-rewarding-III-Qwen3-4B-Base-DAPO14k

TMLR-Group-HF/Co-rewarding-III-Qwen3-8B-Base-DAPO14k

TMLR-Group-HF/Co-rewarding-III-Llama-3.2-3B-Instruct-MATH

TMLR-Group-HF/Co-rewarding-III-Qwen3-4B-Base-MATH

TMLR-Group-HF/Co-rewarding-III-Qwen3-8B-Base-MATH

TMLR-Group-HF/GT-Qwen3-4B-Base-DAPO14k

TMLR-Group-HF/GT-Llama-3.2-3B-Instruct-DAPO14k

TMLR-Group-HF/Self-Certainty-Qwen3-8B-Base-DAPO14k

datasets 5

TMLR-Group-HF/Co-rewarding-RephrasedDAPO-14k

TMLR-Group-HF/Co-rewarding-RephrasedMATH

TMLR-Group-HF/Co-rewarding-RephrasedOpenRS

TMLR-Group-HF/NoRa

TMLR-Group-HF/counteranimal

AI & ML interests

Recent Activity

Papers

Team members 14

Collections 2

models 73 Sort: Recently updated

datasets 5 Sort: Recently updated

models 73

datasets 5