Can LLMs Correct Themselves? A Benchmark of Self-Correction in LLMs • arXiv:2510.16062 • Published Oct 17
BadVLA: Towards Backdoor Attacks on Vision-Language-Action Models via Objective-Decoupled Optimization • arXiv:2505.16640 • Published May 22
Large Reasoning Models in Agent Scenarios: Exploring the Necessity of Reasoning Capabilities • arXiv:2503.11074 • Published Mar 14
Automating Safety Enhancement for LLM-based Agents with Synthetic Risk Scenarios • arXiv:2505.17735 • Published May 23