General
Unicron: Economizing Self-Healing LLM Training at Scale
Paper
• 2401.00134
• Published
• 13
Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models
Paper
• 2401.00788
• Published
• 23
Chain-of-Table: Evolving Tables in the Reasoning Chain for Table Understanding
Paper
• 2401.04398
• Published
• 25
The Impact of Reasoning Step Length on Large Language Models
Paper
• 2401.04925
• Published
• 18
Bootstrapping LLM-based Task-Oriented Dialogue Agents via Self-Talk
Paper
• 2401.05033
• Published
• 18
PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models
Paper
• 2401.05252
• Published
• 49
AToM: Amortized Text-to-Mesh using 2D Diffusion
Paper
• 2402.00867
• Published
• 11
Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference
Paper
• 2403.04132
• Published
• 40
Teaching Large Language Models to Reason with Reinforcement Learning
Paper
• 2403.04642
• Published
• 49
GLoRe: When, Where, and How to Improve LLM Reasoning via Global and Local Refinements
Paper
• 2402.10963
• Published
• 12
ReFT: Reasoning with Reinforced Fine-Tuning
Paper
• 2401.08967
• Published
• 31
Octopus v2: On-device language model for super agent
Paper
• 2404.01744
• Published
• 58