LLM360

community

https://www.llm360.ai

AI & ML interests

None defined yet.

Recent Activity

zhou94539 new activity 10 days ago

LLM360/TxT360:dupli

shaurya0512 new activity 27 days ago

LLM360/TxT360-3efforts:Inquiry Regarding the Code for TxT360-3efforts Dataset

OnAnOrange authored a paper about 1 month ago

Code as Agent Harness

View all activity

in LLM360/TxT360 10 days ago

dupli

#9 opened 10 days ago by

in LLM360/TxT360-3efforts 27 days ago

Inquiry Regarding the Code for TxT360-3efforts Dataset

#2 opened about 2 months ago by

authored a paper about 1 month ago

Code as Agent Harness

Paper • 2605.18747 • Published May 18 • 223

submitted a paper to Daily Papers about 2 months ago

SNLP: Layer-Parallel Inference via Structured Newton Corrections

Paper • 2605.17842 • Published May 18 • 5

in LLM360/TxT360-3efforts about 2 months ago

Inquiry Regarding the Code for TxT360-3efforts Dataset

#2 opened about 2 months ago by

submitted a paper to Daily Papers about 2 months ago

SlimQwen: Exploring the Pruning and Distillation in Large MoE Model Pre-training

Paper • 2605.08738 • Published May 9 • 13

in LLM360/TxT360 2 months ago

Will the code/scripts be released?

#10 opened over 1 year ago by

in LLM360/TxT360 2 months ago

Will the code/scripts be released?

#10 opened over 1 year ago by

submitted a paper to Daily Papers 3 months ago

S2D2: Fast Decoding for Diffusion LLMs via Training-Free Self-Speculation

Paper • 2603.25702 • Published Mar 26 • 8

submitted a paper to Daily Papers 4 months ago

MosaicMem: Hybrid Spatial Memory for Controllable Video World Models

Paper • 2603.17117 • Published Mar 17 • 89

updated a model 4 months ago

LLM360/eval-360-sources

published a model 4 months ago

LLM360/eval-360-sources

authored a paper 4 months ago

Training Language Models via Neural Cellular Automata

Paper • 2603.10055 • Published Mar 9 • 8

authored a paper 4 months ago

Memex(RL): Scaling Long-Horizon LLM Agents via Indexed Experience Memory

Paper • 2603.04257 • Published Mar 4 • 19

submitted a paper to Daily Papers 4 months ago

Memex(RL): Scaling Long-Horizon LLM Agents via Indexed Experience Memory

Paper • 2603.04257 • Published Mar 4 • 19

authored a paper 4 months ago

Mode Seeking meets Mean Seeking for Fast Long Video Generation

Paper • 2602.24289 • Published Feb 27 • 41

submitted a paper to Daily Papers 4 months ago

Mode Seeking meets Mean Seeking for Fast Long Video Generation

Paper • 2602.24289 • Published Feb 27 • 41

authored a paper 4 months ago

The Diffusion Duality, Chapter II: $Ψ$-Samplers and Efficient Curriculum

Paper • 2602.21185 • Published Feb 24 • 4

submitted a paper to Daily Papers 6 months ago

Gecko: An Efficient Neural Architecture Inherently Processing Sequences with Arbitrary Lengths

Paper • 2601.06463 • Published Jan 10 • 2

authored a paper 7 months ago

LYNX: Learning Dynamic Exits for Confidence-Controlled Reasoning

Paper • 2512.05325 • Published Dec 5, 2025 • 5