-
StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback
Paper • 2402.01391 • Published • 43 -
LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens
Paper • 2402.13753 • Published • 116 -
Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length
Paper • 2404.08801 • Published • 66 -
TransformerFAM: Feedback attention is working memory
Paper • 2404.09173 • Published • 43
gunasekar
GunA-SD
AI & ML interests
None yet
Recent Activity
liked
a model
9 days ago
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
liked
a model
11 days ago
Qwen/Qwen3-0.6B
updated
a dataset
over 1 year ago
GunA-SD/bash_code
Organizations
None yet