4 18 8

Roxanna

borntobeignored

AI & ML interests

None yet

Recent Activity

upvoted an article 4 days ago

20x Faster TRL Fine-tuning with RapidFire AI

upvoted an article 4 days ago

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

upvoted an article 4 days ago

Open-R1: a fully open reproduction of DeepSeek-R1

View all activity

Organizations

upvoted 4 articles 4 days ago

Article

20x Faster TRL Fine-tuning with RapidFire AI

17 days ago

•

Article

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

Apr 29

•

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

•

887

Article

🚀 Build a Qwen 2.5 VL API endpoint with Hugging Face spaces and Docker!

Jan 29

•

upvoted a paper 5 days ago

Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning

Paper • 2511.16043 • Published 17 days ago • 104

upvoted 3 articles about 2 months ago

Article

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

Sep 11

•

166

Article

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

Aug 18

•

Article

Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training

Aug 8

•

upvoted a paper 3 months ago

Analysing Chain of Thought Dynamics: Active Guidance or Unfaithful Post-hoc Rationalisation?

Paper • 2508.19827 • Published Aug 27 • 33

upvoted a paper 4 months ago

Agent Lightning: Train ANY AI Agents with Reinforcement Learning

Paper • 2508.03680 • Published Aug 5 • 121

upvoted an article 4 months ago

Article

SmolLM3: smol, multilingual, long-context reasoner

Jul 8

•

734

upvoted 4 papers 4 months ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 313

nablaNABLA: Neighborhood Adaptive Block-Level Attention

Paper • 2507.13546 • Published Jul 17 • 124

LAPO: Internalizing Reasoning Efficiency via Length-Adaptive Policy Optimization

Paper • 2507.15758 • Published Jul 21 • 35

Hierarchical Budget Policy Optimization for Adaptive Reasoning

Paper • 2507.15844 • Published Jul 21 • 16

upvoted a collection 5 months ago

🔍 Interpretability & Analysis of LMs

Collection

Outstanding research in LM interpretability and evaluation, summarized • 134 items • Updated Oct 20 • 116

upvoted an article 5 months ago

Article

Uncensor any LLM with abliteration

Jun 13, 2024

•

733

upvoted a paper 6 months ago

Adaptive Sparse Allocation with Mutual Choice & Feature Choice Sparse Autoencoders

Paper • 2411.02124 • Published Nov 4, 2024 • 1

Roxanna

AI & ML interests

Recent Activity

Organizations

borntobeignored's activity

20x Faster TRL Fine-tuning with RapidFire AI

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

Open-R1: a fully open reproduction of DeepSeek-R1

🚀 Build a Qwen 2.5 VL API endpoint with Hugging Face spaces and Docker!

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training

SmolLM3: smol, multilingual, long-context reasoner

Uncensor any LLM with abliteration