Collections

Discover the best community collections!

Collections including paper arxiv:2401.04088
Papers reimplemented
List of research papers, architectures, and techniques I re implemented in LLM-quest or Hugging Face's TRL. Missing papers: Qwen3-Next, GPT-2
Papers
Collection of useful papers.
Pretrain/Finetuning
Collection by
May 20, 2025
Paper - Multimodal
Paper related to Multimodal Model - Research for a : Modular, Multimodal, Multi-Stream, Mixture of Expert, Universal Transformer, Matryoshka embedding
paper scaning
Collection by
Dec 10, 2025
Papers reimplemented
List of research papers, architectures, and techniques I re implemented in LLM-quest or Hugging Face's TRL. Missing papers: Qwen3-Next, GPT-2
paper scaning
Collection by
Dec 10, 2025
Papers
Collection of useful papers.
Pretrain/Finetuning
Collection by
May 20, 2025
Paper - Multimodal
Paper related to Multimodal Model - Research for a : Modular, Multimodal, Multi-Stream, Mixture of Expert, Universal Transformer, Matryoshka embedding