帖子、文章和讨论

Community Articles

OVHcloud on Hugging Face Inference Providers 🔥

Norm-Preserving Biprojected Abliteration

Curating datasets directly on the Hub

Uncensor any LLM with abliteration

KV Caching Explained: Optimizing Transformer Inference Efficiency

Code a simple RAG from scratch

Gemini-3 Benchmarkathon

Building Jobly: Semantic Job Matching with RAG and Vector Embeddings

From GRPO to DAPO and GSPO: What, Why, and How

A Guide to Hugging Face’s Papers Page

How MCP Blockly Makes MCP Server Creation Accessible for Everyone

Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models

Building Deep Research: How we Achieved State of the Art

Building and evaluating Multimodal Rerankers

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

10 Best Open-Source LLM Models (2025 Updated): Llama 4, Qwen 3 and DeepSeek R1

Introduction to State Space Models (SSM)

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

LLM数据工程3——数据收集魔法：获取顶级训练数据的方法

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

researchtime-series

使用 🤗 Transformers 进行概率时间序列预测

2022年12月1日

guideexpert-acceleration-program

加速 Document AI (文档智能) 发展

2022年11月21日

Hugging Face 提供的推理（Inference）解决方案

2022年11月21日

nlptext generationresearch

在 Transformers 中使用对比搜索生成可媲美人类水平的文本🤗

2022年11月8日

diffusersstable-diffusiondreambooth

使用 Diffusers 通过 Dreambooth 技术来训练 Stable Diffusion

2022年11月7日

使用 🤗 Transformers 为多语种语音识别任务微调 Whisper 模型

2022年11月3日

guideresearchopen-source-collab

从 PyTorch DDP 到 Accelerate 到 Trainer，轻松掌握分布式训练

2022年10月21日

open-source-collabcommunityresearch

优化故事: BLOOM 模型推理

2022年10月12日

SetFit: 高效的无提示少样本学习

+2

2022年9月26日

使用 DeepSpeed 和 Accelerate 进行超快 BLOOM 模型推理

2022年9月16日

如何使用 Megatron-LM 训练语言模型

2022年9月7日

nlpllmquantization

大规模 Transformer 模型 8 比特矩阵乘简介 - 基于 Hugging Face Transformers、Accelerate 以及 bitsandbytes

2022年8月17日

千亿参数开源大模型 BLOOM 背后的技术

2022年7月14日

使用 PyTorch 完全分片数据并行技术加速大模型训练

2022年5月2日

Community Articles

OVHcloud on Hugging Face Inference Providers 🔥

Norm-Preserving Biprojected Abliteration

Curating datasets directly on the Hub

Uncensor any LLM with abliteration

KV Caching Explained: Optimizing Transformer Inference Efficiency

Code a simple RAG from scratch

Gemini-3 Benchmarkathon

Building Jobly: Semantic Job Matching with RAG and Vector Embeddings

From GRPO to DAPO and GSPO: What, Why, and How

A Guide to Hugging Face’s Papers Page

How MCP Blockly Makes MCP Server Creation Accessible for Everyone

Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models

Building Deep Research: How we Achieved State of the Art

Building and evaluating Multimodal Rerankers

The 1 Billion Token Challenge: Finding the Perfect Pre-training Mix

10 Best Open-Source LLM Models (2025 Updated): Llama 4, Qwen 3 and DeepSeek R1

Introduction to State Space Models (SSM)

makeMoE: Implement a Sparse Mixture of Experts Language Model from Scratch

LLM数据工程3——数据收集魔法：获取顶级训练数据的方法

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

View all articles