GSAI-ML
/

LLaDA-8B-Base

Text Generation

Model card Files Files and versions

LLaDA-8B-Base / README.md

nieshen's picture

Update README.md

0f2787f verified 18 days ago

|

history blame contribute delete

428 Bytes

	---
	license: mit
	library_name: transformers
	pipeline_tag: text-generation
	---

	# LLaDA-8B-Base

	We introduce LLaDA, a diffusion model with an unprecedented 8B scale, trained entirely from scratch, rivaling LLaMA3 8B in performance.

	[Project Page](https://ml-gsai.github.io/LLaDA-demo/)

	[Code](https://github.com/ML-GSAI/LLaDA)

	## Updates
	[2025-10-21] We have modified modeling_llada.py to support the input of attention_mask.