LLaDA-8B-Base / README.md

nielsr HF Staff

Add link to paper

60fb46a verified 7 months ago

	---
	license: mit
	library_name: transformers
	pipeline_tag: text-generation
	---

	# LLaDA-8B-Base

	We introduce LLaDA, a diffusion language model trained entirely from scratch at an unprecedented 8B-parameter scale, rivaling LLaMA3 8B in performance.

	[Paper](https://huggingface.co/papers/2502.09992)

	[Project Page](https://ml-gsai.github.io/LLaDA-demo/)

	[Code](https://github.com/ML-GSAI/LLaDA)