Fred Zhangzhi Peng, Shuibai Zhang, Alex Tong
Duke University, UW-Madison, Aithyra
๐ค Hugging Face | ๐ป Codebase | ๐ Blog
๐ย English Blog | ๐จ๐ณไธญๆๅๅฎข
TL;DR: the first/most open release diffusion large language model.
The field of diffusion LLMs is still young, and the answers arenโt clear. Pioneering efforts like Gemini-Diffusion, Seed Diffusion, and Mercury sparked excitement, but they remain closed APIs. You can use them, but you canโt study how they were built.
Open projects like LLaDA and Dream pushed things further by releasing weights and inference code. But they stopped short of what researchers need most: training pipelines, data recipes, and reproducible evaluation.
Thatโs why we built Open-dLLM: the first full-stack open diffusion language model project.
๐ Our first release is Open-dCoder, focused on code generation. It includes:
With Open-dLLM, you can go from raw data โ training โ checkpoints โ evaluation โ inference, all in one repo.
Project | Data | Training Code | Inference | Evaluation | Weights |
---|---|---|---|---|---|
Open-dLLM (ours) | โ | โ | โ | โ | โ |
LLaDA | โ | โ | โ | โ ๏ธ limited | โ |
Dream | โ | โ | โ | โ ๏ธ limited | โ |
Gemini-Diffusion | โ | โ | โ | โ | โ (API only) |
Seed Diffusion | โ | โ | โ | โ | โ (API only) |
Mercury | โ | โ | โ | โ | โ (API only |
Hereโs our Open-dLLM generating a QuickSort algorithm from scratch:
Youtube Video (Please play it, I rly want u to enjoy the music : )
Youtube Video (Please play it, I rly want u to enjoy the music : )