Fred Zhangzhi Peng, Shuibai Zhang, Alex Tong

Duke University, UW-Madison, Aithyra

🤗 Hugging Face | 💻 Codebase | 📖 Blog

🌐 English Blog | 🇨🇳 Chinese Blog

TL;DR: Open-dLLM is the most open release of a diffusion large language model so far, and the first to open the full stack.

Open-dLLM

The field of diffusion LLMs is still young, and the answers aren't clear. Pioneering efforts like Gemini-Diffusion, Seed Diffusion, and Mercury sparked excitement, but they remain closed APIs. You can use them, but you can't study how they were built.

Open projects like LLaDA and Dream pushed things further by releasing weights and inference code. But they stopped short of what researchers need most: training pipelines, data recipes, and reproducible evaluation.

That's why we built Open-dLLM: the first full-stack open diffusion language model project.

👉 Our first release is Open-dCoder, focused on code generation. It includes:

With Open-dLLM, you can go from raw data → training → checkpoints → evaluation → inference, all in one repo.

| Project | Data | Training Code | Inference | Evaluation | Weights |
|---|---|---|---|---|---|
| Open-dLLM (ours) | ✅ | ✅ | ✅ | ✅ | ✅ |
| LLaDA | ❌ | ❌ | ✅ | ⚠️ limited | ✅ |
| Dream | ❌ | ❌ | ✅ | ⚠️ limited | ✅ |
| Gemini-Diffusion | ❌ | ❌ | ❌ | ❌ | ❌ (API only) |
| Seed Diffusion | ❌ | ❌ | ❌ | ❌ | ❌ (API only) |
| Mercury | ❌ | ❌ | ❌ | ❌ | ❌ (API only) |

Demo

Here's our Open-dLLM generating a QuickSort algorithm from scratch:

YouTube video (please play it, I really want you to enjoy the music :)
