AI has written like a typewriter since day one: one key at a time, no backspace.
Google's new DiffusionGemma writes like a draft instead. It dumps a rough block of text, then edits it into shape:
- Whole paragraphs at once, not word by word. Up to 4x faster.
- It fixes its own mistakes while writing, instead of being stuck with the first guess.
- It runs on a single consumer graphics card, not a data center.
- Free and open under Apache 2.0, so anyone can run it themselves.
Why it matters even if you never touch the code: fast, local, and free together means smaller builders can run real AI without an API bill they do not control.
The catch: it is experimental. Speed is easy to show off. Whether the writing quality holds up is the part still unproven.
Google (@Google)
Meet DiffusionGemma ⚡ Our latest experimental open model (Apache 2.0) that generates text up to 4x faster.
Instead of predicting and typing just one word at a time like most language models, it drafts and refines entire blocks of text simultaneously.
Here’s how it works 🧵 ↓
— https://nitter.net/Google/status/2064741293163418032#m