Twitter/X

Google's DiffusionGemma is an experimental open-source (Apache 2.0) text model…

Brief

Google's DiffusionGemma is an experimental open-source (Apache 2.0) text model that drafts entire paragraphs and edits them in place, claiming up to 4× faster generation and the ability to run on a single consumer GPU. Its combination of speed, local execution, and no-API cost promises accessibility for smaller builders, though output quality is still unverified.

Source evidence

AI has written like a typewriter since day one: one key at a time, no backspace.

Google's new DiffusionGemma writes like a draft instead. It dumps a rough block of text, then edits it into shape:

  1. Whole paragraphs at once, not word by word. Up to 4x faster.
  2. It fixes its own mistakes while writing, instead of being stuck with the first guess.
  3. It runs on a single consumer graphics card, not a data center.
  4. Free and open under Apache 2.0, so anyone can run it themselves.

Why it matters even if you never touch the code: fast, local, and free together means smaller builders can run real AI without an API bill they do not control.

The catch: it is experimental. Speed is easy to show off. Whether the writing quality holds up is the part still unproven.

Google (@Google)

Meet DiffusionGemma ⚡ Our latest experimental open model (Apache 2.0) that generates text up to 4x faster.

Instead of predicting and typing just one word at a time like most language models, it drafts and refines entire blocks of text simultaneously.

Here’s how it works 🧵 ↓

— https://nitter.net/Google/status/2064741293163418032#m