Ideogram has released version 4.0 of its text-to-image model with open weights.
According to Ideogram, the new features include native 2K resolution, transparent backgrounds, precise layout control via bounding boxes, and improved text rendering in images, which is useful for logos and posters. Editable text and layers are coming soon, the company says.
The model can run on your own hardware and be fine-tuned with your own data. Weights and code are available for download on GitHub, but commercial use requires a paid license.
According to the DesignArena leaderboard, Ideogram 4.0 ranks first among all open-weight models. Only closed models from OpenAI and Google score higher. In the text-to-image arena, it also takes first place in quality mode and ninth overall.
According to the Ideogram website, the model is available in three quality tiers via Ideogram's own hosted API:
| Quality level | Price per image (USD) |
|---|---|
| Turbo | $0.03 |
| Default | $0.06 |
| Quality | $0.10 |
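
To put the per-image prices in perspective, here is a minimal sketch that estimates the cost of a batch of images for each tier. The prices are taken from the table above; the function name `estimate_cost` and the lowercase tier keys are our own illustrative choices and are not part of Ideogram's API. The sketch also ignores any volume discounts, taxes, or free quotas that may apply.

```python
from decimal import Decimal

# Per-image prices in USD, copied from the pricing table above.
# The lowercase keys are illustrative labels, not official API parameters.
PRICE_PER_IMAGE_USD = {
    "turbo": Decimal("0.03"),
    "default": Decimal("0.06"),
    "quality": Decimal("0.10"),
}


def estimate_cost(tier: str, num_images: int) -> Decimal:
    """Return the estimated cost in USD for num_images at the given tier."""
    if num_images < 0:
        raise ValueError("num_images must be non-negative")
    try:
        price = PRICE_PER_IMAGE_USD[tier.lower()]
    except KeyError:
        raise ValueError(f"unknown tier: {tier!r}") from None
    # Decimal arithmetic avoids binary floating-point rounding on currency.
    return price * num_images


if __name__ == "__main__":
    for tier in PRICE_PER_IMAGE_USD:
        print(f"{tier}: ${estimate_cost(tier, 1000):.2f} per 1,000 images")
```

At these list prices, 1,000 images come to $30 in Turbo, $60 in Default, and $100 in Quality mode.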
Ideogram 4.0 is also available on the web and across partner platforms, including Hugging Face, ComfyUI, fal, Runware, Magnific, Krea AI, Leonardo AI, Picsart, Cloudflare, Replicate, Gamma, Flora AI, and Kittl.
In our benchmark prompt, the model easily outperforms Midjourney v8 and lands roughly on par with Flux, but falls short of GPT-Image-2, Nano Banana Pro, and Luma Uni-1.1. That's just one prompt, though, and it mainly tests prompt following and the model's ability to render abstract concepts unlikely to appear in the training data, like a horse-riding astronaut. As always, your own testing is a must.