Google's Imagen 4: Significantly Enhanced Text-to-Image Generation

Google has introduced its latest text-to-image model, Imagen 4, which claims to deliver 'significantly improved' image quality over previous versions, including Imagen 3. The new model also features a deluxe variant, Imagen 4 Ultra, designed to produce more precise images based on detailed prompts, at a higher cost. Both versions are available through a paid preview in the Gemini API and limited free testing in Google AI Studio.
The main Imagen 4 model is priced at $0.04 per image, suitable for most tasks, while Imagen 4 Ultra costs $0.06 per image, aimed at users requiring high fidelity and accuracy. Google showcased various images generated by these models, such as a comic strip of a spaceship attacked by a space lizard, and scenes like a vintage travel postcard of Kyoto or a hiking couple, all following prompts closely. Despite improvements, some users find Imagen 4 to be only a mild upgrade compared to competitors like Dall-E 3 and Midjourney 7. The AI art market appears to be stabilizing, with primary uses shifting toward social media advertising rather than artistic creation.
Overall, Imagen 4 enhances Google’s image generation capabilities but may not significantly change the landscape for those already using leading AI image generators.