Creating an image using artificial intelligence is easier than ever.
This will put it in more direct competition with Gemini, ChatGPT, Claude, and Meta AI.
The xAI team has also given Grok its own custom AI image creation model.
However, at least for now, it still uses the underlying Imagen 3 model to create pictures.
This will change as Gemini 2.0 has native image abilities.
So I put them head to head.
I want photos rather than drawings, so I will use "photo" as a keyword.
Gemini will only output images in a 1:1 aspect ratio and, so far, Grok seems to favor 4:3.
Unless otherwise indicated, all the images are the first response with no follow-up refinement.
They were also all requested within the same session rather than in a new chat for each prompt.
The fox is much more realistic than in the Gemini image.
It should show a commercial kitchen and behavior, also demonstrating the idea of activity.
It also needs to show material properties and be as realistic as possible.
I went for the documentary style as it adds complexity.
I’m looking for shadow lengths and activity levels.
This was the hardest call for me.
I wanted to see how well both models handled black-and-white photography.
In this they also had to show tool use, lighting and engine detail.
Action photography is a challenge.
I did it for a while as a journalist earlier in my career (not very well).
We need to show correct positioning, public safety measures within the image and a sense of urgency.
Gemini matched the prompt much more closely and created a more realistic-looking image.
This was an easy decision.
Finally, something more artistic.
As the prompt asks for someone practicing, I’ve given the win to Grok.
Winner: Gemini vs Grok
Grok is very impressive, not only as a chatbot but also in its ability to generate realistic images.
It was a close match-up.
The two models are fairly evenly matched, but Grok is better at interpreting a prompt and creates more natural-looking images.
What is worth noting is that soon Google will be launching a new version of Gemini that can create images natively.