Given its increasing importance and capability, I decided it was time to see how Grok compares to ChatGPT.
I’ve put ChatGPT up against Gemini, then against Claude.
I’ve also put Claude up against Google Gemini.
Creating the prompts
The goal of this test is a straight model-to-model comparison.
The closer it gets to the requested elements and positioning, the better it performs.
The room should have large windows letting in natural light from the left side, with sheer white curtains.
Include a grey Persian cat sleeping on a round cushion under the desk."
But that isn’t the case for Grok, which is much more natural.
But it struggled to follow the prompt exactly.
The winner will have the most detail, describe the equipment without assumption and accurately recognize scale and perspective.
Bonus points for getting the correct Apollo mission number.
Prompt: “Study this photograph carefully.
Both did a good job, although neither identified the Apollo mission, even in a follow-up question.
You can read the full analysis from both models in a Google Doc.
Use only standard Python libraries.
The code must run without modifications.”
It also struggled to display the words on the buttons, but it was fully complete.
Out of the box, I could start, pause and reset the timer.
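For a sense of what "start, pause and reset" involves, here is a minimal sketch of the timer logic using only the standard library. It is not the code either chatbot produced. The class name `PausableTimer`, the `clock` parameter and the 60-second duration are all assumptions made for illustration, and the buttons and window are left out.

```python
import time


class PausableTimer:
    """Countdown timer with start, pause and reset (illustrative sketch only)."""

    def __init__(self, duration_seconds, clock=time.monotonic):
        self.duration = duration_seconds
        self._clock = clock          # injectable so the timer can be tested
        self._elapsed = 0.0          # seconds accumulated while running
        self._started_at = None      # clock reading at last start; None when paused

    def start(self):
        # Starting an already running timer does nothing.
        if self._started_at is None:
            self._started_at = self._clock()

    def pause(self):
        # Bank the time that has passed since the last start.
        if self._started_at is not None:
            self._elapsed += self._clock() - self._started_at
            self._started_at = None

    def reset(self):
        # Return to the full duration and stop counting.
        self._elapsed = 0.0
        self._started_at = None

    def remaining(self):
        elapsed = self._elapsed
        if self._started_at is not None:
            elapsed += self._clock() - self._started_at
        return max(0.0, self.duration - elapsed)


if __name__ == "__main__":
    timer = PausableTimer(60)
    timer.start()
    time.sleep(1)
    timer.pause()
    print(f"Seconds remaining: {timer.remaining():.0f}")
```

A graphical version would wire three buttons to `start`, `pause` and `reset` and refresh a label from `remaining()` on a short interval.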
Creative Writing
Being able to write creatively is an essential skill for a chatbot.
After all, how else will all those high school students get an A?
It also has to be under 500 words.
Keep it under 500 words."
It does this while creating a more emotionally resonant narrative with stronger character development and more natural dialogue.
You can read both stories in full in a Google Doc.
The lights keep changing colors, the thermostat is fluctuating, and the smart speakers are playing random music.
Create a systematic troubleshooting guide that identifies potential causes and solutions, considering both technical and non-technical users."
This is especially the case for non-technical users who need quick and easy solutions in a stressful situation.
This is the amount of information it can hold within a single instance.
It also helps to have live web search capabilities.
For this test, I’m looking to get it to plan a Tokyo trip and include specific details.
Prompt: “Plan a 3-day Tokyo trip focused on technology attractions.
Total budget must be included in USD and Yen.”
I also found it aligned better with the prompt as it focuses on technology attractions.
Full analysis in a Google Doc.
Education
Finally, education.
AI is a great tool for explaining complex ideas in a simple way.
In this case, it’s clouds and 10-year-olds.
Include at least two simple experiments they could try at home to demonstrate the concepts."
Grok’s explanation makes for more engaging storytelling and better experiments.
Its response likely works better to capture a child’s imagination.
You can see both in a Google Doc.
ChatGPT vs Grok: The Winner
This was the closest test I’ve done to date.
I’ll be honest: I was shocked by the output.
I know Grok has been improving, but I expected ChatGPT to win this one easily.
Its writing is more engaging and less formal.
All of these tests used Grok 2 and GPT-4o.
Also, Grok 3 is on the horizon and may be out before GPT-5.