Both of OpenAI's new models, o3 and o4-mini, are hallucinating.
That in itself isn't out of the ordinary, as most AI models still do it.
But these two new versions appear to hallucinate more than several of OpenAI's older models.
Historically, while most new models have continued to hallucinate, the rate has fallen with each new release.
The potentially larger issue here is that OpenAI doesn't know why this has happened.
What are hallucinations?
If you've used an AI model, you've most likely seen it hallucinate.
This is when the model produces incorrect or misleading results.
In low-stakes prompts, this can be a small, unimportant issue, like a chatbot confidently citing a book that doesn't exist.
What does this mean for the o3 and o4-mini models?
According to OpenAI's report, the smaller o4-mini model hallucinates more often than o3. This is expected, as smaller models have less world knowledge and tend to hallucinate more.
"However, we also observed some performance differences comparing o1 and o3," the report states. "More research is needed to understand the cause of this result."
The report found that o3 hallucinated in response to 33% of questions on the company's PersonQA benchmark.
That is roughly double the hallucination rate of OpenAI's previous reasoning models.
However, as both models are designed for more complex tasks, this could be problematic going forward.
As mentioned above, hallucinations can be an amusing quirk in low-stakes prompts.