I tested ChatGPT o3-mini vs DeepSeek R1 vs Qwen 2.5 with 9 prompts — here’s the winner

With superior multilingual capabilities and high inference efficiency, the model has shown versatility in a wide range of applications.

Here's what happened when these free-tier models faced off, including the overall winner.

o3-mini vs Qwen 2.5 vs DeepSeek screenshot

Lateral thinking puzzle

Prompt: You are in a completely dark room with three light switches on a wall.

How do you determine which switch controls which bulb?

o3-mini comes in second for its thorough explanation, which was less structured than Qwen 2.5's.

Deductive reasoning

Prompt: “A detective is investigating a murder case.

He interviews three suspects: Alice, Bob, and Charlie.

One of them is guilty, and the other two are telling the truth.

Here's what they say:

Alice: “Bob is innocent.”

Bob: “Charlie is guilty.”

Charlie: “I am innocent.”

Who is the murderer?”

o3-mini delivered a step-by-step elimination approach: the model systematically assumes each person is guilty and checks for contradictions.

The explanation is clear, logical, and doesn't overcomplicate.
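
For readers who want to check the puzzle themselves, the elimination approach can be sketched in a few lines of Python. This is only an illustration of the technique, not the output of any of the chatbots; the function names (statements, consistent, solve) are made up for this sketch, and it assumes the rule from the prompt that the two innocent suspects tell the truth while the guilty one's statement is unconstrained.

```python
SUSPECTS = ("Alice", "Bob", "Charlie")


def statements(guilty):
    """Truth value of each suspect's statement if `guilty` is the murderer."""
    return {
        "Alice": guilty != "Bob",        # "Bob is innocent."
        "Bob": guilty == "Charlie",      # "Charlie is guilty."
        "Charlie": guilty != "Charlie",  # "I am innocent."
    }


def consistent(guilty):
    """Per the prompt, the two innocent suspects must both be truthful."""
    truth = statements(guilty)
    return all(truth[name] for name in SUSPECTS if name != guilty)


def solve():
    """Assume each suspect is guilty in turn; keep those with no contradiction."""
    return [name for name in SUSPECTS if consistent(name)]


if __name__ == "__main__":
    print(solve())
```

Assuming Alice is guilty makes Bob's statement false, and assuming Bob is guilty makes Alice's statement false, so only one assumption survives the check.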

Qwen 2.5 was a close second.

It includes try-except blocks to handle invalid inputs, making it more robust.

o3-mini was a close second, with a good implementation but slightly less comprehensive error handling.
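
To show what this kind of input handling looks like in practice, here is a minimal, generic sketch. The coding prompt and the chatbots' actual code are not reproduced in this article, so the function below (parse_positive_int) is a hypothetical stand-in rather than anything the models wrote.

```python
def parse_positive_int(text):
    """Return `text` as a positive integer, or None if it is not one.

    Hypothetical helper for illustration only.
    """
    try:
        value = int(text.strip())
    except (ValueError, AttributeError):
        # ValueError: the text is not a whole number (e.g. "abc", "3.5").
        # AttributeError: the input is not a string at all (e.g. None).
        return None
    return value if value > 0 else None
```

Without the try-except block, a stray letter or an empty value would crash the program instead of being rejected cleanly, which is the kind of robustness being rewarded here.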

Mathematical proof

Prompt: “Prove the Pythagorean theorem using a geometric approach.”

o3-mini delivered an explanation that follows a well-structured, step-by-step approach, making it easy to understand.

DeepSeek crafted a correct proof that follows a logical structure.

Yet it lacks the conversational approach of the o3-mini and Qwen 2.5 responses.

Winner: o3-mini wins for the best combination of clarity, detail and logical flow.

Qwen 2.5 is in second place with a solid response that had formatting and visualization issues.
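
For context, one standard geometric argument is the rearrangement proof, sketched below. It is not necessarily the proof any of the three chatbots produced. It assumes a right triangle with legs a and b and hypotenuse c, with four copies of the triangle placed in the corners of a square of side a + b so that they enclose a tilted inner square of side c.

```latex
\begin{align*}
\text{Area of the large square} &= (a+b)^2 \\
\text{Same area, counted piece by piece} &= c^2 + 4\cdot\tfrac{1}{2}ab \\
(a+b)^2 &= c^2 + 2ab \\
a^2 + 2ab + b^2 &= c^2 + 2ab \\
a^2 + b^2 &= c^2
\end{align*}
```

The whole argument rests on counting the same area in two ways, which is why a clear diagram or layout matters so much in a chatbot's answer to this prompt.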

Scientific explanation

Prompt: “Explain the process of photosynthesis in detail.”

o3-mini provided detailed descriptions of both light-dependent and light-independent reactions with clear breakdowns of each step.

The step-by-step progression from capturing light to converting energy into glucose is easy to follow.

Winner: o3-mini wins for the best balance of depth, clarity, organization, and accuracy.

DeepSeek was a close second for its solid explanation, though it lacked some finer details.

Historical analysis

Prompt: “Analyze the causes and effects of the French Revolution.”

However, the economic consequences could have been explored in more detail.

DeepSeek comes in second place for a solid but slightly less detailed response.

o3-mini explored both themes of madness and revenge and how they intertwine rather than treating them as separate topics.

Qwen 2.5 offered a very detailed discussion of feigned vs. real madness.

DeepSeek is second for a strong response, but it was more summary-like and less interwoven.

Philosophical discussion

Prompt: “Discuss the concept of utilitarianism and its implications in modern ethics.”

Qwen 2.5 is in second place for a good explanation with a slightly weaker structure and conclusion.

Urban planning

Prompt: “Design an integrated strategy to optimize urban transportation in a rapidly growing megacity.”

However, the chatbot lacked a strong focus on governance and long-term future-proofing.

Qwen 2.5 came in second for a strong but slightly less structured response.
