OpenAI's latest AI models are generating alarming levels of misinformation

Agencies Ghacks
May 7, 2025
Updated • May 7, 2025
OpenAI's latest AI models seem to have a big problem. A report reveals that its o3 and o4-mini models are producing misinformation at an alarming rate.

AI-generated misinformation, also known as hallucination, is common across most artificial intelligence services. The New York Times has reported on testing conducted by OpenAI that found its own newer models generating more fake content than earlier ones. This in turn has raised serious concerns about their reliability.

The o3 and o4-mini models have been designed to mimic human reasoning and logic. When they were put to the test on a benchmark involving questions about public figures, nearly one-third of o3's answers turned out to be hallucinations. By comparison, o1 had less than half that error rate in tests conducted last year. o4-mini fared even worse, hallucinating on 48% of its tasks. When the models tackled general knowledge questions, hallucination rates soared to 51% for o3 and a staggering 79% for o4-mini.
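To put those percentages in context, a hallucination rate on a question-answering benchmark is simply the share of a model's answers that do not match the reference answers. The sketch below illustrates the arithmetic with made-up questions and a placeholder model_answer() function; it is not OpenAI's actual benchmark data or grading pipeline.

```python
# Minimal sketch of how a hallucination rate on a QA benchmark is computed.
# The questions, answers, and model_answer() below are hypothetical stand-ins,
# not the benchmarks or evaluation harness referenced in the report.

benchmark = [
    {"question": "In which year was Ada Lovelace born?", "answer": "1815"},
    {"question": "Who wrote 'On the Origin of Species'?", "answer": "Charles Darwin"},
    {"question": "What is the capital of Australia?", "answer": "Canberra"},
]

def model_answer(question: str) -> str:
    """Placeholder for a call to the model being evaluated."""
    canned = {
        "In which year was Ada Lovelace born?": "1815",
        "Who wrote 'On the Origin of Species'?": "Charles Darwin",
        "What is the capital of Australia?": "Sydney",  # confident but wrong answer
    }
    return canned[question]

# Any answer that does not match the reference counts as a hallucination.
wrong = sum(
    1 for item in benchmark
    if model_answer(item["question"]).strip().lower() != item["answer"].strip().lower()
)
rate = wrong / len(benchmark)
print(f"Hallucination rate: {rate:.0%}")  # 33% here; the report cites 33-79% depending on model and benchmark
```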

OpenAI says the hallucination problem is not a sign that the reasoning models are worse, but that they may simply be more verbose and adventurous in their answers, speculating about possibilities rather than repeating predictable facts. The developers originally aimed for these systems to think critically and reason through complex queries; that ambitious approach, however, appears to have increased creativity at the expense of factual accuracy.

This could pose a big problem for OpenAI's ChatGPT, as rival services such as Google's Gemini and Anthropic's Claude are pitched as providing more accurate information. Unlike simpler models focused on high-confidence predictions, o3 and o4-mini often speculate, blurring the line between plausible scenarios and outright fabrications. That raises red flags for users in high-stakes environments, from legal professionals to educators and healthcare providers, where reliance on AI could lead to significant missteps.

The more useful AI becomes, the greater the potential for critical errors. While AI models may outperform humans in certain tasks, the risk of inaccuracies diminishes AI's overall credibility. Until these hallucination issues are effectively addressed, users are advised to approach AI-generated information with caution and skepticism.

Source: TechRadar


Comments

  1. Anonymous said on May 8, 2025 at 9:34 am
    Reply

    Okay, who crawled the Yahoo Answers archives? Should have stuck with Reddit – nobody ever makes things up there…

  2. Richard Hack said on May 8, 2025 at 3:24 am
    Reply

    So, in other words, they actually act like humans for once. LOL Most of the humans I see on the Internet are hallucinating “facts” at least 90 percent of the time.

    1. boris said on May 8, 2025 at 3:35 am
      Reply

      The problem is that when people ask questions of chatbots or Google Snippets, they believe they're asking a reputable source, not some rando from the internet.

  3. boris said on May 7, 2025 at 10:21 pm
    Reply

    So OpenAI admitted that they have problems. Actually pretty refreshing. I still believe that Google Search snippets generate mostly fake responses because they train their model on Reddit answers.

  4. KNTRO said on May 7, 2025 at 1:39 pm
    Reply

    “The New York Times has published an investigation conducted by OpenAI has revealed that points out OpenAI’s models are generating more fake content than others.”

    It seems like gHacks articles use these very same AI models!

    1. Tom Hawack said on May 7, 2025 at 6:31 pm
      Reply

      It seems? Doesn’t seem to me, at least not on the basis of fake content. Accuracy here is a tradition.
      But maybe your very comment has been carried out by an AI, given that speculation here borders on hallucination, my dear friend :)

      Concerning AI, let's face it: even with a narrow 10% of hallucinations it wouldn't deserve the qualification of intelligence, IMO. Assertions must be bullet-proof in terms of facts, otherwise speculations must be flagged as such. Like human beings when they state “I know” rather than “I think, I believe” … at least when they do differentiate their imagination from their knowledge, which, nowadays, is on a negative climb, so to say (lol).
