{"id":168468,"date":"2025-09-08T17:37:37","date_gmt":"2025-09-08T17:37:37","guid":{"rendered":"https:\/\/teknomers.com\/en\/openai-believes-it-has-figured-out-why-the-ai-sometimes-hallucinates-it-struggles-to-express-i-dont-know\/"},"modified":"2025-09-08T17:37:38","modified_gmt":"2025-09-08T17:37:38","slug":"openai-believes-it-has-figured-out-why-the-ai-sometimes-hallucinates-it-struggles-to-express-i-dont-know","status":"publish","type":"post","link":"https:\/\/teknomers.com\/en\/openai-believes-it-has-figured-out-why-the-ai-sometimes-hallucinates-it-struggles-to-express-i-dont-know\/","title":{"rendered":"OpenAI believes it has figured out why the AI sometimes hallucinates: it struggles to express &#8220;I don&#8217;t know.&#8221;"},"content":{"rendered":"\n<h2>Understanding AI Hallucinations: Why Chatbots Get It Wrong<\/h2>\n<p>\n    Artificial Intelligence (AI) has made remarkable strides in recent years, particularly in natural language processing. However, even the most sophisticated AI models, such as those developed by OpenAI, exhibit a troubling phenomenon known as \u00a0&#8220;hallucinations.&#8221;\u00a0 These occurrences manifest when AI generates responses that are entirely inaccurate or fabricated, often leading to confusion and frustration for users. Understanding the reasons behind these errors is crucial for improving AI systems and their reliability.\n<\/p>\n<p>\n    According to a recent <a rel=\"noopener, noreferrer nofollow\" href=\"https:\/\/cdn.openai.com\/pdf\/d04913be-3f6f-4d2b-b283-ff432ef4aaa5\/why-language-models-hallucinate.pdf\" target=\"_blank\">report published by OpenAI<\/a>, hallucinations arise primarily due to the \u00a0&#8220;statistical pressures&#8221;\u00a0 present during the training and evaluation phases of AI development. The AI is often compelled to provide answers even in cases where it should acknowledge uncertainty. This can be likened to a student facing a tough exam question who guesses an answer instead of admitting they don&#8217;t know.\n<\/p>\n<div class=\"article-asset article-asset-normal article-asset-center\">\n<div class=\"desvio-container\">\n<div class=\"desvio\">\n<div class=\"desvio-figure js-desvio-figure\"><\/div>\n<div class=\"desvio-summary\">\n<div class=\"desvio-taxonomy js-desvio-taxonomy\">In Xataka<\/div>\n<p>                Good news, you don&#8217;t have to choose model using GPT-5. Bad news, it is GPT-5 who chooses it without notifying you.\n            <\/p><\/div>\n<\/p><\/div>\n<\/p><\/div>\n<\/div>\n<p>\n    One core issue with the AI&#8217;s training is the way it learns from vast text corpora. In the \u00a0pre-training\u00a0 phase, AIs learn to predict the next word in a sentence based on patterns found in previous examples. However, these systems lack true\/false labels for the sentences they generate, relying solely on positive examples of language. This approach increases the likelihood of producing inaccurate content.\n<\/p>\n<p>\n    The report cites a method to mitigate these hallucinations: implementing a binary classification system known as \u00a0IIV (Is it Valid?)\u00a0. This system would help determine whether a response is valid or erroneous, improving the model&#8217;s self-awareness. When this binary system is integrated into models like GPT-5, the AI shows signs of \u00a0&#8220;humility,&#8221;\u00a0 with the ability to classify its answers as correct, incorrect, or an abstention. Preliminary data indicate that GPT-5 has made strides in reducing its hallucination rate; it is reported that 52% of its responses abstain from providing an answer altogether, compared to just 1% for its predecessor, O4-mini.\n<\/p>\n<div class=\"article-asset-image article-asset-normal article-asset-center\">\n<div class=\"asset-content\">\n        <img decoding=\"async\" alt=\"Screen capture 2025 09 08 at 13 35 49\" class=\"centro_sinmarco\" src=\"https:\/\/teknomers.com\/en\/wp-content\/uploads\/2025\/09\/OpenAI-believes-it-has-figured-out-why-the-AI-sometimes.jpeg\">\n    <\/div>\n<\/div>\n<p>\n    The \u00a0research\u00a0 reveals a systemic issue: current benchmarks tend to emphasize successes while glossing over failures such as hallucinations. Amidst continual advancements, it appears that AI models are still prone to generating invalid information. There&#8217;s an urgent need to create a balanced framework that holds models accountable for both correct answers and the cases in which they should admit ignorance.\n<\/p>\n<p>\n    To draw parallels with a more familiar context, consider how educational assessments are structured. A strategy exists to discourage guessing by penalizing incorrect answers while offering neutral points for abstaining from answers altogether. Implementing a similar approach could reinforce accountability in AI models, discouraging inaccuracies while promoting the reliability of the responses.\n<\/p>\n<p>\n    The ongoing quest to enhance AI capabilities continues to evolve, with companies like OpenAI leading the charge. As the \u00a0development of AI\u00a0 progresses, addressing the dilemma of hallucinations and implementing measures such as the IIV system will be essential in ensuring the responsible integration of AI into various industries.\n<\/p>\n<p>\n    While AI has already transformed numerous sectors, it remains essential to proceed cautiously, ensuring that users can trust the information these technologies provide. The future of AI hinges upon our ability to refine its development processes and enhance transparency, paving the way for a more accurate and reliable interaction between humans and machines.\n<\/p>\n<p><br \/>\n<br \/><a href=\"https:\/\/teknomers.com\/category\/general\/\" rel=\"dofollow\">General News &#8211; 2<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Understanding AI Hallucinations: Why Chatbots Get It Wrong Artificial Intelligence (AI) has made remarkable strides in recent years, particularly in natural language processing. However, even the most sophisticated AI models, such as those developed by OpenAI, exhibit a troubling phenomenon known as \u00a0&#8220;hallucinations.&#8221;\u00a0 These occurrences manifest when AI generates responses that are entirely inaccurate or [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":166203,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[36399],"tags":[211,4929,1635,41425,41877,18785,1734],"class_list":["post-168468","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-technology","tag-believes","tag-dont","tag-express","tag-figured","tag-hallucinates","tag-openai","tag-struggles"],"_links":{"self":[{"href":"https:\/\/teknomers.com\/en\/wp-json\/wp\/v2\/posts\/168468","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/teknomers.com\/en\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/teknomers.com\/en\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/teknomers.com\/en\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/teknomers.com\/en\/wp-json\/wp\/v2\/comments?post=168468"}],"version-history":[{"count":0,"href":"https:\/\/teknomers.com\/en\/wp-json\/wp\/v2\/posts\/168468\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/teknomers.com\/en\/wp-json\/wp\/v2\/media\/166203"}],"wp:attachment":[{"href":"https:\/\/teknomers.com\/en\/wp-json\/wp\/v2\/media?parent=168468"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/teknomers.com\/en\/wp-json\/wp\/v2\/categories?post=168468"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/teknomers.com\/en\/wp-json\/wp\/v2\/tags?post=168468"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}