AIandBusiness.com
X(Twitter)
English
English
  • Welcome to AI&Business (Beta)
  • INTO AI
    • AI Basics
      • AI in the Enterprise
      • AI Learning Resources
        • Industry Reports and White Papers
    • Large Language Models (LLMs)
      • ChatGPT
        • GPTs
        • Create your own GPTs
      • Claude
      • Gemini
      • How to use an opensource LLM locally?
      • Prompt Engineering
      • Understand Token
      • Hallucination Problem
    • Image Generation/Synthesis
    • AI Aggregator/AI Hub
    • AI Ethics/Copyright
    • Policy and Regulation
    • AI and Work
  • Industries & Cases
    • 🤖Robotics/Embodied Intelligence
      • Humanoid Robots
        • Figure AI
        • Boston Dynamics
        • Tesla Optimus
      • Industrial Robot
      • Four-legged Robot
      • Service Robots
      • Field Robots
    • 🖱️AI Browser / AI Search Engine
      • Perplexity
    • ℹ️News/Info Gathering
      • Case: how to use Grok to get the trending topic on X
    • 🔍Scholar & Research
      • Scholar & Research Prompts
      • Research & Scholar GPTs
      • Research Examples using Perplexity
    • ✍️Writing
      • Writing Prompts
      • AI Detector & Text Humanizer
      • Novel Writing
    • 👩‍💻Data Analysis
      • Case: Los Angeles Rams Harness AI for Competitive Edge
    • 🚗Autonomous Driving
      • Robotaxi & Robovan
        • Tesla
        • Waymo
        • Case: Challenges in Implementing Baidu's Apollo Go Autonomous Driving Business
    • Marketing
      • Tools
      • 📱Social Media
        • Social Media Prompts
        • Case: Meta Advantage+
      • 🤩Branding
        • Branding Prompts
    • 📎Customer Service
      • Customer Service Prompts
      • Online Shopping Assistants
    • Legal
    • 💽Software Development Tools/Assistants/Agents
      • Website Development
      • GPTs for Coding
      • Cursor Resources & Cases
      • AI App Development Tool
      • Product Development
    • 🕹️Game Development
      • AI NPC
      • 🎮Gaming Assistant
        • Case: OpenAI Five
    • 🖋️Smart Working
      • PDF
      • Excel
      • Slides
      • AI Mindmap
      • AI Meeting
      • Note-taking
    • 🎙️Podcasting
      • Podcast Summaries
      • AI-Generated News Podcasting Cases
    • 👥Virtual Avatar / Digital Human
      • Case: Ai.lonso
    • 📐Industrial drafting and modeling
    • Startups
    • 🏛️Corporate Management
      • Management Prompts
      • Process Management
    • 🤝Negotiation
    • 🎓Education
      • AI Education Tools
        • AI STEAM Learning Tools
        • AI Language Learning Tools
      • Education Prompt Examples
    • 🫂AI Companion
    • Sales/Business Development
    • 🚚Supply Chain
      • Manufacturing & Production
      • Warehousing & Logistics
    • Spatial Computing/3D
      • Metaverse & Digital Twin
    • Translation
    • AI + Finance
      • Fraud Detection
    • E-commerce
    • Biotechnology and Medicine
    • Agriculture
    • Jobs & Interview
  • AI Agent/Apps
    • 🪪What Are AI Agents?
      • Case: Altera Project Sid
      • Case: Generative Agents: Interactive Simulacra of Human Behavior
    • 👾AI Agent Tools
      • Make.com Tutorial
      • Microsoft AutoGen
      • Crew AI
    • Agent to Agent Protocol
      • Model Context Protocol (MCP)
      • Google A2A
    • AI Assistant
    • Computer Use Agent
    • AI App Development Tool
  • Image
    • 🪄Text to Image / Image Editing Tools
      • GPT-4o Image Collection
      • Midjourney Guide
        • Midjourney Cases
      • Grok 2 w/ Flux
      • Comfy UI
    • 🖌️Comics & Manga
    • AI Image Detector
    • Create Avatar
  • Video & Films
    • 🎥GenAI Film Making
      • Veo 3 Cases
      • Sora
      • Runway
        • Runway AIFF
    • AI Video Case Studies
      • ASML Ad: Midjourney+Stable Diffusion+Runway
      • ToysRus New AI Commercial
      • CocaCola 2024 Christmas Ad
      • McDonald JP Ad
    • 🎛️AI Video Editing Tools/Platforms
      • Eyesight Correction
      • AI Video Clipping
  • Sound & Music
    • 🎤Voice Generation Tools
      • ElevenLabs Tutorial
    • 🎼Music Generation
      • Suno Guide
    • 🪇Sound Effect
  • AI Hardware
    • 🕶️AI Glasses
      • Case: Meta Glasses
    • 📱AI Cell Phone & System
      • Apple Intelligence
      • Samsung OneUI
      • Google Gemini/Android
    • 🖲️AI Gadgets
    • AI Computer
  • Other
    • 😀About
    • 💭Contact
    • 🗒️Changelog
    • Terms of Service
Powered by GitBook
On this page
  • What is AI Hallucination?
  • Causes of AI Hallucinations
  • Hallucination Leaderboard
  • Some Real Life Cases:
  • Mitigation Techniques
  • Hallucination Detector

Was this helpful?

  1. INTO AI
  2. Large Language Models (LLMs)

Hallucination Problem

PreviousUnderstand TokenNextImage Generation/Synthesis

Last updated 1 month ago

Was this helpful?

What is AI Hallucination?

AI hallucination occurs when a Generative AI Model generates false, misleading, or illogical information and presents it as fact. This issue arises because LLMs are trained on vast amounts of text data and use statistical patterns to predict the next word in a sequence, rather than understanding the underlying reality of the content they generate.

Causes of AI Hallucinations

There are several reasons for AI hallucinations. Some large models suffer from insufficient training data, while others have low-quality training data containing errors. Additionally, some models were trained earlier and lacked updated information. User prompts that are unclear can also lead to hallucinations. Furthermore, chatbots designed to maintain conversation flow might generate inaccurate information in the absence of correct data (it is easy for an LLm) to fulfill their primary task of keeping the dialogue going.

Hallucination Leaderboard

Some Real Life Cases:

Case 1: Lawyer Citing Fake Cases Generated by ChatGPT

Case 2: AI-Generated Misinformation

AI-generated false information can spread rapidly, leading to public misunderstanding. An example of this occurred with Google’s Bard chatbot, which incorrectly claimed that the James Webb Space Telescope had taken the first image of an exoplanet. This misinformation quickly disseminated after its release, demonstrating the potential for AI to propagate erroneous news​.

These cases underscore the importance of rigorous verification processes and the responsible use of AI technology to prevent the spread of false information and ensure the integrity of legal documents.

Mitigation Techniques

  1. Use Better Large Language Models: Whenever possible, utilize more powerful large models, as they tend to produce fewer hallucinations.

  2. Use AI Search Engines: Employ applications like Perplexity, which are optimized for search tasks using large models. These applications provide answers based on internet data content. (*Although they can still have hallucinations, the chances are significantly lower.)

  3. Human-in-the-Loop: Have humans meticulously verify AI-generated indexes, cases, and other content.

  4. Refine Your Prompts: Use more detailed and sophisticated prompts and prompt structures to obtain more accurate results.

  5. Use Advanced Reasoning LLM: Choose models specifically trained or fine-tuned for logical reasoning and step-by-step problem-solving tasks, such as GPT-4 with chain-of-thought (CoT) prompting or Claude 3, which systematically reduce hallucinations through explicit reasoning processes.

Here is a page on the OpenAI Cookbook that provides guidelines and examples for developing guardrails to prevent hallucinations in AI models.

In summary, always meticulously verify any content generated by large models before using it.


Hallucination Detector

In 2023, a New York lawyer, Steven Schwartz, used ChatGPT to draft a legal brief for a personal injury case. The brief included several fictitious court cases fabricated by the AI, which opposing counsel could not verify. When challenged, Schwartz and his colleague, Peter LoDuca, doubled down on their assertions until the court ordered them to provide the cases. Upon failing to do so, they were fined $5,000. Furthermore, the court mandated that any future filings involving generative AI content must explicitly disclose such use to ensure accuracy checks​ ()​​ ()​.

ABA Journal
Law.com
https://demo.exa.ai/hallucination-detector
https://github.com/vectara/hallucination-leaderboard
Google’s AI chatbot Bard makes factual error in first demoThe Verge
Developing Hallucination Guardrails | OpenAI Cookbook
Logo
Logo