Hallucination
A hallucination occurs when a large language model (LLM) generates information that is incorrect, fabricated, or not grounded in reality, yet presents it as if it were true.
In short: confident-sounding nonsense or unsupported claims.
Causes
Hallucination is not a bug—it’s a byproduct of how LLMs are trained.
- Probabilistic Nature
  - LLMs predict the next most likely token, not the "true" answer
  - No built-in fact-checking mechanism
- Training Objective Mismatch
  - The objective is to minimize prediction error (loss), not to ensure factual correctness
- Incomplete / Noisy Training Data
  - Training data may contain errors, conflicting information, and gaps in knowledge
- Lack of Grounding
  - No real-time connection to databases, external APIs, or verified sources (unless explicitly integrated)
- Overgeneralization
  - Models pattern-match aggressively and fill in missing details from learned patterns, which can produce fabrications
- Prompt Ambiguity
  - Vague or underspecified prompts force the model to guess
Types of Hallucination
- Factual Hallucination
  - Incorrect facts, e.g., wrong dates or wrong definitions
- Fabricated Content
  - Completely made-up information, e.g., fake papers, fake APIs, fake commands
- Contextual Hallucination
  - Contradicts the given input/context, e.g., ignoring constraints in the prompt
- Logical Hallucination
  - Reasoning errors, e.g., flawed step-by-step logic
- Citation Hallucination
  - Fake references or sources; common in academic-style answers
- Instruction Drift
  - Ignores or only partially follows instructions
How to Detect & Evaluate Hallucination
Human Evaluation (Most Reliable)
- Verification by a domain expert
- Checklist:
  - Is it factually correct?
  - Is it supported by evidence?
  - Does it follow instructions?
Automatic Evaluation Methods
a. Ground Truth Comparison
- Compare the output against a known correct answer
- Metrics: Exact Match (EM), F1 score
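A minimal sketch of both metrics, in the SQuAD style; `normalize` here only lowercases and splits, whereas real evaluators also strip punctuation and articles.

```python
# Exact Match and token-level F1 between a model answer and a reference.
from collections import Counter

def normalize(text: str) -> list[str]:
    # Simplified normalization: lowercase and split into tokens.
    return text.lower().split()

def exact_match(prediction: str, reference: str) -> bool:
    return normalize(prediction) == normalize(reference)

def f1_score(prediction: str, reference: str) -> float:
    pred, ref = normalize(prediction), normalize(reference)
    common = Counter(pred) & Counter(ref)   # token overlap with multiplicity
    overlap = sum(common.values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(pred)
    recall = overlap / len(ref)
    return 2 * precision * recall / (precision + recall)

print(exact_match("Paris", "paris"))              # True
print(f1_score("the capital is Paris", "Paris"))  # 0.4
```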
b. Retrieval-Based Verification
- Cross-check claims against trusted sources (as in RAG)
- If a claim is unsupported, it is likely a hallucination
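A hedged sketch of the idea: flag answer sentences with no lexical support in the retrieved documents. The function names are my own, and the word-overlap heuristic is a stand-in for the entailment (NLI) model a real system would use.

```python
def support_score(sentence: str, documents: list[str]) -> float:
    # Fraction of the sentence's words found in the best-matching document.
    words = set(sentence.lower().split())
    if not words or not documents:
        return 0.0
    return max(
        len(words & set(doc.lower().split())) / len(words)
        for doc in documents
    )

def unsupported_sentences(answer: str, documents: list[str],
                          threshold: float = 0.5) -> list[str]:
    # Sentences scoring below the threshold are likely unsupported claims.
    sentences = [s.strip() for s in answer.split(".") if s.strip()]
    return [s for s in sentences if support_score(s, documents) < threshold]
```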
c. Self-Consistency Check
- Ask the model the same question multiple times
- If the answers vary, reliability is low
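A minimal sketch, assuming a hypothetical `ask_model` callable that sends a prompt and returns the answer as a string:

```python
from collections import Counter

def self_consistency(ask_model, prompt: str, n: int = 5):
    # Sample the same prompt n times and measure agreement.
    answers = [ask_model(prompt) for _ in range(n)]
    top, count = Counter(answers).most_common(1)[0]
    agreement = count / n  # 1.0 = fully consistent across runs
    return top, agreement

# Usage: answer, agreement = self_consistency(ask_model, "Who wrote Hamlet?")
# Low agreement (e.g., below 0.6) suggests the answer is unreliable.
```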
d. LLM-as-a-Judge
- Use another model to verify factual accuracy, grounding, and instruction adherence
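A sketch of such a check; `call_judge_model` is a hypothetical stand-in for your own client call, and the JSON verdict format is an assumption, not a standard.

```python
import json

JUDGE_TEMPLATE = (
    "You are a strict fact-checker.\n"
    "Question: {question}\n"
    "Answer to verify: {answer}\n"
    'Reply only with JSON: {{"supported": true or false, "reason": "..."}}'
)

def judge(call_judge_model, question: str, answer: str) -> dict:
    # Ask a second model for a structured verdict on the first model's answer.
    raw = call_judge_model(JUDGE_TEMPLATE.format(question=question, answer=answer))
    return json.loads(raw)  # production code should handle malformed JSON
```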
Across all methods, look for patterns like:
- Overly confident tone with no evidence
- Specific details without sources
- Inconsistent answers across runs
- “Looks right” but unverifiable
How to Reduce Hallucination
Prompt Engineering
Good practices:
- Be specific and constrained
- Ask for uncertainty handling, e.g.:
  "If you are unsure, say 'I don't know'. Only answer based on the provided context."
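A minimal sketch of how such instructions can be baked into a prompt template; the function name and wording are illustrative.

```python
def build_prompt(context: str, question: str) -> str:
    # Constrain the model to the supplied context and permit abstention.
    return (
        "Only answer based on the provided context. "
        'If you are unsure, say "I don\'t know".\n\n'
        f"Context:\n{context}\n\nQuestion: {question}"
    )
```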
Retrieval-Augmented Generation (RAG)
Inject external knowledge into the prompt to ground responses in real data.
Flow:
1. Retrieve relevant documents.
2. Provide documents as context.
3. Generate an answer based on that context.
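A minimal sketch of that flow; `retrieve` and `generate` are hypothetical stand-ins for a vector-store query and an LLM call.

```python
def rag_answer(retrieve, generate, question: str, k: int = 3) -> str:
    docs = retrieve(question, k=k)   # 1. Retrieve relevant documents
    context = "\n\n".join(docs)      # 2. Provide them as context
    prompt = (
        "Answer using only the context below. "
        'If the context is insufficient, say "I don\'t know".\n\n'
        f"Context:\n{context}\n\nQuestion: {question}"
    )
    return generate(prompt)          # 3. Generate a grounded answer
```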
Tool Use / Function Calling
Allow the model to interact with external systems, replacing guesswork with real-time data retrieval:
- APIs
- Databases
- Search engines
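A minimal sketch of the tool-call loop: the model emits a structured call, and the application executes it. The JSON call format and the `get_weather` function are invented for illustration, not any specific vendor's API.

```python
import json

def get_weather(city: str) -> str:
    # Stand-in for a real API call.
    return f"18°C and cloudy in {city}"

TOOLS = {"get_weather": get_weather}

def run_tool_call(model_output: str) -> str:
    # Model output is expected to look like:
    # {"tool": "get_weather", "args": {"city": "Paris"}}
    call = json.loads(model_output)
    return TOOLS[call["tool"]](**call["args"])

print(run_tool_call('{"tool": "get_weather", "args": {"city": "Paris"}}'))
```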
Fine-Tuning / Instruction Tuning
Train the model on specialized datasets built from:
- High-quality examples.
- Domain-specific knowledge.
- Verified data.
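For concreteness, a hedged sketch of what one training record might look like in the chat-style JSONL format several platforms accept; the field names and content are illustrative, and exact formats vary by provider.

```python
import json

record = {
    "messages": [
        {"role": "system", "content": "Answer only from the verified product manual."},
        {"role": "user", "content": "Does model X-200 support PoE?"},
        {"role": "assistant", "content": "The manual does not say; I don't know."},
    ]
}
print(json.dumps(record))  # one record per line in the training file
```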
Output Constraints
Use structured outputs to force the model to be more precise:
- JSON schema enforcement.
- Required field definitions.
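A sketch of post-hoc validation with the `jsonschema` package; the schema itself is an example. Native schema enforcement at generation time depends on your provider, but validation like this works anywhere.

```python
import json
from jsonschema import validate  # pip install jsonschema

SCHEMA = {
    "type": "object",
    "properties": {
        "answer": {"type": "string"},
        "confidence": {"type": "number", "minimum": 0, "maximum": 1},
    },
    "required": ["answer", "confidence"],
}

def parse_model_output(raw: str) -> dict:
    data = json.loads(raw)                   # rejects non-JSON output
    validate(instance=data, schema=SCHEMA)   # rejects missing/invalid fields
    return data
```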
Post-Processing Validation
Add a verification layer after the model generates a response:
- Rule-based checks (e.g., regex, data types).
- External validation (e.g., running code, checking a database).
- Secondary model review (using a critic model).
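A sketch of a rule-based layer; both rules are illustrative examples, not an exhaustive validator.

```python
import re

def check_answer(answer: str) -> list[str]:
    issues = []
    # Rule: any four-digit year mentioned should be plausible.
    for year in re.findall(r"\b(\d{4})\b", answer):
        if not 1000 <= int(year) <= 2100:
            issues.append(f"implausible year: {year}")
    # Rule: numeric citations like [1] must come with a reference list.
    if re.search(r"\[\d+\]", answer) and "References" not in answer:
        issues.append("numeric citation without a reference list")
    return issues
```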
Confidence & Citations
Improve transparency by asking the model to:
- Provide sources for its claims.
- Estimate confidence levels in its answers.
Example instruction: "Include references for each claim. If no source is available, say so."
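A hedged sketch of that instruction as a reusable template; the JSON shape is an assumption, not a standard.

```python
CITATION_TEMPLATE = (
    "Answer the question using only the provided context.\n"
    "Include a reference for each claim; if no source is available, say so.\n"
    'Return JSON: {{"answer": "...", "citations": ["..."], "confidence": 0.0}}\n\n'
    "Context:\n{context}\n\nQuestion: {question}"
)

def build_citation_prompt(context: str, question: str) -> str:
    # Doubled braces above keep the JSON example literal under .format().
    return CITATION_TEMPLATE.format(context=context, question=question)
```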