The choice of the term “hallucination” to describe what is happening with these language models is fundamentally misleading, and feels calculated to obscure their probabilistic nature.
Characterizing the fabricated output as “hallucination” implies that correct output is “knowledge”: that the incorrect results are aberrations, not products of the exact same blind process that sometimes happens to produce factually sound results.
You could argue that "hallucination" is actually the more accurate description: these systems literally have no mechanism to separate facts from falsehoods. They have no intent to lie or to tell the truth, and they can't represent those concepts at all.
Humans recognize hallucinations as wrong because they have systems in the brain that say "that can't have been real".
LLMs can't recognize their own falsehoods because they have no referent for "real".
LLMs can’t “recognize” anything. They can’t “perceive” anything. That’s why using sensory-oriented terminology (like “hallucination”) for LLMs is misleading and incorrect: it’s wrong both about what human hallucinations are and about what’s going on in an LLM.
It’s more like when Trump is rambling on in one of his speeches, just stringing together phrases and thoughts haphazardly. So I’d like to propose that it be called “trumpeting”.