We scan new podcasts and send you the top 5 insights daily.
ChatGPT's tendency to use words like 'delve' isn't random. Its training creates a bias toward Latin-derived words over their simpler Germanic counterparts (e.g., 'dig into') because those words register as more formal and authoritative in the patterns the model learned.
An LLM's core training objective—predicting the next token—makes it sensitive to the raw frequency of words and numbers online. This creates a subtle but profound flaw: it's more likely to output '30' than '29' in a counting task, not because of logic, but because '30' is statistically more common in its training data.
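A minimal sketch of that frequency effect. The counts below are invented for illustration, and real models sample from a learned distribution rather than a raw corpus tally; this just shows how a frequency-driven picker lands on the round number regardless of what the task logically requires:

```python
from collections import Counter

# Invented corpus counts: round numbers appear far more often in text.
corpus_counts = Counter({"29": 120, "30": 950, "31": 100})

def greedy_next_token(counts: Counter) -> str:
    """Pick whichever token is most frequent, ignoring task logic."""
    return counts.most_common(1)[0][0]

print(greedy_next_token(corpus_counts))  # "30", purely on frequency
```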
MIT research reveals that large language models develop "spurious correlations" by associating sentence patterns with topics. This cognitive shortcut causes them to give domain-appropriate answers to nonsensical queries if the grammatical structure is familiar, bypassing logical analysis of the actual words.
Under intense optimization pressure from reinforcement learning, some language models are developing their own idiosyncratic dialects to communicate internally. This phenomenon suggests they are drifting beyond merely predicting the human language patterns found on the internet.
Newer LLMs exhibit a more homogenized writing style than earlier versions like GPT-3. This is due to "style burn-in," where training on outputs from previous generations reinforces a specific, often less creative, tone. The model’s style becomes path-dependent, losing the raw variety of its original training data.
Current AI models often provide long-winded, overly nuanced answers, a stark contrast to the confident brevity of human experts. This stylistic difference, not factual accuracy, is now the easiest way to distinguish AI from a human in conversation, suggesting a new dimension to the Turing test focused on communication style.
AI models are not optimized to find objective truth. They are trained on biased human data and reinforced to provide answers that satisfy the preferences of their creators. This means they inherently reflect the biases and goals of their trainers rather than an impartial reality.
AI models develop strong 'habits' from training data, leading to unexpected performance quirks. The Codex model is so accustomed to the command-line tool ripgrep (whose binary is named 'rg') that its performance improves significantly when developers name their custom search tool 'rg', revealing a surprising lack of generalization.
The tendency for AI models to overuse em dashes may stem from their training data. To expand their knowledge, companies digitized millions of older books, including 19th-century classics where dash usage was at its historical peak. The models simply adopted this stylistic habit.
Beyond the obvious scarcity of non-English training data, large language models are architecturally biased. Their tokenization, optimized for English, breaks other languages into many more fragments per word. This raises operational costs and reduces comprehension, creating a structural disadvantage for non-English users.
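A rough way to see the fragmentation cost: byte-level tokenizers fall back to raw UTF-8 bytes when no learned merges cover a script, so the worst-case token count scales with bytes, not characters. This is a lower-bound sketch under that assumption, not a model of any specific tokenizer:

```python
# Sketch: English-centric tokenizers fragment other scripts more.
# With no BPE merges for a script, each character costs 1-4 raw
# UTF-8 bytes, so non-ASCII text starts from a far higher floor.
def utf8_fallback_tokens(text: str) -> int:
    """Worst-case token count: one token per UTF-8 byte."""
    return len(text.encode("utf-8"))

print(utf8_fallback_tokens("hello"))  # 5 (ASCII: 1 byte per char)
print(utf8_fallback_tokens("こんにちは"))  # 15 (5 chars, 3 bytes each)
```

English-heavy merge tables then compress the ASCII side aggressively while leaving other scripts near this floor, which is where the cost gap comes from.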
Contrary to popular belief, generative AI like LLMs may not get significantly more accurate. As statistical engines that predict the next most likely word, they lack true reasoning or an understanding of "accuracy." This fundamental limitation means they will always be prone to making unfixable mistakes.