Research in Recommendation Systems (RecSys) and Information Retrieval (IR) is described as uniquely unintuitive. Feedback from the modeling environment feels "rude" and disconnected from one's actions, as if the cause-and-effect principles that hold in other ML domains were simply absent.
Making an API usable for an LLM is a novel design challenge, analogous to creating an ergonomic SDK for a human developer. It's not just about technical implementation; it requires a deep understanding of how the model "thinks," which is a difficult new research area.
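For concreteness, here is a hedged sketch of what "ergonomic for an LLM" can mean in practice: the same search capability exposed as two JSON-schema-style tool definitions, one terse and one self-describing. The names and fields below are illustrative assumptions, not any particular vendor's tool-calling format.

```python
# Hostile to an LLM: cryptic name, undocumented single-letter parameters,
# implicit defaults the model has to guess at.
search_v1 = {
    "name": "srch",
    "parameters": {"q": "string", "m": "int", "f": "int"},
}

# Friendlier to an LLM: self-describing name, documented parameters,
# and a description that states when the tool should be used.
search_v2 = {
    "name": "search_knowledge_base",
    "description": (
        "Full-text search over the internal knowledge base. "
        "Use this before answering questions about company policy."
    ),
    "parameters": {
        "type": "object",
        "properties": {
            "query": {
                "type": "string",
                "description": "Natural-language search query.",
            },
            "max_results": {
                "type": "integer",
                "description": "Number of hits to return (1-20).",
                "default": 5,
            },
        },
        "required": ["query"],
    },
}
```

Both definitions describe the same endpoint; the second simply front-loads the context a model needs at call time, much as a good SDK front-loads it for a human reader.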
Unlike traditional engineering, breakthroughs in foundational AI research often feel binary. A model can be completely broken until a handful of key insights are discovered, at which point it suddenly works. This "all or nothing" dynamic makes it impossible to predict timelines, as you don't know if a solution is a week or two years away.
People struggle with AI prompts because the model lacks background on their goals and progress. The solution is 'Context Engineering': creating an environment where the AI continuously accumulates user-specific information, materials, and intent, reducing the need for constant prompt tweaking.
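A minimal sketch of that idea, assuming a simple in-app memory: the application records goals, materials, and progress as the user works, then injects the accumulated context ahead of each bare request. `ContextStore` and `build_prompt` are hypothetical names for illustration, not an established API.

```python
from dataclasses import dataclass, field

@dataclass
class ContextStore:
    goals: list[str] = field(default_factory=list)      # what the user wants
    materials: list[str] = field(default_factory=list)  # docs, snippets, links
    history: list[str] = field(default_factory=list)    # decisions made so far

    def remember(self, kind: str, item: str) -> None:
        # kind is one of "goals", "materials", "history"
        getattr(self, kind).append(item)

def build_prompt(store: ContextStore, request: str) -> str:
    """Prepend accumulated context so the bare request needs no re-tweaking."""
    return (
        "User goals:\n- " + "\n- ".join(store.goals) + "\n\n"
        "Relevant materials:\n- " + "\n- ".join(store.materials) + "\n\n"
        "Progress so far:\n- " + "\n- ".join(store.history) + "\n\n"
        f"Current request: {request}"
    )

store = ContextStore()
store.remember("goals", "Migrate the billing service to Postgres")
store.remember("materials", "billing_schema.sql (current schema dump)")
store.remember("history", "Schema draft approved on Tuesday")
print(build_prompt(store, "Write the next migration script."))
```

The user's final message stays one line; the environment, not the prompt, carries the context.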
AI performs poorly in areas where expertise is based on unwritten 'taste' or intuition rather than documented knowledge. If the correct approach doesn't exist in training data or isn't explicitly provided by human trainers, models will inevitably struggle with that particular problem.
Even with access to user data from apps like Gmail, LLMs are struggling to deliver a deeply personalized, indispensable experience. This indicates that the challenge may be more than just connecting data sources; it could be a core model-level or architectural limitation preventing true user context lock-in and a killer application.
The common belief that AI can't truly understand human wants is debunked by existing technology. Adam D'Angelo points out that recommender systems on platforms like Instagram and Quora are already far better than any individual human at predicting what a user will find engaging.
AI struggles to provide truly useful, serendipitous recommendations because it lacks any understanding of the real world. It excels at predicting the next word or pixel from its training data, but it can't grasp concepts like gravity or deep user intent, and such understanding is a prerequisite for truly personalized suggestions.
Developing LLM applications requires solving for three infinite variables: how information is represented, which tools the model can access, and the prompt itself. This makes the process less like engineering and more like an art, where intuition guides you to a local maximum rather than a single optimal solution.
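One way to picture the search over those three axes, as a hedged sketch: enumerate a handful of candidates per axis, score every combination with whatever offline eval you trust, and keep the best one found. The candidate lists are made up, and `evaluate` is a placeholder for a real evaluation harness.

```python
import itertools

representations = ["raw_text", "markdown_table", "json_chunks"]  # how info is shown
tool_sets = [["search"], ["search", "calculator"], []]           # what the model can call
prompts = ["terse_v1", "step_by_step_v2"]                        # system prompt variants

def evaluate(rep: str, tools: list[str], prompt: str) -> float:
    """Placeholder: score one configuration on a held-out task set."""
    return hash((rep, tuple(tools), prompt)) % 100 / 100.0  # fake score for the sketch

best = max(
    itertools.product(representations, tool_sets, prompts),
    key=lambda cfg: evaluate(*cfg),
)
print("best config found (a local maximum):", best)
```

Each list could be extended indefinitely, which is the point: you never exhaust the space, you only stop when intuition says the current combination is good enough.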
AI tailors recommendations to individual user history and inferred intent, such as being budget-minded versus quality-focused. This means there is no single, universal ranking; visibility depends on aligning with specific user profiles, not a monolithic algorithm.
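A toy illustration of why no universal ranking exists: scoring the same two items under two inferred-intent profiles flips the order. All feature names and weights below are invented for the example.

```python
items = {
    "budget_laptop":  {"price_value": 0.9, "build_quality": 0.3},
    "premium_laptop": {"price_value": 0.2, "build_quality": 0.95},
}

# Inferred intent expressed as per-user feature weights.
users = {
    "budget_minded":   {"price_value": 1.0, "build_quality": 0.2},
    "quality_focused": {"price_value": 0.1, "build_quality": 1.0},
}

def rank_for(user: dict[str, float]) -> list[str]:
    """Score each item as a weighted sum of its features under this user's intent."""
    score = lambda feats: sum(user[k] * v for k, v in feats.items())
    return sorted(items, key=lambda name: score(items[name]), reverse=True)

for name, profile in users.items():
    print(name, "->", rank_for(profile))
# budget_minded   -> ['budget_laptop', 'premium_laptop']
# quality_focused -> ['premium_laptop', 'budget_laptop']
```

Neither ordering is "the" ranking; visibility for an item depends entirely on which profile is doing the scoring.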
The central challenge for current AI is not merely sample efficiency but a more profound failure to generalize. Models generalize 'dramatically worse than people,' which is the root cause of their brittleness, inability to learn from nuanced instruction, and unreliability compared to human intelligence. Solving this is the key to the next paradigm.