© 2026 RiffOn. All rights reserved.

We Taught AI to Play Games—Now It’s a $3.6 Million Company

AI & I · Oct 15, 2025

Alex from Every spins out Good Star Labs, a company using games like Diplomacy to test, train, and improve AI models' strategic capabilities.

AI Game Environments Reveal Deeper Model Traits Static Benchmarks Miss

Static benchmarks are easily gamed. Dynamic environments like the game Diplomacy force models to negotiate, strategize, and even lie, offering a richer, more realistic evaluation of their capabilities beyond pure performance metrics like reasoning or coding.

Building with LLMs Involves Navigating Three "Infinite Problem Spaces"

Developing LLM applications requires solving for three infinite variables: how information is represented, which tools the model can access, and the prompt itself. This makes the process less like engineering and more like an art, where intuition guides you to a local maximum rather than a single optimal solution.

Games Demystify AI for the Public by Showcasing Its Flaws and Strategies

When Good Star Labs streamed its AI Diplomacy game on Twitch, it attracted 50,000 viewers from the gaming community. Watching AIs make mistakes, betray allies, and strategize made the technology more relatable and less intimidating, helping to bridge the gap between AI experts and the general public.

Subjective Games like 'Cards Against Humanity' Target AI Models' Humor Deficit

Good Star Labs' next game will be a subjective, 'Cards Against Humanity'-style experience. This is a strategic move away from objective games like Diplomacy to specifically target and create training data for a key LLM weakness: humor. The goal is to build an environment that improves a difficult, subjective skill.

Training Vision Models on Games Can Unexpectedly Improve Their Math Skills

A Rice PhD showed that training a vision model on a game like Snake, while prompting it to see the game as a math problem (a Cartesian grid), improved its math abilities more than training on math data directly. This highlights how abstract, game-based training can foster more generalizable reasoning.
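To make the "game as a Cartesian grid" framing concrete, here is a minimal sketch of how a Snake position might be described in coordinate terms before being shown to a model. It is not the method from the research mentioned above: the `SnakeState` class, the `describe_as_grid` helper, the bottom-left origin, and the wording of the description are all assumptions made for illustration.

```python
from dataclasses import dataclass

# (x, y) cell on the board; placing the origin at the bottom-left is an assumption.
Point = tuple[int, int]


@dataclass(frozen=True)
class SnakeState:
    """Hypothetical container for one Snake position."""
    width: int
    height: int
    body: tuple[Point, ...]  # head first, tail last
    food: Point


def manhattan(a: Point, b: Point) -> int:
    """Grid distance between two cells, moving only along the axes."""
    return abs(a[0] - b[0]) + abs(a[1] - b[1])


def describe_as_grid(state: SnakeState) -> str:
    """Render the position as coordinate facts rather than as a picture."""
    head = state.body[0]
    dx = state.food[0] - head[0]
    dy = state.food[1] - head[1]
    lines = [
        f"Grid: 0 <= x < {state.width}, 0 <= y < {state.height}.",
        f"Snake head at {head}; body cells: {list(state.body[1:])}.",
        f"Food at {state.food}.",
        f"Offset from head to food: (dx, dy) = ({dx}, {dy}).",
        f"Manhattan distance to food: {manhattan(head, state.food)}.",
    ]
    return "\n".join(lines)


# Illustrative position only; the numbers are made up.
state = SnakeState(width=10, height=10, body=((2, 3), (2, 2), (2, 1)), food=(5, 7))
prompt = describe_as_grid(state)
print(prompt)
```

Framed this way, choosing a move becomes a question about offsets and distances on a grid, which is the kind of reformulation the card describes.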

Prompt Optimization Can Drastically Alter an AI Model's Performance Rankings

Good Star Labs found GPT-5's performance in their Diplomacy game skyrocketed with optimized prompts, moving it from the bottom to the top. This shows a model's inherent capability can be masked or revealed by its prompt, making "best model" a context-dependent title rather than an absolute one.
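The point that a ranking depends on the prompt can be shown with a small sketch. The model names and scores below are made-up placeholders, not Good Star Labs' results, and `rank_models` is a hypothetical helper written only for this example.

```python
def rank_models(scores: dict[str, float]) -> list[str]:
    """Return model names ordered from highest to lowest score."""
    return sorted(scores, key=scores.get, reverse=True)


# Placeholder scores for illustration only; not real evaluation data.
scores_by_prompt = {
    "baseline": {"model_a": 0.62, "model_b": 0.55, "model_c": 0.31},
    "optimized": {"model_a": 0.64, "model_b": 0.58, "model_c": 0.71},
}

# One leaderboard per prompt variant.
rankings = {name: rank_models(scores) for name, scores in scores_by_prompt.items()}

for name, order in rankings.items():
    print(f"{name}: {order}")
```

In this toy data, `model_c` is last under the baseline prompt and first under the optimized one, so any claim about the "best model" has to state which prompt produced the ranking.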

Good Star Labs' Business Model Uses AI Games for B2B Evaluation and Training

Good Star Labs is not a consumer gaming company. Its business model focuses on B2B services for AI labs. It uses games like Diplomacy to evaluate new models, generate unique training data to fix model weaknesses, and collect human feedback, creating a powerful improvement loop for AI companies.

During Tech Shifts, Hire Hungry Juniors Over Experts Weighed Down by Old Methods

In a paradigm shift like AI, an experienced hire's knowledge can become obsolete. It's often better to hire a hungry junior employee. Their lack of preconceived notions, combined with a high learning velocity powered by AI tools, allows them to surpass seasoned professionals who must unlearn outdated workflows.
