Skip to main content

Blog

Notes on ML, tooling, and building.

Jan 20, 202520 min read

What Actually Happens Inside LLMs When You Use RL?

We peeked under the hood to see how reinforcement learning changes what's going on inside language models. Spoiler: it's way cooler than we thought.

Dec 15, 202418 min read

Can Moderation Help Multi-LLM Cooperation?

What happens when you add a neutral moderator to help LLMs cooperate in strategic games? Spoiler: it works way better than you'd think.