🐍 Newsletters
AI Snake Oil
7 min read
New paper: AI agents that matter
Rethinking AI agent benchmarking and evaluation
Rethinking AI agent benchmarking and evaluation