🐍 Newsletters
AI Snake Oil
14 min read
AI leaderboards are no longer useful. It's time to switch to Pareto curves.
What spending $2,000 can tell us about evaluating AI agents
Explore the latest AI news and research tagged #cost-efficiency — curated from top sources including OpenAI, Anthropic, Google DeepMind, and more.
What spending $2,000 can tell us about evaluating AI agents