Hey folks,

I’m a professional procrastinator — I need to ship this course and re-write my fundraising deck for fund II, buuuut yesterday I finally built something I’ve wanted for a while.

I loved Superhuman, I used it for many years ($40/mo!) but all I really loved was split inboxes based on labels, and how quick and nice it is to use.

I don’t need (…yet? ever?) AI to read all my emails, draft replies, chase me to do things and be a PA for me, I’ve had PA’s and I always let them go.

I just want email rules - if this address is labelled ‘pitch’ (for PR pitches), archive it. If no label - it’s ‘important’ and needs a reply for me, if ‘investing’ it’s from an LP or portfolio founder, if ‘newsletter’ archive it (don’t do this or you wont see these! 😊).

Gmail has filters and labels but it’s limited and just can’t give me the UI I want to work in. So naturally, I built my own. I’ll send a ‘Ben’s Builds’ email on Saturday with more details on what I actually did. It took me ~2 hours for the first version.

I was ‘pushed’ to this by seeing Dan Shipper’s Codex-native email workflow - but again, I don’t need most of the stuff he’s doing. Email is very different for everyone. But now I can completely customise my experience…If I want daily briefs to summarise all my newsletters - my agent can do it, if I want automated actions - my agent can, and so on.

So if you send me an email, it is me who will reply - but my agent may have you labelled and organised to make it easier for me to respond.

Personal software can be built by anyone!


Ben’s Bites is brought to you by Attio

GTM Atlas is the map for modern go-to-market.

Curated by Attio. Mapped by operators.

Read now

Fun fact: Attio’s founder, Nicolas was interviewed by me in Sept 2020 for my previous company’s podcast. Attio is very good software!! I should’ve invested at the time 😩. But for now… I’m going to build something with it 😈


  • Free users in ChatGPT are now on “GPT-5.5 Instant” - a new model that replaces GPT-5.3 Instant. It’s significantly better at vision, understanding PDFs, web search and using your memories and past chats smartly. Its responses are also shorter in general with less emojis. It also hallucinates 52.5% less than the previous model on high-stakes prompts.

    Though recently I’ve been recommending Codex to friends with free plans of ChatGPT a lot. Yep, Codex is available on free plans. It takes them some time to understand the concept of how reading-writing a file on computer unlocks much more capability but in each case, they have come back after a day or two saying “we’re addicted to using codex”.

    Keshav

  • You can now use twice as much of Claude on all paid plans. How? Anthropic signed a deal to use all of SpaceX’s Colossus 1 data centre. (I guess no one needs/uses Grok)

  • Code with Claude was a bit meh! The only new launch they did was introducing some features in Claude Managed Agents -

    • Dreaming - Review past chats and save memories from them.

    • Outcomes - Describe what success looks like, and a grader will judge the agent’s work.

    • Multi-agent orchestration - Let a lead agent break the job into pieces and delegate to specialist subagents.

  • Posthog is building a code editor. Not literally, but they are making a Codex-like app that uses the data (like product usage patterns, bugs observed, errors in logs etc.) as the primary signal to code/build stuff. Here’s how they are thinking about the self-driving product loop.



  • Gravitee makes APIs agent-ready, helping teams govern APIs, events, and AI Agents while reducing silos and cost.*

  • Skills by Entire to teach agents to explain code, search prior session context, investigate why a change happened and hand work off between agents.

  • pookie - Slack helper to search messages across your workspace. It also generates memes, and connect to tools like Linear, GitHub and Stripe.

  • The lines between vibe coding and agentic engineering are starting to blur.

  • Clicky can now click, save ideas/links/inspo and run Gmail, Calendar and Drive by voice.

  • deepsec - security harness for finding vulnerabilities in your codebase.

  • Raindrop Triage - an agent to debug your agents already in production. Also works via MCP.

  • Prime Intellect Lab lets you fine-tune your own models ranging from 1B to 400B params.

  • How we improved agentic search.

  • Everyone should have an OPINIONS.md

  • Gemini API’s File Search can now search over images & audio i.e. finding 2-3 relevant images from big folder based on what’s in the image (not its name).

  • @supabase/server - public beta package for server-side auth verification, client setup and request context across Edge Functions, Cloudflare Workers, Hono and Bun.

  • Anthropic released 10 finance agent templates for pitchbooks, KYC screening, valuation reviews, month-end close and more. They run as Claude Cowork/Claude Code plugins or Managed Agents cookbooks.

  • The “AI Job Apocalypse” is a complete fantasy.

  • The artistry of text-to-speech models.

  • Dharmesh says HubSpot’s goal is full API parity with the UI: agents can run on HubSpot, and agents can run HubSpot. More headless SaaS / AUX energy.

  • How to use Codex for knowledge work


Share Ben's Bites


* sponsors who make this newsletter possible :)
Wanna partner with us for the next quarter?