Note from Andrey: this ep came out a week ago on RSS, but I was delayed posting it to youtube and therefore also Substack. My bad!

Our 238th episode with a summary and discussion of last week’s big AI news!

Recorded on 03/18/2026

Hosted by Andrey Kurenkov and Jeremie Harris

Feel free to email us your questions and feedback at andreyvkurenkov@gmail.com and/or hello@gladstone.ai

In this episode:

  • OpenAI released GPT-5.4 mini and nano with 400k-token context windows, higher per-token prices but claimed token-efficiency gains in Codex; nano is API-only and pitched for high-volume classification/data extraction despite a major price increase.
  • Mistral open-sourced the Small 4 model family (MoE, 119B total/6B active) combining reasoning, multimodal, and coding-agent capabilities, and announced Forge to help businesses train or post-train custom models.
  • Agent “operating system” competition intensified with Meta’s acquired Manus launching a local Mac agent, Nvidia announcing NeMo/“Open Shell” sandboxed agent runtime, and Nvidia also unveiling DLSS 5 plus major hardware forecasts including Groq LPU integration.
  • Business and safety updates included OpenAI shifting focus toward productivity/enterprise amid competition, Microsoft reorganizing Copilot and frontier-model efforts, Meta delaying its next model, China-linked ByteDance deploying large Nvidia clusters abroad, and new safety work on steganography, chain-of-thought faithfulness, fine-tuning defenses, cyber-attack evals, and constitution/spec compliance.

A thank you to our current sponsors:

  • Box - visit Box.com/AI to learn more
  • ODSC AI - go to odsc.ai/east and use promo code LWAI for an additional 15% off your pass to ODSC AI East 2026.

Timestamps:

  • (00:00:10) Intro / Banter
  • (00:01:56) News Preview
  • Tools & Apps
  • Applications & Business
  • Policy & Safety
  • Research & Advancements
  • (01:40:050) [[2603.15031] Attention Residuals](https://arxiv.org/abs/2603.15031)