the lede
may 9, 2026
Agent safety and operability took center stage: OpenAI detailed Codex’s sandboxed architecture and examined chain-of-thought monitoring tradeoffs, while Anthropic pushed another wave of Claude Code reliability fixes. On the tooling side, Next.js shipped a canary with meaningful tweaks, the Agents SDK defaulted to gpt-realtime-2, and zero-native promised tiny, cross‑platform web‑UI apps. Research cautioned that delegated LLM edits can corrupt documents and explored emergent modularity in MoE.
swipe →