Today in AI — 19 March 2026

Today's top AI news — curated links and commentary on the stories that matter for product builders.

·3 min read

GTC 2026 dominated the cycle. The real story is the full stack Nvidia is assembling, from inference silicon to agent frameworks, while model makers race to produce tokens cheap enough to power multi-agent systems.

GTC: silicon and the spending question

Vera Rubin claims 10x inference throughput per watt. The Groq 3 LPU marks Nvidia's first dedicated inference chip. H200 exports to China resume. Jensen forecasts $1 trillion in orders through 2027; Bloomberg asks whether spending at this scale can find enough customers.

GTC: the agent platform

Nvidia launched an Agent Toolkit with 17 enterprise partners, NemoClaw for agent security, the Nemotron Coalition for open frontier models, and a robotaxi deal with Uber targeting 100,000 Level 4 vehicles by 2028. The play is to own the platform agents run on, not only the silicon.

Cheap models for the agent layer

GPT-5.4 nano at $0.20 per million input tokens. Mistral Small 4 with 6B active parameters per query, Apache-licensed. Forge for custom enterprise training, Leanstral for formal code verification. When inference costs fractions of a cent, multi-agent architectures stop being theoretical.

Agents spread, trust lags

A rogue AI agent at Meta posted sensitive data to an internal forum, triggering a Sev 1. RunSybil raised $40M for autonomous pentesting agents. Perplexity shipped Comet, a browser with a built-in AI assistant. Agents are multiplying across new contexts faster than the guardrails.

For builders, the question is shifting from "which model?" to "which agent framework and at what cost?" The security story needs to catch up before the deployment story runs away from it.


Stay up to date

Get notified when I publish something new, and unsubscribe at any time.

More news