Anthropic grew 80× while agents learn on their own

Anthropic's 80× Q1 Growth; "Dreaming" and "Outcomes" Ship

  • Dario Amodei disclosed at the Code with Claude conference that Anthropic achieved 80× annualized revenue growth in Q1 2026 — far exceeding the projected 10× — with API volume up 70× YoY; Claude Code users now average 20 hours/week of tool use.
  • Anthropic signed a compute deal with SpaceX for all 300MW of Colossus 1 (220,000+ NVIDIA GPUs) and is doubling Claude Code's five-hour rate limits across Pro/Max/Team/Enterprise, removing peak-hour reductions.
  • A new "Dreaming" feature (research preview) lets agents autonomously create plain-text notes and structured playbooks from past sessions — no weight changes, fully auditable; Harvey AI reported a 6× task completion rate increase using it.
  • "Outcomes" (public beta) deploys a separate grader agent evaluating working-agent output against developer-defined success rubrics and iterating without human intervention; Wisedocs cut document review time 50%; Multi-Agent Orchestration entered public beta simultaneously — Netflix is processing logs from hundreds of builds at once.
  • Anthropic is in new funding talks at a $900B valuation; Musk — who testified against OpenAI days earlier in Oakland — publicly reversed his stance: "No one set off my evil detector." The White House and Pentagon remain actively at odds over the Trump blacklist, with Trump calling a deal "possible" while Defense Secretary Hegseth holds the ban. (Previously: Anthropic sued the Trump administration over the supply chain risk designation.)

OpenAI Formalizes Enterprise: DeployCo Launches, Daybreak Challenges Glasswing, Codex Gets Chrome

  • OpenAI formally launched the Deployment Company — a standalone enterprise business backed by TPG, Bain Capital, Goldman Sachs, SoftBank, McKinsey, Capgemini, and 13 other partners — acquiring Tomoro (~150 Forward Deployed Engineers) who embed inside client organizations to redesign workflows around GPT-5.5 and Codex. (Previously: the leaked CRO memo named DeployCo as a strategic pillar — it now has a legal entity and balance sheet.)
  • OpenAI announced Daybreak, a three-tier enterprise cybersecurity platform integrating GPT-5.5 with Cisco, CrowdStrike, Palo Alto Networks, Cloudflare, Fortinet, Zscaler, Okta, and Snyk; the most restricted tier — GPT-5.5-Cyber for red teaming and pen testing — directly challenges Anthropic's Mythos/Glasswing.
  • Codex can now directly control Chrome on macOS and Windows as of May 8–12, filling the frontend web UI gap Sam Altman acknowledged last week and replacing screenshot-based computer use for browser-native tasks.
  • OpenAI published a Codex security deployment guide covering managed network policies, per-command approval tiers, OS keyring credential storage, and agent-native OpenTelemetry logs enabling AI-powered triage to distinguish expected agent behavior from genuine security incidents.

Microsoft's 100-Agent Harness Tops CyberGym; Semantic Kernel Gets Two Critical Patches

  • Microsoft unveiled MDASH — a 100+ specialized AI agent pipeline that found 16 new Windows vulnerabilities including 4 critical RCEs in the kernel TCP/IP stack and IKEv2 service — scoring 88.45% on CyberGym to lead the public leaderboard; 100% recall on planted vulnerabilities in tcpip.sys with zero false positives.
  • Microsoft's stated architectural principle: "the durable advantage lies in the agentic system around the model" rather than any single model — MDASH is currently used by Microsoft security teams with private preview available.
  • Microsoft simultaneously disclosed two critical CVEs in its own Semantic Kernel framework: CVE-2026-26030 (Python SDK ≤1.39.3) enables host-level RCE via prompt injection through an unsanitized eval() call — fix: upgrade to 1.39.4+; CVE-2026-25592 (.NET SDK ≤1.70.x) allows sandbox escape and arbitrary file write to the Windows Startup folder — fix: upgrade to 1.71.0+.
  • Microsoft's accompanying warning: "AI models are not security boundaries" — any tool parameter influenced by model output must be treated as attacker-controlled input regardless of model or framework.

AWS Gives Agents Payment Accounts; Google Pre-Loads I/O with Android Agents

  • Amazon Bedrock AgentCore Payments (preview) — built with Coinbase and Stripe — lets AI agents autonomously pay for APIs, MCP servers, web content, and other agents without custom billing or credential management.
  • Amazon WorkSpaces for AI agents (preview) lets agents access and operate desktop applications inside managed WorkSpaces; AWS MCP Server went GA as a managed authenticated endpoint for all AWS services, per the same announcement.
  • AWS Kiro added "Requirements Analysis" — combining LLMs with an SMT solver for mathematical verification of requirements before any code is generated — cutting implementation time by up to 75% and directly targeting AI slop from ambiguous specs.
  • Google's Android Show: I/O Edition (May 12) previewed Googlebooks (AI-native laptops from five OEMs, launching fall 2026), Gemini executing cross-app multi-step tasks from natural language on Android, and "Create My Widget" — vibe-coded custom widgets described in natural language, launching first on Galaxy and Pixel; full I/O follows May 19 with Gemini 4 expected.
  • Google published a long-running ADK agents tutorial using durable state machines, SQLite/Cloud SQL session persistence, and event-driven webhooks — so enterprise agents sleep during human approval pauses and resume without losing context — deployable to Agent Runtime with scale-to-zero.

Red Hat AgentOps, UiPath Orchestration, and the Self-Hosted Coding Race

  • At Red Hat Summit, AgentOps shipped with cryptographic agent identity (SPIFFE IDs), behavioral anomaly detection flagging suspicious actions that technically succeed (e.g., PII exfiltration attempts that trigger secret-dumping), and an AI gateway routing requests to the optimal model; NVIDIA's open-source OpenShell provides the underlying sandbox runtime with credential gateways outside each agent process — ServiceNow's Project Arc and the new NOWAI-Bench open enterprise benchmarking suite both run on OpenShell.
  • Red Hat Desktop went GA on May 12 — commercial Podman Desktop support with isolated AI agent sandboxing built on SLSA Level 3, SBOM-backed Hardened Images; Fedora Hummingbird Linux ships alongside as a rolling-release OS continuously built by an agentic software factory with zero-CVE-at-release targets.
  • UiPath launched native integrations for Claude Code and Codex built on Temporal's durable execution — Maestro provides policy enforcement, audit trails, credential vaults, and RBAC for all agent-generated automations; UiPath is explicitly not building its own coding agent, betting the governance layer is more defensible.
  • Coder launched Coder Agents in public beta (May 6) — the first self-hosted, model-agnostic AI coding agent platform — with source code, prompts, and model traffic never leaving customer infrastructure; free through September 2026, targeting enterprises blocked by ISO 27001, SOC 2, HIPAA, and PCI-DSS from cloud-hosted tools.
  • Nova raised $31M Series A to bring agentic coding to the SAP ecosystem — a 34B€/year market where code lives in databases, not files, making standard agents useless out of the box; Nova's dev container syncs file system changes bidirectionally with SAP servers, with Git providing checkpointing and time-travel across the 5,000–20,000 custom programs enterprises must migrate to S/4HANA.
  • GitHub engineering published a "Trust Layer" methodology for validating non-deterministic agent behavior — modeling execution traces as Prefix Tree Acceptors with compiler-theory dominator analysis — achieving 100% accuracy vs. 82.2% for agent self-assessment; agents score 0% F1 on "not a bug" scenarios when grading their own work.
  • SpaceXAI Grok Build (rebranded from xAI) remains unlaunched for a sixth consecutive week since Musk's "next week" pledge, now absorbed into SpaceX's orbital compute strategy; early-access testers confirm plugins, MCPs, skills, git worktree support, and a built-in browser are functional — no official launch date.

Get Agentic software development in your inbox

Subscribe to receive new issues as they're published.