AI Ecosystem · May 25, 2026

Weekly AI Infra Brief, 2026-05-25

78 updates

TL;DR

- **Biggest shift:** Cost economics flipped this week. Cursor Composer 2.5 reached near-parity with Claude Opus 4.7 (79.8% vs 80.5% on SWE-Bench Multilingual) at one-tenth the per-token cost, Microsoft started canceling Claude Code licenses because AI coding cost exceeded engineer salaries, and Anthropic acquired Stainless and hired Karpathy in the same week. Multi-model routing is no longer optional.
- **Strongest install candidate:** Patch every Next.js app to 15.5.18 or 16.2.6 this week. The May 6 coordinated security release covers 13 CVEs, including CVE-2026-44578 (unauthenticated WebSocket SSRF). Vercel-hosted apps are safe, but webapp-defrag and any self-hosted child app still need the bump.
- **Strongest publish-back opportunity:** The viral Karpathy CLAUDE.md (110k+ stars in 28 days) ships 4 generic behavioral rules. This workspace's CLAUDE.md is 10x deeper (Karpathy 3-layer wiki, tier-specific branching, multi-account CLI routing, MEMORY.md governance, persist-to-disk subagent discipline). A "Karpathy CLAUDE.md is the appetizer, here's the meal" teardown is a clean DFP content angle.

AI media production engineering

HeyGen integrates Sora 2 + Veo 3.1 + Kling + ElevenLabs + Flux in one editor ([elevenlabs.io/blog](https://elevenlabs.io/blog/how-heygen-uses-elevenlabs-to-deliver-lifelike-voice-for-ai-video)). Past 6 months: Video Agent 2.0, LiveAvatar redesign, Avatar Memory, Brand System. Single-pane consolidation continues. [INFORM]. We already have this assembled across tools via the auth-routing matrix. HeyGen consolidates UX, doesn't beat the routing.

Kling in House of David (44M viewers) (last30days/ai-video-generation, score 47). First mainstream Hollywood production to publicly acknowledge AI video gen at production scale. [INFORM]. Proof point that the gen-then-gate QA discipline this workspace runs (per `feedback_generate_then_gate`) is now studio-grade.

Veo 4 still unreleased. Google I/O 2026 (May 19-20) shipped Gemini 3.5 Flash + Antigravity 2.0 but NOT Veo 4. Window keeps slipping. Community sentiment on Veo 3.1 is sharply negative (last30days/veo-3, cluster "Veo 3.1 sucks"). [INFORM]. Hold any avatar-voice-veo architecture change. ElevenLabs Image & Video remains the primary route per `project_production_stack_2026_04`.

Content intelligence pipelines

Microsoft cancels Claude Code licenses (The Verge) ([reddit.com/r/StockMarket](https://www.reddit.com/r/StockMarket/comments/1tmuxde/microsoft_reportedly_pulled_claude_ai_licenses/), Hacker News 484pts). AI coding cost per engineer exceeded human engineer salaries at scale. Drop-in proof point for any DFP "foundations not 100% AI" content angle and Defrag positioning. [INFORM]. Pairs cleanly with Andreessen "vampires not replacements" quote from the same week.

House of David + Kling production storyline. Same item as media, but lands here too as the breakout "AI in mainstream content" narrative. Drop-in source for any spirit-molecule or content-intel piece on "AI video gen crosses into prestige TV." [INFORM].

Agent orchestration + multi-model economics

Anthropic acquires Stainless ([anthropic.com/news](https://www.anthropic.com/news/anthropic-acquires-stainless)). Reported $300M+. Stainless is winding down its hosted SDK generator; existing SDKs stay with customers. Strategic move: Anthropic is absorbing the SDK + MCP toolchain that also powers OpenAI and Google libraries. [INFORM]. No action required. Flag to any client building Claude SDKs that hosted generation is pausing.

Andrej Karpathy joins Anthropic pre-training team ([TechCrunch](https://techcrunch.com/2026/05/19/openai-co-founder-andrej-karpathy-joins-anthropics-pre-training-team/)). Working under Nick Joseph on using Claude to accelerate pre-training research. Plus the viral Karpathy CLAUDE.md file (110k+ stars). The workspace's wiki is already named after Karpathy's 3-layer pattern. [INFORM]. Talent signal + content opportunity (publish-back column).

Cursor Composer 2.5 matches Opus 4.7 at one-tenth cost ([cursor.com/blog](https://cursor.com/blog/composer-2-5), [artificialanalysis.ai](https://artificialanalysis.ai/articles/cursor-composer-2-5-coding-agent-index)). 79.8% SWE-Bench Multilingual (vs Opus 4.7's 80.5%), built on Moonshot Kimi K2.5 with 85% Cursor RL training, $0.50/$2.50 per M tokens. [EVALUATE]. Real threat to Claude Code lock-in for cost-sensitive client work. Doesn't unseat Opus 4.7 on this workspace's parent + app tier where reasoning quality matters more than cost. Worth testing on cowork-code-corp where token budgets get scrutinized.
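To make the one-tenth claim concrete, here is a minimal Python sketch of the per-job arithmetic. The Composer 2.5 prices ($0.50 input / $2.50 output per million tokens) come from the item above. The 10x comparison prices are derived only from the "one-tenth cost" claim, not from a published rate card, and the token counts and the `job_cost_usd` helper are hypothetical.

```python
def job_cost_usd(input_tokens: int, output_tokens: int,
                 input_price_per_m: float, output_price_per_m: float) -> float:
    """Cost in USD of one job, given per-million-token prices."""
    input_cost = (input_tokens / 1_000_000) * input_price_per_m
    output_cost = (output_tokens / 1_000_000) * output_price_per_m
    return input_cost + output_cost


# Prices quoted in this brief for Cursor Composer 2.5 (USD per million tokens).
COMPOSER_IN, COMPOSER_OUT = 0.50, 2.50

# Assumption: a comparison model priced at exactly 10x, taken from the
# "one-tenth cost" claim rather than from any published price list.
TENX_IN, TENX_OUT = COMPOSER_IN * 10, COMPOSER_OUT * 10

# Hypothetical agentic coding job: 2M input tokens, 200k output tokens.
composer = job_cost_usd(2_000_000, 200_000, COMPOSER_IN, COMPOSER_OUT)
tenx = job_cost_usd(2_000_000, 200_000, TENX_IN, TENX_OUT)

print(f"Composer 2.5: ${composer:.2f}  |  10x-priced model: ${tenx:.2f}")
```

Swapping in real token counts from a client project's usage logs would show whether the gap matters at that project's volume.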

Microsoft cancels Claude Code licenses (The Verge, HN 484pts) ([Hacker News thread](https://news.ycombinator.com/item?id=48231575)). Hard cost-economics data point. Microsoft's internal cost-per-engineer math broke. [INFORM]. Informs the Composer 2.5 evaluation above. If Microsoft can't sustain Claude Code billing, smaller shops definitely should look at multi-model routing.

Anthropic Dreaming GA in Managed Agents ([platform.claude.com docs](https://platform.claude.com/docs/en/managed-agents/dreams)). Background consolidation reads memory + session transcripts and produces reorganized memory. Harvey reported a 6x task completion lift in internal testing. `[CONFLICTS WITH: feedback_subagent_persist_to_disk]` already flagged in the 2026-05-11 brief. This is the same architectural tension, now at GA. [EVALUATE]. Request research-preview access for surfaces outside Managed Agents, where Dreaming is still in research preview. A manual analog already runs via wiki-reconcile-daily.

Anthropic doubles rate limits + SpaceX Colossus 1 lease (300 MW, 220k GPUs) ([anthropic.com/news](https://www.anthropic.com/news/higher-limits-spacex)). Repeat from the 2026-05-14 brief, reconfirmed at the May 20 London keynote. Workspace already benefits. [INFORM].

Google Antigravity 2.0 + Managed Agents in Gemini API ([blog.google](https://developers.googleblog.com/all-the-news-from-the-google-io-2026-developer-keynote/)). Standalone agent-first desktop app + CLI, Linux-sandboxed Managed Agents via single API call. Direct competitor to Claude Code agents. [EVALUATE]. Track for cowork-code-corp client conversations where Gemini Enterprise is already on contract.

Gemini CLI deprecates June 18, must migrate to Antigravity CLI ([Google Developers Blog](https://developers.googleblog.com/an-important-update-transitioning-gemini-cli-to-antigravity-cli/)). Workspace uses Gemini CLI via OAuth per `feedback_gemini_cli_oauth`. Antigravity CLI is closed-source at launch (Gemini CLI was Apache 2.0). [INSTALL NOW]. Schedule the migration before June 18 or accept service interruption on any child project using Gemini CLI.

App factory

Next.js coordinated security release (13 CVEs, May 6) ([vercel.com/changelog](https://vercel.com/changelog/next-js-may-2026-security-release)). Patch versions 15.5.18 and 16.2.6. CVE-2026-44578 is the worst (unauthenticated WebSocket SSRF). Vercel-hosted apps are safe. Self-hosted (any AWS-Terraform or Cloudflare Pages child app) needs the patch. [INSTALL NOW]. Repeat from 2026-05-11 brief with the patched version refined upward (15.5.16 / 16.2.5 shipped with incomplete middleware fix).
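The patched-version floor can be checked mechanically across child apps. The following is a minimal Python sketch, assuming plain `major.minor.patch` version strings with no pre-release tags. `PATCHED`, `parse_version`, and `needs_patch` are hypothetical names. The brief names patched releases only for the 15.x and 16.x lines, so any other major version returns `None` rather than a guess.

```python
from typing import Optional, Tuple

# Patched releases named in this brief, keyed by major version.
PATCHED = {15: (15, 5, 18), 16: (16, 2, 6)}


def parse_version(version: str) -> Tuple[int, int, int]:
    """Parse a plain 'major.minor.patch' string into a comparable tuple."""
    major, minor, patch = (int(part) for part in version.split("."))
    return major, minor, patch


def needs_patch(installed: str) -> Optional[bool]:
    """True if below the patched release for its major line, False if at or
    above it, None if this brief names no patched release for that line."""
    current = parse_version(installed)
    floor = PATCHED.get(current[0])
    if floor is None:
        return None
    return current < floor


for version in ("15.5.16", "15.5.18", "16.2.5", "16.2.6", "14.2.3"):
    print(version, needs_patch(version))
```

Tuple comparison handles the ordering, so 15.5.16 and 16.2.5 (the releases with the incomplete middleware fix) are still flagged.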

shadcn/ui 4.7.0 package imports + target aliases ([ui.shadcn.com/docs/changelog](https://ui.shadcn.com/docs/changelog/2026-05-package-imports-target-aliases)). Supports `package.json#imports` aliases instead of relying on tsconfig paths. Helps the monorepo path setup in every cowork app. [INSTALL NOW]. Adopt on the next shadcn install. Allowlist-clear (shadcn is on the active stack).

Vercel AI Gateway WordPress plugin (40+ providers, hundreds of models) ([vercel.com/changelog](https://vercel.com/changelog/vercel-ai-gateway-plugin-for-wordpress)). Single API key, unified billing, automatic fallback across providers including Anthropic, Google, OpenAI, xAI, DeepSeek. [INSTALL NOW] for any WordPress-shaped client surface (psychedelicsafari has a WordPress-style content layer). Allowlist-clear (Vercel official). For Next.js apps the AI SDK 6 covers this already.

Qwen 3.7 Max on Vercel AI Gateway (May 21) ([vercel changelog](https://vercel.com/changelog)). Alibaba agent-foundation model for office-workflow + long-horizon autonomous execution. [EVALUATE]. Add to the multi-model routing mix as the Composer 2.5 / Opus 4.7 cost tiers reshuffle. Allowlist-clear (Vercel official), but the use case isn't urgent.

Cloudflare WAF + framework adapter mitigations for the Next.js/React vulns ([developers.cloudflare.com](https://developers.cloudflare.com/changelog/post/2026-05-06-react-nextjs-vulnerabilities/)). For any Cloudflare-fronted child app, CF managed rules cover the worst of the May 6 disclosures even if you can't patch immediately. [INFORM]. Defense in depth. Still patch the apps.

Workspace + wiki governance

Karpathy CLAUDE.md viral file (110k+ stars, 28 consecutive days #1 GitHub trending) ([multica-ai/andrej-karpathy-skills](https://github.com/multica-ai/andrej-karpathy-skills)). Four generic rules: Think Before Coding, Simplicity First, Surgical Changes, Goal-Driven Execution. The barrier to adoption is zero (paste into project root). [PUBLISH-BACK]. This workspace's CLAUDE.md is dramatically deeper. Direct content opportunity (see TL;DR bullet 3).

safemcp.info indexes 28,577 verified MCP servers ([Reddit r/mcp](https://www.reddit.com/r/mcp/comments/1tm7duq/i_built_the_largest_free_directory_of_mcp_servers/)). Free, individually verified directory. Complements PulseMCP (15k+). Indie builder, 95 upvotes/29 comments in r/mcp. [EVALUATE]. Add to wiki-query MCP discovery options. PulseMCP remains the hand-reviewed source.

"How to connect 100 MCP servers without context window exploding" ([Reddit r/mcp 144pts](https://www.reddit.com/r/mcp/comments/1t73igk/how_to_connect_100_mcp_servers_without_the/)). Platform engineer post on the Context Tax (GitHub MCP alone loads ~50k tokens before the user types). Confirms that this workspace's discipline of running 4 MCP servers (apify, defrag, vercel, claude-in-chrome) is the right shape. [INFORM].
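The Context Tax is simple arithmetic: tokens spent on tool definitions before the first user message, divided by the context window. A minimal Python sketch follows. Only the ~50k figure for the GitHub MCP server comes from the post. The other server sizes, the 200k window, and the `context_tax` helper are hypothetical placeholders.

```python
from typing import Dict


def context_tax(tool_tokens_by_server: Dict[str, int], context_window: int) -> float:
    """Fraction of the context window consumed by MCP tool definitions
    before the first user message."""
    return sum(tool_tokens_by_server.values()) / context_window


# ~50k for the GitHub MCP server is the figure cited in the post.
# The other entries and the 200k window are hypothetical placeholders.
servers = {"github": 50_000, "server_b": 8_000, "server_c": 5_000}
share = context_tax(servers, context_window=200_000)

print(f"{share:.1%} of the window is spent before the user types")
```

Replacing the placeholder entries with measured token counts for each connected server gives a quick budget check before adding another one.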

Claude Cowork desktop app GA for non-developers ([Anthropic news](https://www.anthropic.com/product/claude-cowork)). Confirmed at Code with Claude London May 20. Knowledge work in local files/folders with read/edit/create permissions. Legal vertical is the most engaged segment. [EVALUATE]. Relevant for any healing-vertical client who would balk at terminal Claude Code. Defrag could ship Cowork instructions in client onboarding.

Claude Code

Top items: Microsoft canceling Claude Code licenses (HN 484pts, Reddit r/StockMarket 3,084pts). Cursor Composer 2.5 (Theo t3.gg 89,678 views).

Angle: Cost-economics narrative breaking against Claude Code at the enterprise tier just as Cursor's 1/10-cost parity model lands. The "use multi-model" thesis is becoming consensus.

Anthropic

Top items: Karpathy joining (May 19) dominated. Stainless acquisition (May 18). Code w/ Claude London May 20 keynote.

Angle: Anthropic continues consolidating developer-tooling moat (Stainless = MCP/SDK shop) and talent (Karpathy = pre-training research). Public narrative shift from "underdog" to "incumbent" continues.

MCP servers

Top items: safemcp.info 28,577 indexed (Reddit 95pts). Stainless reaction "Anthropic just bought the company that generates most MCP servers" (r/ClaudeAI 355pts/78cmt). "How to connect 100 MCPs without context exploding" (r/mcp 144pts/38cmt).

Angle: Ecosystem maturing past discovery into governance (Context Tax, security, registry consolidation). Builder skepticism on whether deterministic SDK gen still matters in an "agents slap together MCPs in any language" world.

Claude agents

Top items: "What 16 Parallel Claude Agents Built" (HN). Agent View launch (r/ClaudeAI 698pts/118cmt). "Ruflo deploys 50+ parallel Claude agents" (IG).

Angle: Parallel-agent fleet pattern is now table stakes. Workspace's `Skill` parallel fan-out is in line with the trend but lighter weight than fleet tools.

Lovable AI

Top items: $100M ARR celebrations. "Lovable is outdated, Base44 is 10x better" (Mikey No-Code).

Angle: Tool churn cycle visible in real-time. "Base44 better than Lovable" sentiment growing among no-code creators. Workspace's Lovable-as-starting-point + app-build hardening discipline still correct.

v0 dev

Top items: Thin week. No specific v0 dev signal in scraper.

Angle: Quiet relative to Cursor + Antigravity noise. v0 still the Vercel-default prompt-to-UI.

Cursor

Top items: Composer 2.5 (May 18 launch). Theo deep-dive 89k views. Matches Opus 4.7 at 1/10 cost.

Angle: Cursor recapturing narrative ground after Claude Code's redesign. The Moonshot Kimi K2.5 base + Cursor RL approach is a new playbook (open base + proprietary post-training).

AI coding agents

Top items: Herdr tmux-like multiplexer (HN). Verytis shared error memory MCP. Cplt kernel-level sandbox. Andreessen "vampires" quote.

Angle: Tooling for fleet management is the new battleground (multiplexers, shared memory, sandboxes). Discipline catching up with raw capability.

Veo 3

Top items: "Veo 3.1 sucks" thread. Community pinned on Veo 4 announcement. House of David / Kling crossover.

Angle: Veo 3.1 sentiment is poor and Veo 4 anticipation has slipped past Google I/O 2026 (no announcement May 19-20). Kling is taking the high-end content narrative. The workspace decision holds: keep the ElevenLabs + Gemini CLI route per `feedback_veo_continuity_modes`.

AI video generation

Top items: House of David / Kling 44M viewers (r/singularity 456pts). Dreamina Seedance 2.0 in CapCut (IG 9k likes). Hyper-realistic character continuity techniques (IG).

Angle: AI video crosses prestige-TV threshold this month. Production discipline (multi-frame structural coherence, character weight consistency) is now public conversation, matching workspace's generate-then-gate playbook.

ElevenLabs

Top items: Scrape ran at brief deadline. Spot-check on r/ElevenLabs and r/IndieDev showed steady creator content with no major API changes this week. ElevenLabs Image & Video remains the primary stack route.

Angle: Quiet week from ElevenLabs ecosystem. No urgent action.

HeyGen

Top items: Scrape ran at brief deadline. Signal covered via HeyGen integration items in domain 1.

Angle: Sora 2 + Veo 3.1 + Kling integration story dominates. Same item already in primary brief.

Project: projects/cowork-defrag/TODO.md

Section: this-week

Line: "- [ ] Patch webapp-defrag to Next.js 15.5.18 or 16.2.6 (May 6 security release, 13 CVEs incl CVE-2026-44578 WebSocket SSRF). Run after current Drizzle migration cycle settles."

Why: Self-hosted Next.js apps remain exposed until patched. Vercel-hosted apps are safe, and webapp-defrag is Vercel-managed, but it should still bump to clear knip/typecheck against the newest types.

Project: projects/cowork-defrag/TODO.md

Section: next-week

Line: "- [ ] Draft DFP blog post: 'Karpathy CLAUDE.md is the appetizer, here's the meal'. Teardown of parent CLAUDE.md vs viral 110k-star file. Pull from feedback_synthesize_dont_cite, tier-specific branching, multi-account CLI, persist-to-disk subagent."

Why: Strongest publish-back opportunity this brief. Viral pattern + workspace already 10x deeper. Direct content angle for DFP foundations thesis.

Project: projects/cowork-code-corp/TODO.md (if exists)

Section: inbox

Line: "- [ ] Evaluate Cursor Composer 2.5 for CodeCorp client work where token cost matters. SWE-Bench Multilingual 79.8% vs Opus 4.7's 80.5% at 1/10 cost. Don't replace parent Opus 4.7 default."

Why: Cost economics flipped this week (Microsoft cost story + Composer 2.5 launch). CodeCorp's client work is the right environment to test. Parent + Defrag work stays Opus.

Project: parent (root TODO.md)

Section: inbox

Line: "- [ ] Migrate Gemini CLI usage to Antigravity CLI before June 18, 2026. Audit which child projects use gemini CLI via OAuth per feedback_gemini_cli_oauth.md. Antigravity CLI is closed-source at launch (Gemini CLI was Apache 2.0)."

Why: Hard deadline. Service interruption on any project still on Gemini CLI after June 18.

Project: parent (root TODO.md)

Section: inbox

Line: "- [ ] Add Dreaming research-preview access request to next Anthropic touchpoint. Compare to manual wiki-reconcile-daily + MEMORY.md governance loop."

Why: Dreaming is GA in Managed Agents but still in research preview elsewhere. Harvey's reported 6x task-completion lift is significant. Our analog process is manual.

Newsletters: newsletter-digest skill invoked. No Gmail MCP results captured at brief deadline. Cross-reference release-bot output instead.

YouTube channels + playlists: Theo (t3.gg) Cursor Composer 2.5 (89k views), AIM Network "Microsoft Killed Claude Code" (745 views).

Web/forums: TechCrunch, Anthropic news, Vercel changelog, blog.google, Google Developers Blog, Reddit (r/ClaudeAI, r/StockMarket, r/mcp, r/singularity, r/devops, r/VEO3), Hacker News (484+ pt thread on Microsoft canceling), MIT Tech Review, simonwillison.net, releasebot.io.

GitHub topics: anthropics/claude-code, multica-ai/andrej-karpathy-skills (110k+ stars), mattpocock/skills (55k stars), github/github-mcp-server, modelcontextprotocol/servers.

PulseMCP: dental planning MCP (May 12), Make integration update (May 12), Haas.my (May 7), Agent Times MCP (May 7).

Cowork-specific (delegated to existing skills): newsletter-digest, cowork-news-research, cowork-youtube-research all invoked. Horizontal last30days ran 12 of 12 terms successfully.

Date: 2026-05-25

Brief generated by: infra-improver agent

Domain scope file version: 94c6af4

Items considered: ~85 before filter, 22 after filter

Failure modes triggered: None hard.

Time: 08:30 PT