AI Ecosystem · May 25, 2026
Weekly AI Infra Brief, 2026-05-25
78 updates
TL;DR
- **Biggest shift:** Cost economics flipped this week. Cursor Composer 2.5 hit Claude Opus 4.7 parity at one-tenth the per-token cost, Microsoft started canceling Claude Code licenses because AI coding cost exceeded engineer salaries, and Anthropic acquired Stainless plus hired Karpathy on the same week. Multi-model routing is no longer optional. - **Strongest install candidate:** Patch every Next.js app to 15.5.18 or 16.2.6 this week. The May 6 coordinated security release covers 13 CVEs including CVE-2026-44578 (unauthenticated WebSocket SSRF). Vercel-hosted is safe, but webapp-defrag and any self-hosted child app needs the bump. - **Strongest publish-back opportunity:** The viral Karpathy CLAUDE.md (110k+ stars in 28 days) ships 4 generic behavioral rules. This workspace's CLAUDE.md is 10x deeper (Karpathy 3-layer wiki, tier-specific branching, multi-account CLI routing, MEMORY.md governance, persist-to-disk subagent discipline). A "Karpathy CLAUDE.md is the appetizer, here's the meal" teardown is a clean DFP content angle.
AI media production engineering
HeyGen integrates Sora 2 + Veo 3.1 + Kling + ElevenLabs + Flux in one editor
HeyGen integrates Sora 2 + Veo 3.1 + Kling + ElevenLabs + Flux in one editor ([elevenlabs.io/blog](https://elevenlabs.io/blog/how-heygen-uses-elevenlabs-to-deliver-lifelike-voice-for-ai-video)). Past 6 months: Video Agent 2.0, LiveAvatar redesign, Avatar Memory, Brand System. Single-pane consolidation continues. [INFORM]. We already have this assembled across tools via the auth-routing matrix. HeyGen consolidates UX, doesn't beat the routing.
Source →
Kling in House of David (44M viewers)
Kling in House of David (44M viewers) (last30days/ai-video-generation, score 47). First mainstream Hollywood production to publicly acknowledge AI video gen at production scale. [INFORM]. Proof point that the gen-then-gate QA discipline this workspace runs (per `feedback_generate_then_gate`) is now studio-grade.
Veo 4 still unreleased.
Veo 4 still unreleased. Google I/O 2026 (May 19-20) shipped Gemini 3.5 Flash + Antigravity 2.0 but NOT Veo 4. Window keeps slipping. Community sentiment on Veo 3.1 is sharply negative (last30days/veo-3, cluster "Veo 3.1 sucks"). [INFORM]. Hold any avatar-voice-veo architecture change. ElevenLabs Image & Video remains the primary route per `project_production_stack_2026_04`.
Content intelligence pipelines
Microsoft cancels Claude Code licenses (The Verge)
Microsoft cancels Claude Code licenses (The Verge) ([reddit.com/r/StockMarket](https://www.reddit.com/r/StockMarket/comments/1tmuxde/microsoft_reportedly_pulled_claude_ai_licenses/), Hacker News 484pts). AI coding cost per engineer exceeded human engineer salaries at scale. Drop-in proof point for any DFP "foundations not 100% AI" content angle and Defrag positioning. [INFORM]. Pairs cleanly with Andreessen "vampires not replacements" quote from the same week.
Source →
House of David + Kling production storyline.
House of David + Kling production storyline. Same item as media, but lands here too as the breakout "AI in mainstream content" narrative. Drop-in source for any spirit-molecule or content-intel piece on "AI video gen crosses into prestige TV." [INFORM].
Agent orchestration + multi-model economics
Anthropic acquires Stainless
Anthropic acquires Stainless ([anthropic.com/news](https://www.anthropic.com/news/anthropic-acquires-stainless)). Reported $300M+. Stainless wind-down of hosted SDK generator. Existing SDKs stay with customers. Strategic move. Anthropic absorbing the SDK + MCP toolchain that also powers OpenAI and Google libraries. [INSTALL NOW]. No action required. Flag to any client building Claude SDKs that hosted gen pauses.
Source →
Andrej Karpathy joins Anthropic pre-training team
Andrej Karpathy joins Anthropic pre-training team ([TechCrunch](https://techcrunch.com/2026/05/19/openai-co-founder-andrej-karpathy-joins-anthropics-pre-training-team/)). Working under Nick Joseph on using Claude to accelerate pre-training research. Plus the viral Karpathy CLAUDE.md file (110k+ stars). The workspace's wiki is already named after Karpathy's 3-layer pattern. [INFORM]. Talent signal + content opportunity (publish-back column).
Source →
Cursor Composer 2.5 matches Opus 4.7 at one-tenth cost
Cursor Composer 2.5 matches Opus 4.7 at one-tenth cost ([cursor.com/blog](https://cursor.com/blog/composer-2-5), [artificialanalysis.ai](https://artificialanalysis.ai/articles/cursor-composer-2-5-coding-agent-index)). 79.8% SWE-Bench Multilingual (vs Opus 4.7's 80.5%), built on Moonshot Kimi K2.5 with 85% Cursor RL training, $0.50/$2.50 per M tokens. [EVALUATE]. Real threat to Claude Code lock-in for cost-sensitive client work. Doesn't unseat Opus 4.7 on this workspace's parent + app tier where reasoning quality matters more than cost. Worth testing on cowork-code-corp where token budgets get scrutinized.
Source →
Microsoft cancels Claude Code licenses (The Verge, HN 484pts)
Microsoft cancels Claude Code licenses (The Verge, HN 484pts) ([Hacker News thread](https://news.ycombinator.com/item?id=48231575)). Hard cost-economics data point. Microsoft's internal cost-per-engineer math broke. [INFORM]. Informs the Composer 2.5 evaluation above. If Microsoft can't sustain Claude Code billing, smaller shops definitely should look at multi-model routing.
Source →
Anthropic Dreaming GA in Managed Agents
Anthropic Dreaming GA in Managed Agents ([platform.claude.com docs](https://platform.claude.com/docs/en/managed-agents/dreams)). Background consolidation reads memory + session transcripts, produces reorganized memory. Harvey reported 6x task completion lift in internal testing. `[CONFLICTS WITH: feedback_subagent_persist_to_disk]` already flagged in the 2026-05-11 brief. This is the same arch tension at GA. [EVALUATE]. Request research preview access. Manual analog already runs via wiki-reconcile-daily.
Source →
Anthropic doubles rate limits + SpaceX Colossus 1 lease (300 MW, 220k GPUs)
Anthropic doubles rate limits + SpaceX Colossus 1 lease (300 MW, 220k GPUs) ([anthropic.com/news](https://www.anthropic.com/news/higher-limits-spacex)). Repeat from 2026-05-14 brief but the London keynote May 20 reconfirmed. Workspace already benefits. [INFORM].
Source →
Google Antigravity 2.0 + Managed Agents in Gemini API
Google Antigravity 2.0 + Managed Agents in Gemini API ([blog.google](https://developers.googleblog.com/all-the-news-from-the-google-io-2026-developer-keynote/)). Standalone agent-first desktop app + CLI, Linux-sandboxed Managed Agents via single API call. Direct competitor to Claude Code agents. [EVALUATE]. Track for cowork-code-corp client conversations where Gemini Enterprise is already on contract.
Source →
Gemini CLI deprecates June 18, must migrate to Antigravity CLI
Gemini CLI deprecates June 18, must migrate to Antigravity CLI ([Google Developers Blog](https://developers.googleblog.com/an-important-update-transitioning-gemini-cli-to-antigravity-cli/)). Workspace uses Gemini CLI via OAuth per `feedback_gemini_cli_oauth`. Antigravity CLI is closed-source at launch (Gemini CLI was Apache 2.0). [INSTALL NOW]. Schedule the migration before June 18 or accept service interruption on any child project using Gemini CLI.
Source →
App factory
Next.js coordinated security release (13 CVEs, May 6)
Next.js coordinated security release (13 CVEs, May 6) ([vercel.com/changelog](https://vercel.com/changelog/next-js-may-2026-security-release)). Patch versions 15.5.18 and 16.2.6. CVE-2026-44578 is the worst (unauthenticated WebSocket SSRF). Vercel-hosted apps are safe. Self-hosted (any AWS-Terraform or Cloudflare Pages child app) needs the patch. [INSTALL NOW]. Repeat from 2026-05-11 brief with the patched version refined upward (15.5.16 / 16.2.5 shipped with incomplete middleware fix).
Source →
shadcn/ui 4.7.0 package imports + target aliases
shadcn/ui 4.7.0 package imports + target aliases ([ui.shadcn.com/docs/changelog](https://ui.shadcn.com/docs/changelog/2026-05-package-imports-target-aliases)). `package.json#imports` aliases instead of relying on tsconfig paths. Helps the monorepo path setup in every cowork app. [INSTALL NOW]. Adopt on next shadcn install. Allowlist-clear (shadcn is on the active stack).
Source →
Vercel AI Gateway WordPress plugin (40+ providers, hundreds of models)
Vercel AI Gateway WordPress plugin (40+ providers, hundreds of models) ([vercel.com/changelog](https://vercel.com/changelog/vercel-ai-gateway-plugin-for-wordpress)). Single API key, unified billing, automatic fallback across providers including Anthropic, Google, OpenAI, xAI, DeepSeek. [INSTALL NOW] for any WordPress-shaped client surface (psychedelicsafari has a WordPress-style content layer). Allowlist-clear (Vercel official). For Next.js apps the AI SDK 6 covers this already.
Source →
Qwen 3.7 Max on Vercel AI Gateway (May 21)
Qwen 3.7 Max on Vercel AI Gateway (May 21) ([vercel changelog](https://vercel.com/changelog)). Alibaba agent-foundation model for office-workflow + long-horizon autonomous execution. [EVALUATE]. Add to the multi-model routing mix as the Composer 2.5 / Opus 4.7 cost-tier reshuffles. Allowlist-clear (Vercel official) but use case isn't urgent.
Source →
Cloudflare WAF + framework adapter mitigations for the Next.js/React vulns
Cloudflare WAF + framework adapter mitigations for the Next.js/React vulns ([developers.cloudflare.com](https://developers.cloudflare.com/changelog/post/2026-05-06-react-nextjs-vulnerabilities/)). For any Cloudflare-fronted child app, CF managed rules cover the worst of the May 6 disclosures even if you can't patch immediately. [INFORM]. Defense in depth. Still patch the apps.
Source →
Workspace + wiki governance
Karpathy CLAUDE.md viral file (110k+ stars, 28 consecutive days #1 GitHub trending)
Karpathy CLAUDE.md viral file (110k+ stars, 28 consecutive days #1 GitHub trending) ([multica-ai/andrej-karpathy-skills](https://github.com/multica-ai/andrej-karpathy-skills)). Four generic rules: Think Before Coding, Simplicity First, Surgical Changes, Goal-Driven Execution. The barrier to adoption is zero (paste into project root). [PUBLISH-BACK]. This workspace's CLAUDE.md is dramatically deeper. Direct content opportunity (see TL;DR bullet 3).
Source →
safemcp.info indexes 28,577 verified MCP servers
safemcp.info indexes 28,577 verified MCP servers ([Reddit r/mcp](https://www.reddit.com/r/mcp/comments/1tm7duq/i_built_the_largest_free_directory_of_mcp_servers/)). Free, individually verified directory. Complements PulseMCP (15k+). Indie builder, 95 upvotes/29 comments in r/mcp. [EVALUATE]. Add to wiki-query MCP discovery options. PulseMCP is the hand-reviewed source still.
Source →
"How to connect 100 MCP servers without context window exploding"
"How to connect 100 MCP servers without context window exploding" ([Reddit r/mcp 144pts](https://www.reddit.com/r/mcp/comments/1t73igk/how_to_connect_100_mcp_servers_without_the/)). Platform engineer post on the Context Tax (GitHub MCP alone loads ~50k tokens before user types). Confirms this workspace's discipline of running 4 MCP servers (apify, defrag, vercel, claude-in-chrome) is correct shape. [INFORM].
Source →
Claude Cowork desktop app GA for non-developers
Claude Cowork desktop app GA for non-developers ([Anthropic news](https://www.anthropic.com/product/claude-cowork)). Confirmed at Code with Claude London May 20. Knowledge work in local files/folders with read/edit/create permissions. Legal vertical is the most engaged segment. [EVALUATE]. Relevant for any healing-vertical client who would balk at terminal Claude Code. Defrag could ship Cowork instructions in client onboarding.
Source →
Claude Code
Top items:
Top items: Microsoft canceling Claude Code licenses (HN 484pts, Reddit r/StockMarket 3,084pts). Cursor Composer 2.5 (Theo t3.gg 89,678 views).
Angle:
Angle: Cost-economics narrative breaking against Claude Code at the enterprise tier just as Cursor's 1/10-cost parity model lands. The "use multi-model" thesis is becoming consensus.
Anthropic
Top items:
Top items: Karpathy joining (May 19) dominated. Stainless acquisition (May 18). Code w/ Claude London May 20 keynote.
Angle:
Angle: Anthropic continues consolidating developer-tooling moat (Stainless = MCP/SDK shop) and talent (Karpathy = pre-training research). Public narrative shift from "underdog" to "incumbent" continues.
MCP servers
Top items:
Top items: safemcp.info 28,577 indexed (Reddit 95pts). Stainless reaction "Anthropic just bought the company that generates most MCP servers" (r/ClaudeAI 355pts/78cmt). "How to connect 100 MCPs without context exploding" (r/mcp 144pts/38cmt).
Angle:
Angle: Ecosystem maturing past discovery into governance (Context Tax, security, registry consolidation). Builder skepticism on whether deterministic SDK gen still matters in an "agents slap together MCPs in any language" world.
Claude agents
Top items:
Top items: "What 16 Parallel Claude Agents Built" (HN). Agent View launch (r/ClaudeAI 698pts/118cmt). "Ruflo deploys 50+ parallel Claude agents" (IG).
Angle:
Angle: Parallel-agent fleet pattern is now table stakes. Workspace's `Skill` parallel fan-out is in line with the trend but lighter weight than fleet tools.
Lovable AI
Top items:
Top items: $100M ARR celebrations. "Lovable is outdated, Base44 is 10x better" (Mikey No-Code).
Angle:
Angle: Tool churn cycle visible in real-time. "Base44 better than Lovable" sentiment growing among no-code creators. Workspace's Lovable-as-starting-point + app-build hardening discipline still correct.
v0 dev
Top items:
Top items: Thin week. No specific v0 dev signal in scraper.
Angle:
Angle: Quiet relative to Cursor + Antigravity noise. v0 still the Vercel-default prompt-to-UI.
Cursor
Top items:
Top items: Composer 2.5 (May 18 launch). Theo deep-dive 89k views. Matches Opus 4.7 at 1/10 cost.
Angle:
Angle: Cursor recapturing narrative ground after Claude Code's redesign. The Moonshot Kimi K2.5 base + Cursor RL approach is a new playbook (open base + proprietary post-training).
AI coding agents
Top items:
Top items: Herdr tmux-like multiplexer (HN). Verytis shared error memory MCP. Cplt kernel-level sandbox. Andreessen "vampires" quote.
Angle:
Angle: Tooling for fleet management is the new battleground (multiplexers, shared memory, sandboxes). Discipline catching up with raw capability.
Veo 3
Top items:
Top items: "Veo 3.1 sucks" thread. Community pinned on Veo 4 announcement. House of David / Kling crossover.
Angle:
Angle: Veo 3.1 sentiment is poor and Veo 4 anticipation has slipped past Google I/O 2026 (no announcement May 19-20). Kling is taking the high-end content narrative. Holds workspace decision. Keep ElevenLabs + Gemini CLI route per `feedback_veo_continuity_modes`.
AI video generation
Top items:
Top items: House of David / Kling 44M viewers (r/singularity 456pts). Dreamina Seedance 2.0 in CapCut (IG 9k likes). Hyper-realistic character continuity techniques (IG).
Angle:
Angle: AI video crosses prestige-TV threshold this month. Production discipline (multi-frame structural coherence, character weight consistency) is now public conversation, matching workspace's generate-then-gate playbook.
ElevenLabs
Top items:
Top items: Scrape ran at brief deadline. Spot-check on r/ElevenLabs and r/IndieDev showed steady creator content with no major API changes this week. ElevenLabs Image & Video remains the primary stack route.
Angle:
Angle: Quiet week from ElevenLabs ecosystem. No urgent action.
HeyGen
Top items:
Top items: Scrape ran at brief deadline. Signal covered via HeyGen integration items in domain 1.
Angle:
Angle: Sora 2 + Veo 3.1 + Kling integration story dominates. Same item already in primary brief.
Project:
Project: projects/cowork-defrag/TODO.md
Section:
Section: this-week
Line:
Line: "- [ ] Patch webapp-defrag to Next.js 15.5.18 or 16.2.6 (May 6 security release, 13 CVEs incl CVE-2026-44578 WebSocket SSRF). Run after current Drizzle migration cycle settles."
Why:
Why: Self-hosted Next.js apps remain exposed until patched. Vercel-hosted is safe but webapp-defrag is on Vercel-managed and should still bump to clear knip/typecheck against newest types.
Project:
Project: projects/cowork-defrag/TODO.md
Section:
Section: next-week
Line:
Line: "- [ ] Draft DFP blog post: 'Karpathy CLAUDE.md is the appetizer, here's the meal'. Teardown of parent CLAUDE.md vs viral 110k-star file. Pull from feedback_synthesize_dont_cite, tier-specific branching, multi-account CLI, persist-to-disk subagent."
Why:
Why: Strongest publish-back opportunity this brief. Viral pattern + workspace already 10x deeper. Direct content angle for DFP foundations thesis.
Project:
Project: projects/cowork-code-corp/TODO.md (if exists)
Section:
Section: inbox
Line:
Line: "- [ ] Evaluate Cursor Composer 2.5 for CodeCorp client work where token cost matters. SWE-Bench Multilingual 79.8% vs Opus 4.7's 80.5% at 1/10 cost. Don't replace parent Opus 4.7 default."
Why:
Why: Cost economics flipped this week (Microsoft cost story + Composer 2.5 launch). CodeCorp's client work is the right environment to test. Parent + Defrag work stays Opus.
Project:
Project: parent (root TODO.md)
Section:
Section: inbox
Line:
Line: "- [ ] Migrate Gemini CLI usage to Antigravity CLI before June 18, 2026. Audit which child projects use gemini CLI via OAuth per feedback_gemini_cli_oauth.md. Antigravity CLI is closed-source at launch (Gemini CLI was Apache 2.0)."
Why:
Why: Hard deadline. Service interruption on any project still on Gemini CLI after June 18.
Project:
Project: parent (root TODO.md)
Section:
Section: inbox
Line:
Line: "- [ ] Add Dreaming research-preview access request to next Anthropic touchpoint. Compare to manual wiki-reconcile-daily + MEMORY.md governance loop."
Why:
Why: Dreaming is GA in Managed Agents but research preview elsewhere. 6x lift on Harvey is significant. Our analog process is manual.
Newsletters:
Newsletters: newsletter-digest skill invoked. No Gmail MCP results captured at brief deadline. Cross-reference release-bot output instead.
YouTube channels + playlists:
YouTube channels + playlists: Theo (t3.gg) Cursor Composer 2.5 (89k views), AIM Network "Microsoft Killed Claude Code" (745 views).
Web/forums:
Web/forums: TechCrunch, Anthropic news, Vercel changelog, blog.google, Google Developers Blog, Reddit (r/ClaudeAI, r/StockMarket, r/mcp, r/singularity, r/devops, r/VEO3), Hacker News (484+ pt thread on Microsoft canceling), MIT Tech Review, simonwillison.net, releasebot.io.
GitHub topics:
GitHub topics: anthropics/claude-code, multica-ai/andrej-karpathy-skills (110k+ stars), mattpocock/skills (55k stars), github/github-mcp-server, modelcontextprotocol/servers.
PulseMCP:
PulseMCP: dental planning MCP (May 12), Make integration update (May 12), Haas.my (May 7), Agent Times MCP (May 7).
Cowork-specific (delegated to existing skills):
Cowork-specific (delegated to existing skills): newsletter-digest, cowork-news-research, cowork-youtube-research all invoked. Horizontal last30days ran 12 of 12 terms successfully.
Date:
Date: 2026-05-25
Brief generated by:
Brief generated by: infra-improver agent
Domain scope file version:
Domain scope file version: 94c6af4
Items considered:
Items considered: ~85 before filter, 22 after filter
Failure modes triggered:
Failure modes triggered: None hard.
Time:
Time: 08:30 PT