The $38B Plot Twist: OpenAI Just Bet Big on AWS

This Week in Cloud — November 6, 2025

Welcome back to The Cloud Cover, your essential guide to navigating the shifting landscape of cloud and AI infrastructure. This week, OpenAI surprises everyone with a $38B pivot towards AWS, Microsoft doubles down on sovereignty and GPU supply, and Google turns “agents” into a native part of the data stack. The cloud wars are getting physical — let’s dive in.

A $38B Handshake Shakes the Cloud Landscape

The defining story of the week is OpenAI’s stunning 7-year, $38 billion strategic partnership with Amazon Web Services. This is a monumental development, primarily because OpenAI has long been the flagship AI partner for Microsoft, relying on Azure as its "sole source of computing power" for years.

This move signals that the primary bottleneck for AI dominance is no longer cloud allegiance but access to raw compute. OpenAI's demand for next-generation NVIDIA GPUs (specifically the GB200 and GB300 models) is so voracious that no single provider—not even Microsoft—can satisfy it. This $38 billion deal is an aggressive diversification to de-risk its most critical supply chain.

For AWS, this is a brilliant counter-move. After being perceived as trailing in the initial GenAI narrative, AWS has secured a massive contract to host the world's most prominent AI workloads. As AWS CEO Matt Garman noted, it's a "powerful reminder" of why organizations trust AWS for massive scale—a direct shot at Microsoft, implying it couldn't handle OpenAI's needs alone.

One crucial clarification: this is an infrastructure deal. It's about AWS providing EC2 UltraServers to run OpenAI's core workloads. This does not mean OpenAI's models are coming to Amazon Bedrock. AWS is continuing to build Bedrock around its own key partners, like Anthropic, which is using AWS's custom-built "Project Rainier" supercomputer.

🔍 The Rundown

AWS

Q3 Financials Impress: AWS posted strong Q3 2025 earnings with $33 billion in revenue, marking 20% year-over-year growth. CEO Andy Jassy noted it was the "strongest growth in years," a pace not seen since 2022, all achieved despite a major global outage just before the quarter's end.

Building Vertical Infrastructure: AWS is building its own physical infrastructure, bringing its "Project Rainier" AI supercomputer (using nearly 500,000 custom Trainium2 chips) online for partner Anthropic. It also announced "Fastnet," a new private transatlantic subsea fiber cable connecting the U.S. and Ireland, giving it end-to-end control of its global network.

New AMD Instances: AWS launched two new instance families: the general-purpose M8a and the memory-optimized R8a. Both are powered by 5th-gen AMD EPYC processors and offer up to 30% higher performance than the previous generation.

Accelerating Top Secret Cloud: The Global Security and Compliance Acceleration (GSCA) partner program is being expanded to AWS Secret Cloud and AWS Top Secret Cloud. This is a strategic move to help AI startups navigate compliance hurdles and get cutting-edge tools into U.S. national security environments faster.

Azure

The "Sovereignty" Play: In a direct response to regulators, Microsoft announced a new EU Data Boundary to keep all EU customer AI data processing within the EU. It also revealed major upgrades to Azure Local (its on-prem, air-gapped solution), including higher scale, new NVIDIA GPU support, and a "fully disconnected" mode planned for 2026.

SAP "Agent-to-Agent" Chat: At SAP TechEd, Microsoft announced a new bi-directional integration between Microsoft Copilot and SAP's AI agent, Joule. This "agent-to-agent" communication is critical for Microsoft's strategy to make Copilot the universal interface for enterprise systems.

Securing AI Compute: Azure is aggressively buying up GPU capacity. Reports this week detailed a multi-year NVIDIA supply agreement with Lambda and a massive $9.7 billion, 5-year deal with IREN for 200 MW of data center capacity.

GCP

"Data Engineering Agent" Launch: Google embedded a new, Gemini-powered "Data Engineering Agent" directly into the BigQuery interface. This agent uses natural language to automate complex tasks like "create a pipeline to load data," troubleshoot failures, and generate data schemas, creating massive lock-in for its core data platform.

AI-Powered Malware Detected:Google's Threat Intelligence Group (GTIG) reported it has observed malware (tracked as PROMPTSTEAL) used by a state-sponsored actor (APT28) that queries an LLM (Qwen2.5) during execution to dynamically generate malicious commands. AI-powered attacks are officially no longer theoretical.

New .NET SDK Release: Google released a new, unified .NET SDK, allowing C# developers to use Gemini from a single library. This is an important "catch-up" move to court the vast, traditionally Microsoft-centric enterprise developer community.

OCI

Guiding for Massive Growth:Third-party analysis this week highlighted OCI's impressive FY2025 performance (49.3% YoY revenue growth). More importantly, Oracle management is now guiding for 77% OCI growth in FY2026, driven almost entirely by its massive build-out of AI infrastructure.

"Long-Running" Serverless Functions:For OCI Functions, the maximum function timeout has been extended to 60 minutes. This transforms OCI's serverless offering into a viable platform for lightweight batch and data processing workloads.

The "Agentic Finance" Leader:Oracle was named a Leader in two 2025 Gartner Magic Quadrant reports for Cloud ERP. Oracle is successfully linking this SaaS dominance to its OCI infrastructure, promoting the term "Agentic Finance" and proving its strategy of using its app layer to pull customers to its infrastructure layer is working.

📈 Trending Now: The Great Cloud Buildout Rolls On

This week, the "cloud" is looking decidedly physical. The AI arms race isn't just about who has the best model or agents; it's a brutal competition for a scarce physical supply chain: next-gen GPUs, power, and low-latency networking.

We're seeing every provider make massive, tangible infrastructure plays. AWS is vertically integrating, building its own custom-silicon AI supercomputer ("Project Rainier") for Anthropic while simultaneously laying its own private transatlantic subsea cable ("Fastnet") to own the network from end to end.

Microsoft, needing capacity now, is simply buying it, inking a $9.7 billion deal with IREN and another with Lambda just to secure data center space and GPUs. At the same time, it's "weaponizing sovereignty" by building specialized on-prem Azure Local hardware and new European regions (Austria and Belgium) to create regulatory moats that competitors can't easily cross. This is all about data gravity and data residency.

The game is no longer just about elastic, on-demand software. The winners will be those who can physically build—or buy—the most compute, power, and private networking, and place it in the right sovereign locations, fastest. The only question remaining is whether demand for this infrastructure can continue to keep pace.

📅 Event Radar

Nov
16-17
Google Cloud Summit Saudi Arabia | Riyadh
Not too late to register.
Nov
18-21
Microsoft Ignite | San Francisco
Major event coming up fast!
Dec
1-5

👋 Until Next Week

The buzzword of the week (or maybe the year), echoed by every single provider, was "agent." We saw GCP's "Data Engineering Agent," Oracle's "Agentic Finance," the Microsoft-SAP "agent-to-agent" integration, and the $38B AWS deal to power "agentic workloads."

We are clearly moving from Phase 2 of AI (calling a model via API) to Phase 3 (deploying and orchestrating autonomous agents). The big question for 2026 is no longer "who has the best model" but "who has the best platform for building, hosting, and orchestrating these agents."

It's a busy run-up to the end of the year. Stay tuned.

Do you enjoy these emails? Your friends and colleagues might, too! Help us grow the cloud community by sharing the newsletter with others.