Google Cloud's eighth-generation TPUs and the agentic-era infrastructure race

Dyanet Admin

25 Jun 2026 • 1 min read

At Google Cloud Next '26, Google expanded its AI Hypercomputer platform with two new eighth-generation TPUs: the TPU 8t for training, packing 9,600 chips in a single superpod for nearly 3x the compute of the prior generation, and the TPU 8i tuned for low-latency inference and reinforcement learning, delivering 80% better performance per dollar. Around them came the Virgo Network fabric, capable of linking 134,000 TPUs in one data center and over a million across sites, plus native PyTorch support (TorchTPU), GKE nodes that start up to 4x faster, and Managed Lustre storage at 10 TB/s. The connective thread is agentic workloads: fleets of specialized agents that decompose goals, preserve state, and demand both high throughput and very low latency, rather than simple chat. For teams planning cloud and AI roadmaps, the signal is clear: infrastructure is being re-architected around agents, and hardware/software co-design with open frameworks like JAX, PyTorch and vLLM is becoming the competitive lever.

This is a summary by our content curator. Read the original at Google Cloud Blog: https://cloud.google.com/blog/products/compute/ai-infrastructure-at-next26.

Sign up for more like this.

Enter your email

IBM Sovereign Core: digital sovereignty becomes an operational product for AI

At Think 2026, IBM moved digital sovereignty from policy to operations with the general availability of IBM Sovereign Core, a self-managed software platform for deploying and running AI and cloud workloads in environments governed entirely by the customer. The pitch is that sovereignty now reaches well beyond data residency: it

25 Jun 2026 1 min read

IBM and Red Hat's $5B Project Lightwell bets on securing open source in the AI era

IBM and Red Hat have committed $5 billion and more than 20,000 engineers to Project Lightwell, an initiative to secure the open-source software that underpins enterprise IT against AI-accelerated threats. The premise: frontier models are collapsing the exploit window from weeks to hours, with one preview model reportedly flagging

25 Jun 2026 1 min read