GTC 2026 Telecom: The AI Grid
Category: Telecom Special Address · Year: 2026 · ▶ Watch
Segments (21)
- 00:00 · Introduction to the AI Grid
- Ronnie Vasishta introduces the concept of the AI Grid, a new infrastructure buildout connecting the physical world with AI.
- 03:24 · Three Tectonic Shifts in Telecom
- Discussion of accelerated computing, the evolution of AI applications, and the transition from 5G to 6G.
- 05:57 · Merging the Telecom and AI Stacks
- How accelerated computing enables the convergence of purpose-built telecom stacks with general-purpose AI stacks.
- 07:55 · Telcos’ Prime Locations for AI
- Highlighting the unique advantage of telecom operators in possessing distributed land, power, and shell infrastructure.
- 10:15 · Sovereign AI Factories
- Overview of 24 telcos globally building sovereign AI factories to serve local regions and enterprises.
- 11:25 · Distributed Edge Primed for AI
- The massive untapped opportunity of utilizing spare power at regional POPs and central offices for AI compute.
- 12:20 · New AI Applications and Tokenomics
- Exploring the killer apps for the edge and the new economic model based on token generation and latency.
- 15:55 · Personal AI Edge Inference Demo
- A demonstration comparing the latency and cost of running an AI assistant in the cloud versus at the edge.
- 17:30 · Decart Edge Inference Demo
- A demonstration showing the performance benefits of edge rendering for real-time video personalization.
- 19:10 · AT&T and Cisco IoT AI Grid
- Announcement of AT&T and Cisco building an AI Grid for IoT using NVIDIA RTX 6000 GPUs.
- 20:45 · Comcast AI at the Edge
- Comcast’s initiative to bring AI to the edge for personalized advertising and small business agents.
- 21:40 · Spectrum AI Grid for Movie Makers
- Spectrum’s deployment of low-latency hubs for high-performance graphics rendering in the media industry.
- 22:35 · Akamai Intelligent AI Orchestration
- Akamai’s global-scale implementation of AI inference orchestration across thousands of edge locations.
- 23:25 · RAN as a Workload on the AI Grid
- The shift towards AI-RAN, allowing radio access networks to run as software workloads alongside AI applications.
- 25:30 · T-Mobile Physical AI Apps on AI-RAN
- T-Mobile’s integration of physical AI apps, like traffic monitoring, onto their AI-RAN infrastructure in San Jose.
- 28:25 · Indosat Ooredoo Hutchison Sovereign AI Factory
- IOH’s efforts to build an AI Grid for local innovation and close the digital divide in Indonesia.
- 30:05 · NVIDIA AI Grid Reference Architecture
- Introduction of the full-stack reference architecture and ecosystem partners for building the AI Grid.
- 31:40 · Aerial Omniverse Digital Twin Demo
- A video presentation showcasing the simulation of AI-native wireless networks in a digital twin environment.
- 33:25 · 6G as the Fabric for Physical AI
- The role of 6G in connecting billions of devices, robots, and AI agents in the physical world.
- 36:45 · Democratizing 6G Research
- NVIDIA’s initiatives and developer programs aimed at accelerating 6G research and development.
- 39:55 · Road to the AI-Native Telco
- Concluding remarks on the generational opportunity for telecom operators to embrace AI-native architectures.
Product Announcements (3)
- [30:05] NVIDIA AI Grid Reference Architecture
- A full-stack hardware and software reference architecture for deploying AI and RAN workloads on shared infrastructure.
- specs: Integrates AI Grid Control Plane, NVIDIA AI-RAN Computer, and supports various ecosystem partners.
- availability: Available through ecosystem partners.
- [30:05] RTX Pro 6000 Blackwell Server Edition
- A server-class GPU designed for AI-RAN and edge AI workloads.
- specs: Optimized for distributed edge deployments and AI inference.
- availability: Not specified.
- [31:40] Aerial Omniverse Digital Twin
- A platform for simulating and optimizing wireless networks and physical AI agents.
- specs: Physically accurate 3D environments, ray-tracing channel simulation, and integration with 3rd party digital twins.
- availability: Not specified.
Specific Numbers (12)
| Timestamp | Metric | Value | Context |
|---|---|---|---|
| 10:21 | Telco AI Factories | 24 | Number of telcos building Sovereign AI Factories across 5 continents. |
| 11:25 | Regional POPs | 100,000 | Estimated number of regional points of presence globally. |
| 11:25 | Spare Energy | 1 MW | Average spare energy available per regional POP site. |
| 11:25 | AI Capacity | 100 GW | Total estimated AI capacity available today at distributed edge locations. |
| 16:45 | Latency | 400ms | End-to-end latency for the Personal AI app running on an Edge SLM, compared to 2000ms on a Cloud LLM. |
| 16:55 | Cost per 1M Tokens | $0.02 | Cost of running the Personal AI app on an Edge SLM, compared to $0.80 on a Cloud LLM. |
| 18:40 | Frames Per Second (FPS) | 30 | FPS achieved by Decart’s video personalization app at the edge, compared to 7-16 FPS in the cloud. |
| 20:57 | Edge Locations | 65 million | Number of locations reached by Comcast’s edge compute network. |
| 22:48 | Edge and Core Locations | 4,400+ | Number of locations in Akamai’s global AI inference network. |
| 27:23 | Performance Improvement | 5x | Improvement in speed of seeing actions at a traffic intersection using T-Mobile’s AI-RAN infrastructure. |
| 37:20 | Developer Program Members | 7,000+ | Number of members in the NVIDIA 6G Developer Program. |
| 37:38 | Sionna Downloads | 375,000+ | Number of downloads for the Sionna link-level simulator. |
Benchmark Claims (4)
- [16:45] End-to-End Latency (Personal AI): 400ms
- vs: 2000ms (Cloud LLM)
- gain: 5x reduction in latency for a more responsive conversational experience.
- [16:55] Cost per 1M Tokens (Personal AI): $0.02
- vs: $0.80 (Cloud LLM)
- gain: 40x reduction in inference cost by utilizing edge SLMs.
- [18:40] Frames Per Second (Decart Video Personalization): 30 FPS
- vs: 7-16 FPS (Cloud)
- gain: Smooth, real-time video rendering by mitigating cloud jitter through edge proximity.
- [27:23] Vision AI Processing Speed (T-Mobile/San Jose): 5x faster
- vs: Previous non-edge implementations
- gain: Significantly faster anomaly detection at traffic intersections.
Customer Stories (6)
- [19:15] AT&T and Cisco
- Teamed up with NVIDIA to build an AI Grid for IoT, deploying RTX 6000 GPUs in UCS servers.
- outcome: Created a zero-trust, on-demand AI Grid to deliver intelligence at the cellular network edge for public safety and other use cases.
- [20:45] Comcast
- Partnered with Personal AI and Decart to bring AI to their massive edge compute network.
- outcome: Enabled hyper-personalized ads and small business concierge agents with low latency.
- [21:40] Spectrum
- Transformed data centers into low-latency hubs using RTX 6000 GPUs.
- outcome: Provided high-performance graphics rendering capabilities for movie makers requiring complex 3D collaboration.
- [22:35] Akamai
- Implemented a global-scale Intelligent AI Orchestration Platform across over 4,400 locations.
- outcome: Enabled SLA-aware distribution of content and AI applications globally.
- [25:30] T-Mobile
- Integrated physical AI apps, like Metropolis VSS, onto their AI-RAN ready infrastructure in San Jose.
- outcome: Achieved a 5x performance improvement in processing traffic intersection camera feeds for anomaly detection.
- [28:25] Indosat Ooredoo Hutchison (IOH)
- Built a Sovereign AI Factory and deployed AI applications like the Sahabat AI app to the distributed edge.
- outcome: Provided localized AI services for education, agriculture, and healthcare to help close the digital divide in Indonesia.
Key Technologies (4)
- AI Grid: A distributed infrastructure that connects the physical world and deploys AI at scale into the hands of end-users.
- AI-RAN: Allows Radio Access Network (RAN) and AI workloads to run concurrently on the same software-defined, accelerated computing infrastructure.
- Tokenomics: An economic model for AI inference that values compute based on metrics like tokens per second and time to first token.
- Aerial Omniverse Digital Twin: A simulation platform that uses physically accurate 3D environments and ray-tracing to optimize wireless networks and physical AI agents.
Demos Shown (3)
- [15:55] A side-by-side comparison of a Personal AI auto repair voice assistant running on a Cloud LLM versus an Edge SLM, highlighting latency and cost differences.
- True
- [17:30] A demonstration by Decart showing real-time video personalization (inserting a product into a video stream), comparing the frame rate of cloud rendering versus edge rendering.
- True
- [31:40] A video presentation of the Aerial Omniverse Digital Twin, showing how developers can simulate 6G networks, beamforming, and autonomous robots in a virtual city.
- True
Predictions / Commitments (3)
- [04:55, Next generation of telecom networks (6G)] 6G is being born in the era of AI and will be deployed to be the fabric of AI.
- [06:55, Near future] The convergence of the compute stack will create completely new KPIs for the industry and lead to unicorns built overnight.
- [33:40, 2-3+ years from now] 6G will provide the connectivity fabric required to connect hundreds of billions of humans, machines, and AI agents.
Companies Mentioned (7)
Cisco · Nokia · Ericsson · SoftBank · AWS · Fujitsu · Samsung
Notable Quotes (3)
AI is redefining computing and driving the largest infrastructure buildout in human history—and telecommunications is next. — Ronnie Vasishta (quoting Jensen Huang) @ 05:30
The ability to provide tokenomics, tokenization in that distributed network of value is very, very unique to this infrastructure. — Ronnie Vasishta @ 12:28
There has never been a better time to be in Telcos. — Ronnie Vasishta @ 40:49
Key Topics
AI Grid · Telecommunications · AI-RAN · Edge Computing · 6G Networks · Accelerated Computing · Tokenomics · Physical AI · Sovereign AI · Digital Twins · Network Orchestration · Software-Defined Infrastructure
Takeaways
- The telecommunications industry is undergoing a massive transformation as its purpose-built infrastructure merges with the general-purpose AI compute stack.
- Telcos possess a unique advantage with their highly distributed real estate, power, and connectivity, making them prime candidates to build the ‘AI Grid’.
- Deploying AI at the edge (AI Grid) significantly reduces latency and costs compared to cloud-based LLMs, enabling new ‘killer apps’ like real-time video personalization and physical AI.
- AI-RAN allows operators to run radio access networks as software workloads alongside AI applications on the same accelerated hardware, maximizing utilization.
- Major telecom and tech companies (AT&T, T-Mobile, Comcast, Cisco, Nokia, Ericsson) are already actively deploying and testing AI Grid and AI-RAN architectures.
- 6G will be the first network generation ‘born in the era of AI’ and will serve as the essential connectivity fabric for billions of physical AI agents and robots.
- NVIDIA is providing full-stack reference architectures, GPUs (RTX Pro 6000), and simulation tools (Aerial Omniverse Digital Twin) to accelerate the telecom industry’s transition to AI-native networks.