GTC 2019 Keynote (San Jose)
Category: Main Keynote · Year: 2019 · ▶ Watch
Speakers: Eyal Waldman - CEO, Mellanox Technologies · Jensen Huang - CEO, NVIDIA · Jensen Huang, CEO, NVIDIA · Matt Garman - Vice President of Compute Services, Amazon Web Services · Microsoft Engineer - Microsoft · OmniSci Engineer - OmniSci
Segments (30)
- 00:00 · I am AI Intro
- A video montage showcasing the diverse applications of NVIDIA AI across various industries.
- 02:48 · Accelerated Computing Growth
- Jensen Huang highlights the massive growth in NVIDIA’s developer ecosystem and CUDA downloads.
- 06:15 · Introducing CUDA-X and PRADA
- NVIDIA consolidates its acceleration libraries into CUDA-X, driven by the PRADA philosophy.
- 15:05 · Chapter 1: Graphics and Real-Time Ray Tracing
- A showcase of real-time ray tracing capabilities using RTX hardware and the Unity engine.
- 21:00 · Turing Architecture
- Deep dive into the Turing GPU architecture, featuring RT cores, Tensor cores, and advanced shading techniques.
- 27:00 · RTX in Gaming: Dragonhound and Quake II
- Demonstrations of RTX technology enhancing visual fidelity in modern and classic games.
- 34:50 · RTX for Offline Rendering
- How RTX servers drastically reduce the time and cost of offline rendering for film and design.
- 41:15 · Announcing NVIDIA Omniverse
- Introduction of an open collaboration platform that connects various 3D design tools in real-time.
- 49:00 · GeForce NOW Cloud Gaming
- A brief mention of NVIDIA’s cloud gaming service bringing high-end graphics to all PC users.
- 50:00 · Cloud Gaming Partnerships
- Jensen discusses cloud gaming and announces the GeForce NOW alliance with Softbank and LGU+.
- 54:00 · RTX Server and Pod
- Introduction of the RTX Server and RTX Server Pod for datacenter graphics and cloud gaming.
- 56:25 · Project Sol Demo
- A real-time cinematic rendering demo showcasing the power of RTX.
- 60:45 · Chapter 2: AI & HPC
- Focus shifts to data science, introducing the CUDA-X AI ecosystem and RAPIDS partnerships.
- 73:55 · AI Conversational Search
- Discussion on the complexity of conversational AI, followed by a live demo of Microsoft Bing.
- 84:35 · Clara AI Toolkit
- Announcement of the Clara AI Toolkit to accelerate AI applications in radiology.
- 87:30 · Data Science and OmniSci Demo
- Highlighting data science as the new HPC challenge, featuring a live data analytics demo with OmniSci.
- 1:40:00 · Data Analytics and RAPIDS
- Jensen discusses the RAPIDS open-source framework and its impact on accelerating data science workflows.
- 1:42:40 · Data Science Workstations
- Introduction of new workstations optimized for the growing field of data science.
- 1:45:30 · Data Science - The New HPC Challenge
- Comparing supercomputing, hyperscale, and the unique computational demands of data science.
- 1:56:25 · Mellanox Partnership
- Eyal Waldman joins to discuss the importance of networking and the NVIDIA-Mellanox acquisition.
- 2:01:50 · Enterprise Data Science Solutions
- Announcing DGX Pods and T4-powered enterprise servers for scalable data science.
- 2:06:40 · AWS Partnership and G4 Instances
- Matt Garman discusses ML on AWS, customer success stories, and new T4-powered G4 instances.
- 2:13:30 · Robotics and Jetson Nano
- Introduction of the $99 Jetson Nano computer and the Isaac robotics platform.
- 2:21:15 · Autonomous Vehicles and DRIVE AP2X
- Updates on the DRIVE platform, including Release 9.0 and the Safety Force Field path planning.
- 150:00 · DRIVE AV Safety Force Field
- Jensen Huang announces DRIVE AV Safety Force Field for path planning and prediction.
- 151:45 · DRIVE Constellation
- Announcement of the availability of DRIVE Constellation, a virtual AV test fleet.
- 152:30 · DRIVE Constellation Demo
- A demonstration of the DRIVE Constellation simulator showing various driving conditions.
- 156:20 · Autonomous Driving Demo
- Video demonstration of an autonomous vehicle driving on a highway with perception overlays.
- 157:55 · Toyota Partnership
- Announcement of a partnership with Toyota TRI-AD for autonomous vehicles.
- 158:25 · Keynote Summary
- Jensen Huang summarizes the key announcements from the entire keynote.
Product Announcements (20)
- [06:15] CUDA-X
- A unified suite of GPU-accelerated computing libraries.
- specs: Combines domain-specific libraries (RTX, HPC, AI, DR, IS, CL, ME) into a single stack compatible with all NVIDIA GPUs.
- availability: Available now via NGC.
- [20:30] Unity RTX Experimental Package
- Real-time ray tracing support for the Unity game engine.
- specs: Enables developers to build physically accurate lighting and reflections in real-time.
- availability: Available April 4th.
- [26:00] New Turing GPUs and Laptops
- Expansion of the Turing architecture lineup.
- specs: Includes lower-tier GPUs without RT cores and 40 new gaming laptop models.
- availability: Starting at $219 for the entry-level GPU.
- [30:50] Quake II RTX
- A fully path-traced version of the classic game Quake II.
- specs: Features dynamic time of day, volumetric lighting, and physically based materials.
- availability: To be open-sourced soon.
- [41:15] NVIDIA Omniverse
- An open collaboration platform for 3D production pipelines.
- specs: Uses Universal Scene Description (USD) and Material Definition Language (MDL) to sync tools like Maya, Unreal, and Substance in real-time.
- availability: Early access available at developer.nvidia.com/nvidia-omniverse.
- [54:05] RTX Server
- Datacenter graphics server design
- specs: 40 Turing GPUs in 8U
- availability: N/A
- [55:00] RTX Server Pod
- Modular design for enterprise and cloud edge datacenters
- specs: 32 RTX servers, 1280 GPUs in 10 racks, up to 10,000 concurrent users
- availability: N/A
- [68:45] CUDA-X AI
- End-to-end AI and data science ecosystem
- specs: Integrates RAPIDS, TensorRT, cuDNN, and other libraries
- availability: N/A
- [84:35] Clara AI Toolkit
- Toolkit for building and deploying AI applications for radiology
- specs: AI-assisted annotation, transfer learning, AI deployment
- availability: Available at developer.nvidia.com/clara
- [1:43:30] Data Science Workstation
- A workstation optimized for data scientists.
- specs: Powered by NVIDIA GPU and CUDA-X AI, Dual Quadro RTX 8000 with 96 GB Memory.
- availability: Available from top computer makers (Dell, HP, Lenovo).
- [1:50:50] NVIDIA T4 Tensor Core GPU
- A GPU designed for scale-out enterprise servers.
- specs: 70 watts, fits in standard servers, 4x T4 provides ~260 Teraflops FP16.
- [2:01:50] NVIDIA DGX Pod
- A hyper-converged infrastructure reference architecture for AI.
- specs: Integrates compute, storage, and networking from partners like DDN, Dell EMC, NetApp, Pure Storage, Arista, Cisco, Mellanox.
- availability: Can be installed in 1 day.
- [2:02:50] T4 Enterprise Servers
- Enterprise servers optimized for data science.
- specs: Powered by NVIDIA T4 and CUDA-X AI, NGC Certified.
- availability: Available from Cisco, Dell EMC, Fujitsu, HPE, Inspur, Lenovo, Sugon.
- [2:12:10] Amazon EC2 G4 Instances
- New cloud instances on AWS.
- specs: Featuring NVIDIA T4 Tensor Core GPUs, designed for ML inference, graphics, and video transcoding.
- availability: Coming soon.
- [2:14:30] Jetson Nano
- A small, low-cost AI computer for robotics and edge devices.
- specs: CUDA-X acceleration stack, high-resolution sensor support, runs all CUDA-X AI models.
- availability: $99
- [2:16:20] Isaac Open SDK
- A software development kit for robotics.
- specs: Includes Isaac Robot Engine, Isaac Sim, and Isaac Gym.
- availability: Available at developer.nvidia.com/isaac-sdk
- [2:23:00] DRIVE AP2X Release 9.0
- High function L2+ autopilot system software.
- specs: On-ramp to off-ramp, surround perception, localization to HD maps, real-time mapping.
- [2:27:50] Safety Force Field
- A computational method for autonomous vehicle path planning.
- specs: Mathematically verifiable, designed to computationally avoid causing harm.
- availability: Open platform.
- [150:00] DRIVE AV Safety Force Field
- Path planning and prediction software for autonomous vehicles.
- specs: Designed for AV safety and drive comfort, mathematically verifiable, validated in simulation, open platform.
- availability: Announced
- [151:45] DRIVE Constellation
- Virtual AV Test Fleet simulator.
- specs: Bit-accurate, hardware-in-the-loop simulator, tests corner and rare conditions, cloud-based workflow.
- availability: Available Now
Specific Numbers (17)
| Timestamp | Metric | Value | Context |
|---|---|---|---|
| 03:30 | NVIDIA Developers | 1.8 Million | A 50% growth from the previous year. |
| 03:30 | CUDA Downloads | 12 Million | A 50% growth from the previous year. |
| 05:30 | Performance Increase | 40x | Improvement in accelerated computing performance from 2010 to 2019. |
| 21:00 | Turing Transistors | 18 Billion | Transistor count on the high-end Turing RTX architecture. |
| 21:00 | Tensor Core Performance | 130 TFLOPS | Compute capability of the Tensor cores on Turing. |
| 34:50 | 3D Creators | 9 Million | The addressable market of creators getting access to RTX in 2019. |
| 38:30 | Render Cost Savings | $220,000 | The cost difference between a 25-node CPU render farm ($250k) and a 1-node RTX Server ($30k). |
| 54:05 | GPUs per server | 40 | Number of Turing GPUs in an 8U RTX Server |
| 55:00 | GPUs per pod | 1280 | Number of GPUs in a 10-rack RTX Server Pod |
| 55:00 | Concurrent users | 10,000 | Supported users per RTX Pod |
| 83:50 | TensorRT Downloads | 300k | Growth from 50k in 2017 to 300k in 2018 (6X increase) |
| 1:41:15 | Query Time | 4 minutes | Reduced from 8 days using the new accelerated platform. |
| 1:42:40 | Data Scientists | 3 million | Estimated number of data scientists worldwide. |
| 1:47:45 | Supercomputer Compute Load | 1 billion petaflops | Measured in instances of compute, not time. |
| 1:50:50 | T4 Power Consumption | 70 watts | Power draw of the T4 GPU. |
| 2:13:40 | Jetson Developers | 200,000 | Number of developers using the Jetson platform. |
| 2:14:30 | Price | $99 | Cost of the new Jetson Nano computer. |
Benchmark Claims (5)
- [05:30] Accelerated Computing Performance: 40x
- vs: 2010 baseline performance
- gain: 40x improvement over 9 years through full-stack optimization.
- [38:30] Offline Rendering (Incredibles 2): 6 Hours (1 RTX Server)
- vs: 38 Hours (25 Dual Skylake CPU nodes)
- gain: Over 6x faster rendering at a fraction of the hardware and power cost.
- [1:44:30] End-to-End Data Science Workflow: Significantly faster
- vs: CPU
- gain: Visual chart shows a massive reduction in time for data prep, training, and end-to-end tasks.
- [2:04:25] Acceleration of Data Science Clusters (End to End): 3 minutes
- vs: 10x CPU Nodes (35 minutes)
- gain: Over 10x faster using 10x T4 Nodes.
- [2:05:50] Deep Learning Scaling (ResNet-50): Linear scaling
- vs: Standard Ethernet
- gain: Using RoCE RDMA prevents the performance plateau seen with standard networking as server count increases.
Customer Stories (9)
- [37:20] Image Engine
- Utilized NVIDIA rendering technology for complex visual effects.
- outcome: Achieved high-fidelity, photorealistic rendering for film and television production.
- [38:30] Pixar
- Benchmarked rendering a frame of Incredibles 2 on CPU nodes versus an RTX Server.
- outcome: Reduced render time from 38 hours to 6 hours and cut hardware costs drastically.
- [53:30] Softbank and LGU+
- Partnered with NVIDIA for cloud gaming
- outcome: Announcing GeForce NOW alliance
- [77:25] Microsoft Bing
- Implemented AI conversational search using NVIDIA technology
- outcome: Improved search accuracy and user experience with complex, multi-part queries
- [86:50] MGH, NIH, OSU, DKFZ
- Adopted Clara AI for radiology workflows
- outcome: Reduced annotation time from hours to minutes, deployed clinical models in less than 24 hours
- [2:09:50] Western Digital
- Used P3s and Voltas on AWS for material science properties and magnetic/heat flows.
- outcome: Improved the quality of their disk drives.
- [2:10:15] Celgene
- Used AI for drug design, moving from an on-premise cluster to AWS.
- outcome: Reduced processing time from 2 months to 6 hours.
- [2:11:10] Lyft
- Ran all 50 million monthly rides on AWS, using AI/ML on P3s and Voltas with SageMaker.
- outcome: Calculated fares, optimized drop-off/pick-up, and performed fraud detection.
- [157:55] Toyota (TRI-AD)
- Partnered with NVIDIA to create the future of autonomous vehicles.
- outcome: Collaboration on AV Core Systems, Driving Simulation, In-Car Computer, and AI for AV.
Key Technologies (16)
- CUDA-X: A unified collection of GPU-accelerated libraries for AI, HPC, and graphics.
- RTX (Real-Time Ray Tracing): Hardware-accelerated calculation of light paths to create photorealistic reflections, shadows, and global illumination.
- Turing Architecture: GPU architecture featuring dedicated RT Cores for ray tracing and Tensor Cores for AI processing.
- Variable Rate Shading (VRS): Optimizes rendering performance by varying the shading rate across different areas of a frame.
- Mesh Shaders: A new geometry pipeline that allows for highly complex and detailed scenes.
- Omniverse: A platform that enables real-time, multi-tool collaboration for 3D content creation using USD.
- RTX Server: Provides scalable, high-performance graphics and compute for datacenters.
- CUDA-X AI: A comprehensive software acceleration library for AI and data science.
- Clara AI: A platform for developing and deploying AI in medical imaging.
- RAPIDS: An open-source machine learning and data science framework.
- CUDA-X: The underlying engine and acceleration stack for NVIDIA’s AI and data science platforms.
- InfiniBand and Ethernet (Mellanox): High-speed, low-latency networking protocols for data centers.
- RoCE RDMA: RDMA over Converged Ethernet, used to improve network efficiency and scaling for deep learning.
- Safety Force Field: A computational path-planning algorithm for autonomous vehicles to ensure safety.
- Safety Force Field: Provides path planning and prediction to ensure autonomous vehicle safety.
- Hardware-in-the-loop simulation: Allows bit-accurate testing of autonomous vehicle software in a virtual environment.
Demos Shown (13)
- [15:40] A high-fidelity rendering of BMW cars to demonstrate photorealism.
- True
- [19:05] Live manipulation of a BMW interior and exterior using real-time ray tracing in Unity.
- True
- [27:10] Gameplay footage of Nexon’s Dragonhound toggling RTX reflections and shadows on and off.
- True
- [30:55] A fully path-traced version of Quake II showing dynamic lighting, glass refraction, and volumetric effects.
- True
- [37:20] A complex visual effects rendering sequence by Image Engine.
- True
- [45:35] Live collaboration in NVIDIA Omniverse showing updates in Maya, Unreal Engine, and Substance Painter reflecting instantly in a shared viewer.
- True
- [56:25] Project Sol cinematic rendering
- Yes
- [77:25] Microsoft Bing conversational search on a mobile device
- Yes
- [89:15] OmniSci analyzing and visualizing WiFi access point data
- Yes
- [2:18:50] A small, green robot named Kaya powered by Jetson Nano driving on stage.
- Yes
- [2:24:20] Video demonstration of DRIVE AP2X features including MyRoute, WaitNet, Sensor Fusion, and DRIVE Sim.
- Yes (pre-recorded video)
- [152:30] DRIVE Constellation simulation showing multiple camera views of a car driving through different weather and lighting conditions.
- True
- [156:20] Video of a real car driving autonomously on a highway, alongside visualizations of the car’s perception and planning systems.
- True
Predictions / Commitments (4)
- [11:15, Future] We believe that in the future, they will all be high-performance computing customers.
- [36:45, End of 2019] By the end of this year, we should have all of them [major design tools] in production with RTX.
- [1:55:10, Current/Ongoing trend] East-West networking traffic in data centers is going up exponentially.
- [1:55:55, Future] The way you design a data center is going to change; networking and compute will become one continuous computing fabric.
Companies Mentioned (22)
Unity · Microsoft · Epic Games · Nexon · Intel · Autodesk · Softbank · LGU+ · Databricks · Google Cloud · Microsoft Azure · Accenture · ONNX Runtime · OmniSci · Dell, HP, Lenovo · Mellanox Technologies · DDN, Dell EMC, NetApp, Pure Storage, Arista, Cisco · Cisco, Dell EMC, Fujitsu, HPE, Inspur, Lenovo, Sugon · Amazon Web Services (AWS) · Toyota · AWS · Mellanox
Notable Quotes (6)
PRADA stands for PRogrammable Acceleration of multiple Domains with one Architecture. — Jensen Huang @ 14:00
Which one is real? Left or right? … This is not real. … That one is real. — Jensen Huang @ 16:45
The more you buy, the more you save… I think I was wrong. RTX servers are free. — Jensen Huang @ 39:50
Data Science is the new HPC. — Jensen Huang @ 1:45:30
The network is going to become really, really important. — Jensen Huang @ 1:56:10
We have more machine learning is done in AWS in the cloud than anywhere. — Matt Garman @ 2:09:40
Key Topics
Accelerated Computing · CUDA-X · Real-Time Ray Tracing · NVIDIA RTX · Turing Architecture · Unity Engine · Unreal Engine · Offline Rendering · NVIDIA Omniverse · 3D Collaboration · Path Tracing · Cloud Gaming · Cloud Gaming · RTX Server · CUDA-X AI · Data Science · RAPIDS · Conversational AI · Medical Imaging · Clara AI · Data Analytics · Data Science · RAPIDS · High Performance Computing (HPC) · Workstations · Networking · Mellanox · Enterprise Servers · Cloud Computing · AWS · Robotics · Jetson Nano · Isaac SDK · Autonomous Vehicles · DRIVE AP2X · Autonomous Vehicles · DRIVE AV · Safety Force Field · Path Planning · DRIVE Constellation · Simulation · Hardware-in-the-loop · Toyota Partnership · Accelerated Computing · CUDA-X
Takeaways
- NVIDIA’s developer ecosystem is experiencing massive growth, driven by the adoption of accelerated computing.
- CUDA-X unifies NVIDIA’s software libraries, providing a programmable architecture across multiple domains (PRADA).
- Real-time ray tracing is now a reality, with major game engines like Unity and Unreal integrating RTX support.
- The Turing architecture represents a massive leap in graphics, utilizing dedicated RT and Tensor cores to handle complex lighting and AI tasks.
- RTX Servers offer disruptive cost and time savings for offline rendering, making traditional CPU render farms obsolete.
- NVIDIA Omniverse aims to revolutionize 3D production pipelines by enabling seamless, real-time collaboration across disparate design tools globally.
- NVIDIA is expanding its RTX technology from desktop gaming to cloud servers and enterprise pods.
- CUDA-X AI consolidates NVIDIA’s software libraries to accelerate data science and AI workflows.
- Strong partnerships with major cloud providers and integrators are driving the adoption of RAPIDS.
- AI is significantly enhancing conversational search capabilities, as demonstrated by Microsoft Bing.
- The Clara AI Toolkit aims to accelerate AI adoption and model deployment in the radiology field.
- GPU-accelerated data analytics platforms like OmniSci provide real-time insights into massive datasets.
- NVIDIA is heavily targeting the data science market with optimized hardware (workstations, servers) and software (RAPIDS).
- Data science workloads require a new architecture that bridges the gap between traditional supercomputing and hyperscale cloud.
- The acquisition of Mellanox highlights NVIDIA’s belief that high-speed, low-latency networking is critical for scaling AI and data science.
- NVIDIA is partnering broadly across hardware OEMs, storage providers, and cloud providers (like AWS) to deliver end-to-end AI solutions.
- The $99 Jetson Nano aims to democratize AI robotics development for makers, students, and edge applications.
- NVIDIA’s autonomous vehicle strategy includes full-stack software (DRIVE AP2X) and verifiable safety models (Safety Force Field).
- NVIDIA introduced DRIVE AV Safety Force Field to provide mathematically verifiable path planning for autonomous vehicles.
- DRIVE Constellation is now available, offering a cloud-based, hardware-in-the-loop virtual testing fleet for AVs.
- NVIDIA demonstrated the capabilities of DRIVE Constellation to simulate complex and rare driving conditions.
- A major partnership with Toyota TRI-AD was announced to develop the future of autonomous vehicles.
- The keynote concluded with a summary of NVIDIA’s full-stack approach, spanning from RTX graphics to data science and autonomous machines.