当 HBM 不再唯一,AI 内存的新棋局
Category: China Perspective (Bilibili) · Duration: 5 min · ▶ Watch
Speakers: ICVIEWS 半导体产业纵横
Segments (8)
- 00:00 · HBM技术简介
- 介绍了高带宽内存(HBM)作为下一代DRAM技术,通过垂直堆叠实现超高带宽,成为AI大模型训练与推理的关键组件。
- 00:20 · HBM市场格局变化
- 阐述了SK海力士凭借HBM技术优势,在市场中地位持续攀升,与三星的份额差距拉大,并首次在DRAM领域超越三星。
- 01:10 · 三星Z-NAND技术
- 三星计划重启搁置七年的Z-NAND存储技术,作为HBM的替代方案,旨在大幅提升性能、降低功耗和延迟。
- 01:34 · NEO Semiconductor的3D X-DRAM
- NEO Semiconductor推出面向AI芯片的X-HBM架构,基于其自研的3D X-DRAM技术,旨在突破传统HBM在带宽和容量上的瓶颈。
- 02:29 · Saimemory的全新堆叠式DRAM
- 由软银、英特尔和东京大学联合创立的Saimemory正在研发全新堆叠式DRAM架构,目标是成为HBM的直接替代者或更优方案。
- 02:55 · 存内计算(PIM)技术
- 介绍了存内计算(PIM)技术,通过将计算单元集成至数据存储位置,从根本上解决冯·诺依曼架构的数据移动瓶颈。
- 03:33 · 华为UCM推理记忆数据管理器
- 华为发布UCM技术,通过管理KV Cache来优化大模型推理过程,实现高吞吐、低时延,降低对HBM的依赖。
- 04:05 · 未来AI内存展望
- 总结认为,未来AI内存市场将呈现架构多样化,多种技术并存,共同构成一个异构、多元的层级体系,以适应不同应用场景。
Memory Facts (8)
- [00:02] HBM (High Bandwidth Memory) is a type of DRAM that vertically stacks multiple DRAM dies.
- Layers
- [00:13] HBM’s 3D architecture provides bandwidth far exceeding traditional GDDR memory.
- Bandwidth
- [01:20] Z-NAND performance is close to DRAM, with system latency much lower than traditional SSDs.
- Latency, Performance
- [01:25] Z-NAND is based on an improved V-NAND design, using a 48-layer structure and operating in SLC mode.
- Layers, Mode
- [01:44] 3D X-DRAM is described as a ‘3D NAND-like DRAM’.
- Architecture
- [02:14] X-HBM can achieve a 32K-bit data bus and a single-chip capacity of 512Gbit.
- Bus width, Capacity (Gbit)
- [02:20] X-HBM’s bandwidth and density are 16x and 10x that of existing memory, respectively.
- Ratio (x)
- [02:45] Saimemory’s technology aims to double storage capacity and reduce power consumption by 40-50%.
- Capacity, Power
Bottleneck Claims (4)
- [01:45] Traditional HBM faces bottlenecks in bandwidth and capacity.
- Evidence: The video introduces 3D X-DRAM as a technology that breaks through these bottlenecks.
- [01:51] Traditional DRAM has 2D scaling limitations.
- Evidence: 3D X-DRAM is presented as a solution that breaks this 2D scaling limit.
- [01:54] Horizontal expansion of memory leads to area and power consumption problems.
- Evidence: 3D X-DRAM’s vertical stacking is shown as a solution to this problem.
- [02:59] The traditional von Neumann architecture suffers from a data movement bottleneck, where system performance is limited by data transfer delays.
- Evidence: The video explains that processing units and storage are separate, causing delays.
Predictions (4)
- [04:20, Future] The future AI memory market will not be a simple binary of ‘replacement vs. being replaced’.
- [04:31, Future] The AI computing and memory domain will not see a single winner that completely replaces HBM.
- [04:41, Near Future] The era of a single memory solution dominating high-performance computing is ending.
- [04:44, Future] The future AI memory landscape will be a heterogeneous, multi-level system with different technologies for different scenarios (HBM for training, PIM for high-efficiency inference, etc.).
Key Technologies (6)
- HBM (High Bandwidth Memory): Vertically stacks DRAM dies to achieve ultra-high bandwidth for AI accelerators.
- TSV (Through-Silicon Via): Provides vertical electrical connections between stacked dies in HBM.
- Z-NAND: Samsung’s high-performance, low-latency NAND flash technology, positioned as an alternative to HBM for certain workloads.
- 3D X-DRAM: NEO Semiconductor’s technology that uses a 3D NAND-like structure for DRAM, aiming to overcome traditional DRAM scaling limits.
- PIM (Processing-in-Memory): Integrates computation directly into memory to reduce or eliminate the data movement bottleneck between CPU/GPU and memory.
- UCM (Unified Cache Manager): Huawei’s technology for managing KV Cache in AI inference, optimizing performance and reducing reliance on HBM capacity.
Companies Mentioned (9)
SK海力士 (SK Hynix) · 美光 (Micron) · 三星 (Samsung) · NEO Semiconductor · 软银 (SoftBank) · 英特尔 (Intel) · 东京大学 (University of Tokyo) · Saimemory · 华为 (Huawei)
Notable Quotes (3)
HBM替代方案的创新版图呈现出“架构哲学的多样性”,而非单一技术的迭代。 — Narrator @ 04:24
取而代之的将是更复杂、分散化且贴合应用场景的内存层级结构。 — Narrator @ 04:35
单一内存解决方案主导高性能计算的时代正在落幕。 — Narrator @ 04:41
Key Topics
AI Memory · Semiconductor · HBM (High Bandwidth Memory) · DRAM · NAND Flash · Processing-in-Memory (PIM) · System Architecture · Market Competition · Technological Innovation
Takeaways
- HBM is the current standard for high-performance AI memory but is facing increasing competition from alternative technologies.
- SK Hynix has established a dominant position in the HBM market, surpassing competitors like Samsung.
- Companies are developing diverse alternatives to HBM, such as Samsung’s Z-NAND and NEO Semiconductor’s 3D X-DRAM, to address cost, power, and scaling limitations.
- New system architectures like Processing-in-Memory (PIM) and Huawei’s UCM are emerging to tackle the fundamental data movement bottleneck, reducing reliance on pure memory bandwidth.
- The future of AI memory is not a single-winner market but a heterogeneous ecosystem where different memory technologies are optimized for specific tasks (e.g., HBM for training, PIM for inference).
- For China, there is a significant gap in domestic HBM technology, making the development of alternative memory architectures a strategic priority to support its domestic AI industry.