Picture
SEARCH
What are you looking for?
Need help finding what you are looking for? Contact Us
Compare

PUBLISHER: TrendForce | PRODUCT CODE: 2043020

Cover Image

PUBLISHER: TrendForce | PRODUCT CODE: 2043020

Crossing AI Memory Wall: Storage Layer Reallocation and HBF Analysis

PUBLISHED:
PAGES: 13 Pages
DELIVERY TIME: 1-2 business days
SELECT AN OPTION
PDF (Corporate License)
USD 2000

Add to Cart

In AI inference, MoE architectures and long-context processing have sharply increased memory-capacity requirements for model weights and KV cache, shifting the bottleneck from insufficient compute to limited memory capacity. As warm data grows rapidly, this will drive a restructuring of the storage hierarchy, where HBM will handle hot data, while HBF will carry warm data to optimize cost–performance. However, commercialization of HBF still needs to overcome challenges in advanced packaging processes and the inherent characteristics of NAND flash.

Key Highlights

  • Bottleneck: AI advancements shifted the bottleneck from compute power to memory capacity.
  • Hierarchy: Surging warm data demands tiered storage: HBM for hot data and HBF for warm, maximizing cost-efficiency.
  • HBF Hurdles: Commercialization requires overcoming advanced packaging and NAND flash limitations.
Product Code: TRi-182

Table of Contents

1. Development Bottlenecks of LLM: Impact on Computing Structures by Transformation of Model Architectures

  • Figure 1: Features of MoE
  • Figure 2: Deployment Strategies among AI Storage Vendors

2. From Computing Bottlenecks to Restructuring of Storage Layers

  • Figure 3: Hot, Warm, and Cold Architectures of Storage Layers
  • Figure 4: “H³” Architecture
  • Table 1: Comparison between HBM and HBF

3. TRI’s View

Have a question?
Picture

Jeroen Van Heghe

Manager - EMEA

+32-2-535-7543

Picture

Christine Sirois

Manager - Americas

+1-860-674-8796

Questions? Please give us a call or visit the contact form.
Hi, how can we help?
Contact us!