The Definitive Guide toAI Data Centers
Ask the Guide
GuideGlossaryMBU

MBU · Model Bandwidth Utilization

Achieved memory bandwidth over peak for memory-bound decode inference; the MBU is the MFU analog when HBM-bound.

← All terms