MBU · Model Bandwidth Utilization
Achieved memory bandwidth over peak for memory-bound decode inference; the MBU is the MFU analog when HBM-bound.
Achieved memory bandwidth over peak for memory-bound decode inference; the MBU is the MFU analog when HBM-bound.