Model FLOPs Utilization · MFU
How much of a chip's theoretical compute a training run actually uses; the headline training-efficiency metric.
How much of a chip's theoretical compute a training run actually uses; the headline training-efficiency metric.