Inference efficiency expressed as tokens generated per second per watt of power drawn (equivalently, tokens per joule of energy), used to compare accelerators and fleets.
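As a rough illustration of the arithmetic, the sketch below divides measured throughput (tokens per second) by average power draw (watts). The function name `tokens_per_watt`, its parameters, and the two sets of measurements are hypothetical and chosen only for illustration; they do not come from any real benchmark.

```python
def tokens_per_watt(tokens: int, seconds: float, avg_power_watts: float) -> float:
    """Throughput (tokens/s) divided by average power (W).

    Numerically this equals tokens per joule, since 1 W = 1 J/s.
    """
    if seconds <= 0 or avg_power_watts <= 0:
        raise ValueError("seconds and avg_power_watts must be positive")
    return (tokens / seconds) / avg_power_watts


# Hypothetical measurements over a 60-second window (illustrative only).
accel_a = tokens_per_watt(tokens=1_200_000, seconds=60.0, avg_power_watts=400.0)
accel_b = tokens_per_watt(tokens=900_000, seconds=60.0, avg_power_watts=250.0)

print(f"A: {accel_a:.1f} tokens/s/W")  # A: 50.0 tokens/s/W
print(f"B: {accel_b:.1f} tokens/s/W")  # B: 60.0 tokens/s/W
```

In this made-up comparison, accelerator B produces fewer tokens per second than A but scores higher on the metric, which is why the ratio is useful when power is the binding constraint. The same function could be applied to a fleet by summing tokens and averaging total power over the same window.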