Splitting a model's layers into consecutive stages, each placed on its own group of GPUs, so that the stages work on different micro-batches at the same time, like stations on an assembly line.
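The assembly-line picture can be made concrete with a small scheduling sketch. It is a simplified illustration, not any framework's implementation: it covers only the forward pass, assumes every stage takes one equal time step per micro-batch, and uses hypothetical names (`pipeline_schedule`, `idle_fraction`).

```python
def pipeline_schedule(num_stages, num_microbatches):
    """Build an idealized forward-pass schedule.

    Returns a list of time steps. Each time step is a list with one entry
    per stage: the index of the micro-batch that stage is processing, or
    None if the stage is idle at that step.
    """
    total_steps = num_stages + num_microbatches - 1
    schedule = []
    for t in range(total_steps):
        step = []
        for s in range(num_stages):
            # Stage s sees micro-batch m at time m + s, so at time t it
            # holds micro-batch t - s (if that index is valid).
            mb = t - s
            step.append(mb if 0 <= mb < num_microbatches else None)
        schedule.append(step)
    return schedule


def idle_fraction(schedule):
    """Fraction of (time step, stage) slots in which a stage is idle."""
    slots = [mb for step in schedule for mb in step]
    return slots.count(None) / len(slots)


# Example: 3 stages (GPU groups) and 4 micro-batches.
sched = pipeline_schedule(num_stages=3, num_microbatches=4)
for t, step in enumerate(sched):
    print(f"t={t}: {step}")
print(f"idle fraction: {idle_fraction(sched):.3f}")
```

In this idealized model the stages are all busy only in the middle of the run. The idle slots at the start and end shrink as a share of the total when the number of micro-batches grows relative to the number of stages.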