Applied Computational Intelligence and Soft Computing

Research Article

Pipelined Training with Stale Weights in Deep Convolutional Neural Networks

Pipelined stochastic gradient descent (PPL-SGD).

	Initialize weights
	Given learning rate sequence
	for … to … do
		for … to … in parallel do
			Compute gradient using stale weights:

			Update weights:

		end for
	end for
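
The loop structure of the listing above can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `ppl_sgd`, the callback `grad_fn`, the per-partition `delays` list, and the toy quadratic objective are all hypothetical choices made for illustration. The sketch assumes that partition i computes its gradient from a snapshot of the weights that is `delays[i]` iterations old, and that the inner loop, which would run in parallel on a real pipeline, can be simulated sequentially.

```python
from collections import deque


def ppl_sgd(grad_fn, init_weights, delays, learning_rates):
    """Simulate SGD in which partition i reads weights delays[i] steps old.

    grad_fn(weights, i) returns the gradient for partition i evaluated at
    the given (possibly stale) weight snapshot. All names are illustrative.
    """
    weights = list(init_weights)
    # history[0] is the newest snapshot; history[k] is k iterations old.
    history = deque([list(weights)], maxlen=max(delays) + 1)

    for lr in learning_rates:              # outer loop over iterations
        new_weights = list(weights)
        for i, delay in enumerate(delays):  # conceptually "in parallel"
            # Early on, fall back to the oldest snapshot available.
            stale = history[min(delay, len(history) - 1)]
            # Compute gradient using stale weights.
            grad_i = grad_fn(stale, i)
            # Update the current weights of partition i.
            new_weights[i] = weights[i] - lr * grad_i
        weights = new_weights
        history.appendleft(list(weights))  # oldest snapshot drops off
    return weights


# Toy usage: separable quadratic 0.5 * sum((w_i - target_i)^2),
# with earlier partitions assumed to see staler weights.
targets = [1.0, -2.0, 0.5]
final = ppl_sgd(
    grad_fn=lambda w, i: w[i] - targets[i],
    init_weights=[0.0, 0.0, 0.0],
    delays=[2, 1, 0],
    learning_rates=[0.1] * 200,
)
print(final)
```

Each partition in this toy problem still converges to its target because the learning rate is small relative to the delay. Larger delays or learning rates can make the same recurrence diverge, which is why the degree of staleness matters.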