Batch Size vs. Epochs
Batch Size:
How many training samples are used each time the model parameters are updated?
For example, the batch size of Stochastic Gradient Descent is one, while the batch size of (full) Gradient Descent equals the entire training set size. Mini-Batch Gradient Descent falls in between.
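To make the difference concrete, here is a minimal sketch (the training set size of 1,000 and the mini-batch size of 32 are hypothetical choices) showing how many parameter updates each variant performs per epoch:

```python
import math

# Hypothetical example: a training set of 1,000 samples.
n_samples = 1000

# batch_size controls how many samples feed each parameter update.
for name, batch_size in [
    ("Stochastic Gradient Descent", 1),        # one sample per update
    ("Mini-Batch Gradient Descent", 32),       # a small group per update
    ("(Full) Gradient Descent", n_samples),    # the whole set per update
]:
    updates_per_epoch = math.ceil(n_samples / batch_size)
    print(f"{name}: batch_size={batch_size} -> {updates_per_epoch} updates per epoch")
```

So a smaller batch size means more (noisier) updates per epoch, while a larger batch size means fewer (smoother) updates per epoch.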
Epochs:
When the ENTIRE training data set is used exactly ONCE, that is one epoch. The number of epochs tells you how many times the whole training data set is passed through the model.
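The two ideas fit together in a training loop: the inner loop walks through the data one batch at a time (one parameter update per batch), and the outer loop repeats that full pass once per epoch. Below is a minimal mini-batch SGD sketch for a one-parameter linear model; the data, learning rate, batch size, and epoch count are all hypothetical values chosen just for illustration:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy data: y = 3x + noise (all values hypothetical).
X = rng.normal(size=(1000, 1))
y = 3 * X[:, 0] + rng.normal(scale=0.1, size=1000)

w = 0.0          # single model parameter
lr = 0.1         # learning rate
batch_size = 32  # samples per parameter update
epochs = 5       # full passes over the training data

for epoch in range(epochs):
    # Shuffle so each epoch visits the batches in a new order.
    order = rng.permutation(len(X))
    for start in range(0, len(X), batch_size):
        idx = order[start:start + batch_size]
        xb, yb = X[idx, 0], y[idx]
        # One parameter update per batch (mean-squared-error gradient).
        grad = 2 * np.mean((w * xb - yb) * xb)
        w -= lr * grad
    print(f"epoch {epoch + 1}: w = {w:.3f}")
```

With batch_size = 32 and 1,000 samples, each epoch performs 32 parameter updates; running 5 epochs means the model sees the entire training set 5 times.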