What is Stochastic Gradient Descent? Machine Learning from Scratch

Introduction In Previous post We discussed about gradient descent. The main problem with Batch Gradient Descent is the fact that it uses the whole training set to compute the gradients at every step, which makes it very slow when the training set is large. In this post, We will talk about Stochastic Gradient Descent. At … Continue reading What is Stochastic Gradient Descent? Machine Learning from Scratch