6 Optimization Techniques for Neural Networks (test)
⇦ Back to lesson
Question 2
How does stochastic gradient descent (SGD) differ from traditional gradient descent in terms of updating model parameters?