Gradient Descent in Neural Networks: Understanding How Machines Learn