What is linear regression?
What is the most popular loss function in regression?
The squared error
what is gradient descent?
what is used to iterate through the dataset using PyTorch?
a data loader
why are is the gradient stored with the weights when using PyTorch?
so they can be easily accessed and used to update the weights after calculating the loss