Finite difference rule and its difficulty
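f'(x) ≈ (f(x+h) - f(x))/h for small h, difficulty = numerical instability for very small values of h (f(x+h) and f(x) nearly cancel in floating point, and dividing by h amplifies the round-off error)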
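A minimal sketch of the difficulty noted above, assuming f = sin and x = 1 as an illustrative test case (neither is from the notes). The helper names `forward_difference` and `abs_error` are hypothetical. The error should shrink as h decreases at first, then grow again once h gets close to machine precision.

```python
import math

def forward_difference(f, x, h):
    """Approximate f'(x) with the forward difference (f(x+h) - f(x)) / h."""
    return (f(x + h) - f(x)) / h

def abs_error(h, x=1.0):
    """Absolute error of the forward difference for sin at x.

    The exact derivative of sin is cos, so the error can be measured directly.
    """
    return abs(forward_difference(math.sin, x, h) - math.cos(x))

if __name__ == "__main__":
    # Assumed step sizes, chosen to span the large-h and tiny-h regimes.
    for h in (1e-1, 1e-4, 1e-8, 1e-12, 1e-15):
        print(f"h = {h:.0e}   error = {abs_error(h):.3e}")
```

In double precision, a moderate step such as h = 1e-8 gives a much smaller error here than either h = 1e-1 (the approximation itself is coarse) or h = 1e-15 (round-off dominates).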
Taylor Expansion
Saddle points/increasing/decreasing functions
Gradient Descent Update
theta_t <- theta_{t-1} - lr * grad f(theta_{t-1}), where lr = learning rate (step size)
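The update rule above can be sketched as a short loop. The objective f(theta) = (theta - 3)^2, its gradient `grad_f`, the starting point, the learning rate, and the step count are all assumptions chosen for illustration. `gradient_descent` is a hypothetical helper, not a library function.

```python
def gradient_descent(grad, theta0, lr=0.1, num_steps=100):
    """Repeat theta_t <- theta_{t-1} - lr * grad(theta_{t-1}) for num_steps steps."""
    theta = theta0
    for _ in range(num_steps):
        theta = theta - lr * grad(theta)
    return theta

def grad_f(theta):
    """Gradient of the assumed objective f(theta) = (theta - 3)^2."""
    return 2.0 * (theta - 3.0)

if __name__ == "__main__":
    theta_min = gradient_descent(grad_f, theta0=0.0)
    print(theta_min)  # should be close to the minimizer theta = 3
```

For this quadratic, each step multiplies the distance to the minimizer by (1 - 2 * lr), so the iteration converges for 0 < lr < 1 and diverges when lr is larger than 1.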