-AI > Hessian Approximation > Flashcards
So if I had a large language model with 1 trillion parameters, how many entries would the Hessian have?
1 trillion by 1 trillion = 10^24 = one septillion
How much memory would you need to store 1 septillion 2byte params?
2 yottabytes